Message boards : Server programs : Scheduler request completed: got 0 new tasks
Author | Message |
---|---|
Send message Joined: 25 Oct 13 Posts: 14 |
The project the client is currently attaching to is "upper", however from the log it is also observed that it picks up older projects namely "mci" and "uc" while processing. Not sure why this is happening. 15-Nov-2013 15:20:16 [---] Starting BOINC client version 7.1.0 for x86_64-pc-linux-gnu 15-Nov-2013 15:20:16 [---] This a development version of BOINC and may not function properly 15-Nov-2013 15:20:16 [---] log flags: file_xfer, sched_ops, task, file_xfer_debug, sched_op_debug, task_debug 15-Nov-2013 15:20:16 [---] log flags: work_fetch_debug 15-Nov-2013 15:20:16 [---] Libraries: libcurl/7.22.0 GnuTLS/2.12.14 zlib/1.2.3.4 libidn/1.23 librtmp/2.3 15-Nov-2013 15:20:16 [---] Data directory: /home/shruti/boinc-new/client 15-Nov-2013 15:20:16 [---] Processor: 32 GenuineIntel Intel(R) Xeon(R) CPU E5-2450 0 @ 2.10GHz [Family 6 Model 45 Stepping 7] 15-Nov-2013 15:20:16 [---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic popcnt tsc_deadline_timer aes xsave avx lahf_lm ida arat xsaveopt pln pts dtherm tpr_shadow vnmi flexpriority ept vpid 15-Nov-2013 15:20:16 [---] OS: Linux: 3.2.0-51-generic 15-Nov-2013 15:20:16 [---] Memory: 31.38 GB physical, 57.90 GB virtual 15-Nov-2013 15:20:16 [---] Disk: 1.77 TB total, 1.66 TB free 15-Nov-2013 15:20:16 [---] Local time is UTC +5 hours 15-Nov-2013 15:20:16 [---] No usable GPUs found 15-Nov-2013 15:20:16 [---] A new version of BOINC is available. 
(7.2.28) <a href=http://boinc.berkeley.edu/download.php>Download</a> 15-Nov-2013 15:20:16 [uc] URL http://13.XX.XX.XX/uc/; Computer ID 5; resource share 100 15-Nov-2013 15:20:16 [http://http:/13.XX.XX.XX/mci/] URL http://http:/13.XX.XX.XX/mci/; Computer ID not assigned yet; resource share 100 15-Nov-2013 15:20:16 [http://13.XX.XX.XX/upper/] URL http://13.XX.XX.XX/upper/; Computer ID not assigned yet; resource share 100 15-Nov-2013 15:20:16 [---] No general preferences found - using defaults 15-Nov-2013 15:20:16 [---] Preferences: 15-Nov-2013 15:20:16 [---] max memory usage when active: 16065.26MB 15-Nov-2013 15:20:16 [---] max memory usage when idle: 28917.46MB 15-Nov-2013 15:20:16 [---] max disk usage: 1000.00GB 15-Nov-2013 15:20:16 [---] don't use GPU while active 15-Nov-2013 15:20:16 [---] suspend work if non-BOINC CPU load exceeds 25 % 15-Nov-2013 15:20:16 [---] (to change preferences, visit a project web site or select Preferences in the Manager) 15-Nov-2013 15:20:16 [---] [work_fetch] Request work fetch: Prefs update 15-Nov-2013 15:20:16 [---] [work_fetch] Request work fetch: Startup 15-Nov-2013 15:20:16 [---] Not using a proxy Initialization completed 15-Nov-2013 15:20:17 [---] [work_fetch] work fetch start 15-Nov-2013 15:20:17 [---] [work_fetch] choose_project() for CPU: buffer_low: yes; sim_excluded_instances 0 15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [work_fetch] set_request() for CPU: ninst 32 nused_total 0.000000 nidle_now 32.000000 fetch share 0.500000 req_inst 32.000000 req_secs 1658880.000000 15-Nov-2013 15:20:17 [---] [work_fetch] ------- start work fetch state ------- 15-Nov-2013 15:20:17 [---] [work_fetch] target work buffer: 8640.00 + 43200.00 sec 15-Nov-2013 15:20:17 [---] [work_fetch] --- project states --- 15-Nov-2013 15:20:17 [uc] [work_fetch] REC 0.000 prio -0.000000 can req work 15-Nov-2013 15:20:17 [http://http:/13.XX.XX.XX/mci/] [work_fetch] REC 0.000 prio -0.000000 can't req work: master URL fetch pending (backoff: 83810.34 sec) 
15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [work_fetch] REC 0.000 prio -0.000000 can req work 15-Nov-2013 15:20:17 [---] [work_fetch] --- state for CPU --- 15-Nov-2013 15:20:17 [---] [work_fetch] shortfall 1658880.00 nidle 32.00 saturated 0.00 busy 0.00 15-Nov-2013 15:20:17 [uc] [work_fetch] REC 0.000 prio -0.000000 can req work 15-Nov-2013 15:20:17 [http://http:/13.XX.XX.XX/mci/] [work_fetch] REC 0.000 prio -0.000000 can't req work: master URL fetch pending (backoff: 83810.34 sec) 15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [work_fetch] REC 0.000 prio -0.000000 can req work 15-Nov-2013 15:20:17 [---] [work_fetch] --- state for CPU --- 15-Nov-2013 15:20:17 [---] [work_fetch] shortfall 1658880.00 nidle 32.00 saturated 0.00 busy 0.00 15-Nov-2013 15:20:17 [uc] [work_fetch] fetch share 0.500 15-Nov-2013 15:20:17 [http://http:/13.XX.XX.XX/mci/] [work_fetch] fetch share 0.000 15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [work_fetch] fetch share 0.500 15-Nov-2013 15:20:17 [---] [work_fetch] ------- end work fetch state ------- 15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [sched_op] Starting scheduler request 15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [sched_op] Fetching master file 15-Nov-2013 15:20:56 [http://13.XX.XX.XX/upper/] [sched_op] Got master file; parsing 15-Nov-2013 15:20:56 [http://13.XX.XX.XX/upper/] [sched_op] Found 1 scheduler URLs in master file 15-Nov-2013 15:20:56 [http://13.XX.XX.XX/upper/] Master file download succeeded 15-Nov-2013 15:20:56 [---] [work_fetch] Request work fetch: Master fetch complete 15-Nov-2013 15:21:01 [---] [work_fetch] work fetch start 15-Nov-2013 15:21:01 [---] [work_fetch] choose_project() for CPU: buffer_low: yes; sim_excluded_instances 0 15-Nov-2013 15:21:01 [http://13.XX.XX.XX/upper/] [work_fetch] set_request() for CPU: ninst 32 nused_total 0.000000 nidle_now 32.000000 fetch share 0.500000 req_inst 32.000000 req_secs 1658880.000000 15-Nov-2013 15:21:01 [---] [work_fetch] ------- start work fetch state 
------- 15-Nov-2013 15:21:01 [---] [work_fetch] target work buffer: 8640.00 + 43200.00 sec 15-Nov-2013 15:21:01 [---] [work_fetch] --- project states --- 15-Nov-2013 15:21:01 [uc] [work_fetch] REC 0.000 prio -0.000000 can req work |
Send message Joined: 4 Jul 12 Posts: 321 |
You have a funky setup. The IP you see in scheduler.log is the client's, so it seems your client has several IPs. Could you also please use a standard BOINC client rather than a self-compiled one? It seems you have an older version of the source code, so please update the client to 7.2.28 (official download) and try again. To see which projects the client is actually attached to, you can use the BOINC Manager GUI, which can also control remote instances of the client. You should also set a sched_debug_level of 3 to get the debug output in scheduler.log. Regards, Christian |
Send message Joined: 29 Aug 05 Posts: 15565 |
In addition to what Christian said, could you also tell us what name you gave to your science application(s)? |
Send message Joined: 25 Oct 13 Posts: 14 |
I installed a new client (this time on a windows 7 box). Previously it was a self-compiled client on a ubuntu box. I still get the same error - 0 tasks found. Here is the event log: 11/20/13 16:39:53 | | cc_config.xml not found - using defaults 11/20/13 16:39:54 | | Starting BOINC client version 7.2.28 for windows_intelx86 11/20/13 16:39:54 | | log flags: file_xfer, sched_ops, task 11/20/13 16:39:54 | | Libraries: libcurl/7.25.0 OpenSSL/1.0.1 zlib/1.2.6 11/20/13 16:39:54 | | Data directory: C:\ProgramData\BOINC 11/20/13 16:39:54 | | Running under account x3Q3YYG9 11/20/13 16:39:54 | | OpenCL: Intel GPU 0: Intel(R) HD Graphics 4000 (driver version 8.15.10.2639, device version OpenCL 1.1, 1482MB, 1482MB available, 45 GFLOPS peak) 11/20/13 16:39:54 | | Creating new client state file 11/20/13 16:39:54 | | Host name: BLR201555 11/20/13 16:39:54 | | Processor: 4 GenuineIntel Intel(R) Core(TM) i5-3210M CPU @ 2.50GHz [Family 6 Model 58 Stepping 9] 11/20/13 16:39:54 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 popcnt aes nx lm vmx tm2 pbe 11/20/13 16:39:54 | | OS: Microsoft Windows 7: Professional x86 Edition, Service Pack 1, (06.01.7601.00) 11/20/13 16:39:54 | | Memory: 3.41 GB physical, 6.82 GB virtual 11/20/13 16:39:54 | | Disk: 148.66 GB total, 107.32 GB free 11/20/13 16:39:54 | | Local time is UTC +5 hours 11/20/13 16:39:54 | | No general preferences found - using defaults 11/20/13 16:39:54 | | Preferences: 11/20/13 16:39:54 | | max memory usage when active: 1746.29MB 11/20/13 16:39:54 | | max memory usage when idle: 3143.32MB 11/20/13 16:39:54 | | max disk usage: 107.22GB 11/20/13 16:39:54 | | don't use GPU while active 11/20/13 16:39:54 | | suspend work if non-BOINC CPU load exceeds 25% 11/20/13 16:39:54 | | (to change preferences, visit a project web site or select Preferences in the Manager) 11/20/13 16:39:54 | | Not using a proxy 11/20/13 
16:39:54 | | This computer is not attached to any projects 11/20/13 16:39:54 | | Visit http://boinc.berkeley.edu for instructions 11/20/13 16:39:55 | | Suspending GPU computation - computer is in use 11/20/13 16:43:25 | | Resuming GPU computation 11/20/13 16:44:42 | | Suspending GPU computation - computer is in use 11/20/13 17:06:26 | | Suspending computation - CPU is busy 11/20/13 17:06:36 | | Resuming computation 11/20/13 18:23:54 | | Fetching configuration file from http://bam.boincstats.com/get_project_config.php 11/20/13 18:24:54 | | Fetching configuration file from http://13.218.150.14/upper/get_project_config.php 11/20/13 18:25:22 | | Running CPU benchmarks 11/20/13 18:25:22 | | Suspending computation - CPU benchmarks in progress 11/20/13 18:25:53 | | Benchmark results: 11/20/13 18:25:53 | | Number of CPUs: 4 11/20/13 18:25:53 | | 2717 floating point MIPS (Whetstone) per CPU 11/20/13 18:25:53 | | 7167 integer MIPS (Dhrystone) per CPU 11/20/13 18:25:53 | | Resuming computation 11/20/13 18:25:55 | upper | Master file download succeeded 11/20/13 18:26:00 | upper | Sending scheduler request: Project initialization. 11/20/13 18:26:00 | upper | Requesting new tasks for CPU and intel_gpu 11/20/13 18:26:02 | upper | Scheduler request completed: got 0 new tasks 11/20/13 18:26:02 | upper | No tasks sent 11/20/13 18:26:12 | upper | Sending scheduler request: To fetch work. 11/20/13 18:26:12 | upper | Requesting new tasks for CPU and intel_gpu 11/20/13 18:26:16 | upper | Scheduler request completed: got 0 new tasks 11/20/13 18:26:16 | upper | No tasks sent 11/20/13 18:28:11 | upper | update requested by user 11/20/13 18:28:16 | upper | Sending scheduler request: Requested by user. 11/20/13 18:28:16 | upper | Requesting new tasks for CPU and intel_gpu 11/20/13 18:28:19 | upper | Scheduler request completed: got 0 new tasks 11/20/13 18:28:19 | upper | No tasks sent 11/20/13 18:28:29 | upper | Sending scheduler request: To fetch work. 
11/20/13 18:28:29 | upper | Requesting new tasks for CPU and intel_gpu 11/20/13 18:28:32 | upper | Scheduler request completed: got 0 new tasks 11/20/13 18:28:32 | upper | No tasks sent 11/20/13 18:38:42 | upper | Sending scheduler request: To fetch work. 11/20/13 18:38:42 | upper | Requesting new tasks for CPU and intel_gpu 11/20/13 18:38:46 | upper | Scheduler request completed: got 0 new tasks 11/20/13 18:38:46 | upper | No tasks sent 11/20/13 18:39:06 | | Suspending computation - user request 11/20/13 18:58:54 | upper | Resetting project 11/20/13 19:04:27 | upper | update requested by user 11/20/13 19:04:31 | upper | Master file download succeeded 11/20/13 19:04:36 | upper | Sending scheduler request: Requested by user. 11/20/13 19:04:36 | upper | Requesting new tasks for CPU and intel_gpu 11/20/13 19:04:39 | upper | Scheduler request completed: got 0 new tasks 11/20/13 19:04:39 | upper | No tasks sent 11/20/13 19:39:05 | | Resuming computation 11/20/13 19:39:50 | upper | Sending scheduler request: To fetch work. 11/20/13 19:39:50 | upper | Requesting new tasks for CPU and intel_gpu 11/20/13 19:39:52 | upper | Scheduler request completed: got 0 new tasks 11/20/13 19:39:52 | upper | No tasks sent |
Send message Joined: 4 Jul 12 Posts: 321 |
The reason why no task was sent can only be found in scheduler.log, and only if you have increased the log level and restarted the project. Are you sure there are tasks available? On the server you also have the tool "./bin/show_shmem" to see what the scheduler knows about. P.S.: I ran a whois on the server IP in the log. Is this an in-house project for the company that owns this subnet? |
Send message Joined: 25 Oct 13 Posts: 14 |
The debug level has been set to 3 (right from the beginning). And this is the only information I have from scheduler.log: 2013-11-20 19:04:32.7363 [PID=10396] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28 2013-11-20 19:04:32.7941 [PID=10396] Sending reply to [HOST#2]: 0 results, delay req 7.00 2013-11-20 19:04:32.7942 [PID=10396] Scheduler ran 0.060 seconds 2013-11-20 19:39:46.2352 [PID=11786] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28 2013-11-20 19:39:46.3377 [PID=11786] Sending reply to [HOST#2]: 0 results, delay req 7.00 2013-11-20 19:39:46.3378 [PID=11786] Scheduler ran 0.106 seconds 2013-11-20 19:47:33.7966 [PID=12143] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28 2013-11-20 19:47:33.8446 [PID=12143] Sending reply to [HOST#2]: 0 results, delay req 7.00 2013-11-20 19:47:33.8446 [PID=12143] Scheduler ran 0.052 seconds 2013-11-20 19:47:47.5364 [PID=12144] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28 2013-11-20 19:47:47.6150 [PID=12144] Sending reply to [HOST#2]: 0 results, delay req 7.00 2013-11-20 19:47:47.6151 [PID=12144] Scheduler ran 0.082 seconds 2013-11-20 19:53:00.6629 [PID=12361] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28 2013-11-20 19:53:00.7505 [PID=12361] Sending reply to [HOST#2]: 0 results, delay req 7.00 2013-11-20 19:53:00.7506 [PID=12361] Scheduler ran 0.091 seconds 2013-11-20 19:55:14.0748 [PID=12486] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28 2013-11-20 19:55:14.1584 [PID=12486] Sending reply to [HOST#2]: 0 results, delay req 7.00 2013-11-20 19:55:14.1584 [PID=12486] Scheduler ran 0.087 seconds 2013-11-20 20:12:28.4779 [PID=13114] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28 2013-11-20 20:12:28.6164 [PID=13114] Sending reply to [HOST#2]: 0 results, delay req 7.00 2013-11-20 20:12:28.6164 [PID=13114] Scheduler ran 0.142 seconds 2013-11-20 20:21:41.8161 [PID=13473] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28 2013-11-20 20:21:41.8996 
[PID=13473] Sending reply to [HOST#2]: 0 results, delay req 7.00 2013-11-20 20:21:41.8997 [PID=13473] Scheduler ran 0.087 seconds Also here is the info upon running this command ./bin/show_shmem (It shows the workunit). appid: 4 platformid: 3 version_num: 100 plan_class: have CPU apps: yes have NVIDIA GPU apps: no have AMD/ATI GPU apps: no have Intel GPU apps: no Jobs; key: ap: app ID ic: infeasible count wu: workunit ID rs: result ID hr: HR class nr: need reliable host fpops mean 2200000000.000000 stddev 700000000.000000 host fpops 50th pctile 3300000000.000000 95th pctile 3300000000.000000 ready: 1 max_wu_results: 100 slot app WU ID result ID batch HR class priority in shmem size (stdev) need reliable inf count 0 uppercase 1 1 0 0 0 453554s 0.000000 no 0 1: --- |
Send message Joined: 4 Jul 12 Posts: 321 |
I have no further ideas. I will contact the main developer and ask if he knows another way to debug work sending. |
Send message Joined: 25 Oct 13 Posts: 14 |
Thanks. Let me know if there are any additional inputs from the main developer. |
Send message Joined: 20 Nov 12 Posts: 801 |
I played around a little with my test server. Even with debug level 3, not much debugging information is written to the log. You really need <debug_send> and maybe <debug_version_select> and <debug_quota> too to get any useful information. |
Send message Joined: 18 Dec 13 Posts: 8 |
I have the same problem: I get no tasks :) The work units are there and the job files are created, so I don't know what the problem is :(( I have the same output as shruti. |
Send message Joined: 4 Jul 12 Posts: 321 |
I recently also had the problem with unsent tasks and solved it by enabling the <debug_array/> logging for the scheduler. For everyone who also suffers from this problem, please enable the following options in your config.xml: <debug_send/> <debug_version_select/> <debug_array/> <sched_debug_level> 3 </sched_debug_level> Then restart the project (bin/stop; bin/start), update the client, and post the part of the scheduler.log that corresponds to this RPC. The best way I found is to use grep as follows: cat scheduler.log | grep HOST#2354 This will give you the latest RPC of this host (change to suit your host). Extract the PID and run another grep: cat scheduler.log | grep PID=26407 This will give you the whole request and its messages. |
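
For anyone who prefers to do the two grep steps in one go, here is a rough Python sketch of the same idea. It assumes the scheduler.log line format shown earlier in this thread (a `[PID=...]` tag on every line and a `[HOST#...]` tag on the request and reply lines). The function names are made up for illustration and are not part of BOINC.

```python
import re
from typing import List, Optional

# Matches the "[PID=10396]" tag seen on scheduler.log lines in this thread.
PID_RE = re.compile(r"\[PID=(\d+)\]")


def latest_pid_for_host(lines: List[str], host_id: int) -> Optional[str]:
    """Return the PID on the last line that mentions [HOST#<host_id>]."""
    tag = f"[HOST#{host_id}]"  # closing bracket keeps HOST#2 from matching HOST#2354
    pid = None
    for line in lines:
        if tag in line:
            match = PID_RE.search(line)
            if match:
                pid = match.group(1)  # keep overwriting; the last hit wins
    return pid


def latest_rpc(lines: List[str], host_id: int) -> List[str]:
    """Return every line carrying the PID of the host's most recent request."""
    pid = latest_pid_for_host(lines, host_id)
    if pid is None:
        return []
    tag = f"[PID={pid}]"
    return [line for line in lines if tag in line]


def latest_rpc_from_file(path: str, host_id: int) -> List[str]:
    """Same as latest_rpc, but reads the log from a file first."""
    with open(path, encoding="utf-8", errors="replace") as handle:
        return latest_rpc(handle.read().splitlines(), host_id)
```

Calling `latest_rpc_from_file("scheduler.log", 2354)` should give roughly what the second grep gives. Like the grep, it selects by PID only, so if the operating system has reused that PID, lines from an older request may show up as well.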
Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software Foundation.