Thread 'Scheduler request completed: got 0 new tasks'

Message boards : Server programs : Scheduler request completed: got 0 new tasks
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
shruti

Send message
Joined: 25 Oct 13
Posts: 14
United States
Message 51329 - Posted: 15 Nov 2013, 9:58:55 UTC - in response to Message 51328.  

The project the client is currently attaching to is "upper", however from the log it is also observed that it picks up older projects namely "mci" and "uc" while processing. Not sure why this is happening.

15-Nov-2013 15:20:16 [---] Starting BOINC client version 7.1.0 for x86_64-pc-linux-gnu
15-Nov-2013 15:20:16 [---] This a development version of BOINC and may not function properly
15-Nov-2013 15:20:16 [---] log flags: file_xfer, sched_ops, task, file_xfer_debug, sched_op_debug, task_debug
15-Nov-2013 15:20:16 [---] log flags: work_fetch_debug
15-Nov-2013 15:20:16 [---] Libraries: libcurl/7.22.0 GnuTLS/2.12.14 zlib/1.2.3.4 libidn/1.23 librtmp/2.3
15-Nov-2013 15:20:16 [---] Data directory: /home/shruti/boinc-new/client
15-Nov-2013 15:20:16 [---] Processor: 32 GenuineIntel Intel(R) Xeon(R) CPU E5-2450 0 @ 2.10GHz [Family 6 Model 45 Stepping 7]
15-Nov-2013 15:20:16 [---] Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic popcnt tsc_deadline_timer aes xsave avx lahf_lm ida arat xsaveopt pln pts dtherm tpr_shadow vnmi flexpriority ept vpid
15-Nov-2013 15:20:16 [---] OS: Linux: 3.2.0-51-generic
15-Nov-2013 15:20:16 [---] Memory: 31.38 GB physical, 57.90 GB virtual
15-Nov-2013 15:20:16 [---] Disk: 1.77 TB total, 1.66 TB free
15-Nov-2013 15:20:16 [---] Local time is UTC +5 hours
15-Nov-2013 15:20:16 [---] No usable GPUs found
15-Nov-2013 15:20:16 [---] A new version of BOINC is available. (7.2.28) <a href=http://boinc.berkeley.edu/download.php>Download</a>
15-Nov-2013 15:20:16 [uc] URL http://13.XX.XX.XX/uc/; Computer ID 5; resource share 100
15-Nov-2013 15:20:16 [http://http:/13.XX.XX.XX/mci/] URL http://http:/13.XX.XX.XX/mci/; Computer ID not assigned yet; resource share 100
15-Nov-2013 15:20:16 [http://13.XX.XX.XX/upper/] URL http://13.XX.XX.XX/upper/; Computer ID not assigned yet; resource share 100
15-Nov-2013 15:20:16 [---] No general preferences found - using defaults
15-Nov-2013 15:20:16 [---] Preferences:
15-Nov-2013 15:20:16 [---] max memory usage when active: 16065.26MB
15-Nov-2013 15:20:16 [---] max memory usage when idle: 28917.46MB
15-Nov-2013 15:20:16 [---] max disk usage: 1000.00GB
15-Nov-2013 15:20:16 [---] don't use GPU while active
15-Nov-2013 15:20:16 [---] suspend work if non-BOINC CPU load exceeds 25 %
15-Nov-2013 15:20:16 [---] (to change preferences, visit a project web site or select Preferences in the Manager)
15-Nov-2013 15:20:16 [---] [work_fetch] Request work fetch: Prefs update
15-Nov-2013 15:20:16 [---] [work_fetch] Request work fetch: Startup
15-Nov-2013 15:20:16 [---] Not using a proxy
Initialization completed
15-Nov-2013 15:20:17 [---] [work_fetch] work fetch start
15-Nov-2013 15:20:17 [---] [work_fetch] choose_project() for CPU: buffer_low: yes; sim_excluded_instances 0
15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [work_fetch] set_request() for CPU: ninst 32 nused_total 0.000000 nidle_now 32.000000 fetch share 0.500000 req_inst 32.000000 req_secs 1658880.000000
15-Nov-2013 15:20:17 [---] [work_fetch] ------- start work fetch state -------
15-Nov-2013 15:20:17 [---] [work_fetch] target work buffer: 8640.00 + 43200.00 sec
15-Nov-2013 15:20:17 [---] [work_fetch] --- project states ---
15-Nov-2013 15:20:17 [uc] [work_fetch] REC 0.000 prio -0.000000 can req work
15-Nov-2013 15:20:17 [http://http:/13.XX.XX.XX/mci/] [work_fetch] REC 0.000 prio -0.000000 can't req work: master URL fetch pending (backoff: 83810.34 sec)
15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [work_fetch] REC 0.000 prio -0.000000 can req work
15-Nov-2013 15:20:17 [---] [work_fetch] --- state for CPU ---
15-Nov-2013 15:20:17 [---] [work_fetch] shortfall 1658880.00 nidle 32.00 saturated 0.00 busy 0.00
15-Nov-2013 15:20:17 [uc] [work_fetch] REC 0.000 prio -0.000000 can req work
15-Nov-2013 15:20:17 [http://http:/13.XX.XX.XX/mci/] [work_fetch] REC 0.000 prio -0.000000 can't req work: master URL fetch pending (backoff: 83810.34 sec)
15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [work_fetch] REC 0.000 prio -0.000000 can req work
15-Nov-2013 15:20:17 [---] [work_fetch] --- state for CPU ---
15-Nov-2013 15:20:17 [---] [work_fetch] shortfall 1658880.00 nidle 32.00 saturated 0.00 busy 0.00
15-Nov-2013 15:20:17 [uc] [work_fetch] fetch share 0.500
15-Nov-2013 15:20:17 [http://http:/13.XX.XX.XX/mci/] [work_fetch] fetch share 0.000
15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [work_fetch] fetch share 0.500
15-Nov-2013 15:20:17 [---] [work_fetch] ------- end work fetch state -------
15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [sched_op] Starting scheduler request
15-Nov-2013 15:20:17 [http://13.XX.XX.XX/upper/] [sched_op] Fetching master file
15-Nov-2013 15:20:56 [http://13.XX.XX.XX/upper/] [sched_op] Got master file; parsing
15-Nov-2013 15:20:56 [http://13.XX.XX.XX/upper/] [sched_op] Found 1 scheduler URLs in master file
15-Nov-2013 15:20:56 [http://13.XX.XX.XX/upper/] Master file download succeeded
15-Nov-2013 15:20:56 [---] [work_fetch] Request work fetch: Master fetch complete
15-Nov-2013 15:21:01 [---] [work_fetch] work fetch start
15-Nov-2013 15:21:01 [---] [work_fetch] choose_project() for CPU: buffer_low: yes; sim_excluded_instances 0
15-Nov-2013 15:21:01 [http://13.XX.XX.XX/upper/] [work_fetch] set_request() for CPU: ninst 32 nused_total 0.000000 nidle_now 32.000000 fetch share 0.500000 req_inst 32.000000 req_secs 1658880.000000
15-Nov-2013 15:21:01 [---] [work_fetch] ------- start work fetch state -------
15-Nov-2013 15:21:01 [---] [work_fetch] target work buffer: 8640.00 + 43200.00 sec
15-Nov-2013 15:21:01 [---] [work_fetch] --- project states ---
15-Nov-2013 15:21:01 [uc] [work_fetch] REC 0.000 prio -0.000000 can req work
ID: 51329 · Report as offensive
ChristianB
Volunteer developer
Volunteer tester

Send message
Joined: 4 Jul 12
Posts: 321
Germany
Message 51330 - Posted: 15 Nov 2013, 12:50:18 UTC

You have a funky setup. The IP you see in the scheduler.log is that from the client. So it seems your client has several IP's. Could you also please use a standard BOINC Client and not a self compiled one. It seems you have an older
version of the source code so please update the client to 7.2.28 (official download) and try again. To see what projects the Client is actually attached to you can use the BOINC Manager GUI that can also control remote instances of the Client.

You should also set a sched_debug_level of 3 to get the debug output in scheduler.log

Regards
Christian
ID: 51330 · Report as offensive
ProfileJord
Volunteer tester
Help desk expert
Avatar

Send message
Joined: 29 Aug 05
Posts: 15565
Netherlands
Message 51335 - Posted: 15 Nov 2013, 20:23:40 UTC

And added to what Christian said, could you also tell us what name you gave to your science application(s)?
ID: 51335 · Report as offensive
shruti

Send message
Joined: 25 Oct 13
Posts: 14
United States
Message 51386 - Posted: 20 Nov 2013, 14:15:02 UTC - in response to Message 51335.  

I installed a new client (this time on a windows 7 box). Previously it was a self-compiled client on a ubuntu box.
I still get the same error - 0 tasks found.
Here is the event log:
11/20/13 16:39:53 | | cc_config.xml not found - using defaults
11/20/13 16:39:54 | | Starting BOINC client version 7.2.28 for windows_intelx86
11/20/13 16:39:54 | | log flags: file_xfer, sched_ops, task
11/20/13 16:39:54 | | Libraries: libcurl/7.25.0 OpenSSL/1.0.1 zlib/1.2.6
11/20/13 16:39:54 | | Data directory: C:\ProgramData\BOINC
11/20/13 16:39:54 | | Running under account x3Q3YYG9
11/20/13 16:39:54 | | OpenCL: Intel GPU 0: Intel(R) HD Graphics 4000 (driver version 8.15.10.2639, device version OpenCL 1.1, 1482MB, 1482MB available, 45 GFLOPS peak)
11/20/13 16:39:54 | | Creating new client state file
11/20/13 16:39:54 | | Host name: BLR201555
11/20/13 16:39:54 | | Processor: 4 GenuineIntel Intel(R) Core(TM) i5-3210M CPU @ 2.50GHz [Family 6 Model 58 Stepping 9]
11/20/13 16:39:54 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 popcnt aes nx lm vmx tm2 pbe
11/20/13 16:39:54 | | OS: Microsoft Windows 7: Professional x86 Edition, Service Pack 1, (06.01.7601.00)
11/20/13 16:39:54 | | Memory: 3.41 GB physical, 6.82 GB virtual
11/20/13 16:39:54 | | Disk: 148.66 GB total, 107.32 GB free
11/20/13 16:39:54 | | Local time is UTC +5 hours
11/20/13 16:39:54 | | No general preferences found - using defaults
11/20/13 16:39:54 | | Preferences:
11/20/13 16:39:54 | | max memory usage when active: 1746.29MB
11/20/13 16:39:54 | | max memory usage when idle: 3143.32MB
11/20/13 16:39:54 | | max disk usage: 107.22GB
11/20/13 16:39:54 | | don't use GPU while active
11/20/13 16:39:54 | | suspend work if non-BOINC CPU load exceeds 25%
11/20/13 16:39:54 | | (to change preferences, visit a project web site or select Preferences in the Manager)
11/20/13 16:39:54 | | Not using a proxy
11/20/13 16:39:54 | | This computer is not attached to any projects
11/20/13 16:39:54 | | Visit http://boinc.berkeley.edu for instructions
11/20/13 16:39:55 | | Suspending GPU computation - computer is in use
11/20/13 16:43:25 | | Resuming GPU computation
11/20/13 16:44:42 | | Suspending GPU computation - computer is in use
11/20/13 17:06:26 | | Suspending computation - CPU is busy
11/20/13 17:06:36 | | Resuming computation
11/20/13 18:23:54 | | Fetching configuration file from http://bam.boincstats.com/get_project_config.php
11/20/13 18:24:54 | | Fetching configuration file from http://13.218.150.14/upper/get_project_config.php
11/20/13 18:25:22 | | Running CPU benchmarks
11/20/13 18:25:22 | | Suspending computation - CPU benchmarks in progress
11/20/13 18:25:53 | | Benchmark results:
11/20/13 18:25:53 | | Number of CPUs: 4
11/20/13 18:25:53 | | 2717 floating point MIPS (Whetstone) per CPU
11/20/13 18:25:53 | | 7167 integer MIPS (Dhrystone) per CPU
11/20/13 18:25:53 | | Resuming computation
11/20/13 18:25:55 | upper | Master file download succeeded
11/20/13 18:26:00 | upper | Sending scheduler request: Project initialization.
11/20/13 18:26:00 | upper | Requesting new tasks for CPU and intel_gpu
11/20/13 18:26:02 | upper | Scheduler request completed: got 0 new tasks
11/20/13 18:26:02 | upper | No tasks sent
11/20/13 18:26:12 | upper | Sending scheduler request: To fetch work.
11/20/13 18:26:12 | upper | Requesting new tasks for CPU and intel_gpu
11/20/13 18:26:16 | upper | Scheduler request completed: got 0 new tasks
11/20/13 18:26:16 | upper | No tasks sent
11/20/13 18:28:11 | upper | update requested by user
11/20/13 18:28:16 | upper | Sending scheduler request: Requested by user.
11/20/13 18:28:16 | upper | Requesting new tasks for CPU and intel_gpu
11/20/13 18:28:19 | upper | Scheduler request completed: got 0 new tasks
11/20/13 18:28:19 | upper | No tasks sent
11/20/13 18:28:29 | upper | Sending scheduler request: To fetch work.
11/20/13 18:28:29 | upper | Requesting new tasks for CPU and intel_gpu
11/20/13 18:28:32 | upper | Scheduler request completed: got 0 new tasks
11/20/13 18:28:32 | upper | No tasks sent
11/20/13 18:38:42 | upper | Sending scheduler request: To fetch work.
11/20/13 18:38:42 | upper | Requesting new tasks for CPU and intel_gpu
11/20/13 18:38:46 | upper | Scheduler request completed: got 0 new tasks
11/20/13 18:38:46 | upper | No tasks sent
11/20/13 18:39:06 | | Suspending computation - user request
11/20/13 18:58:54 | upper | Resetting project
11/20/13 19:04:27 | upper | update requested by user
11/20/13 19:04:31 | upper | Master file download succeeded
11/20/13 19:04:36 | upper | Sending scheduler request: Requested by user.
11/20/13 19:04:36 | upper | Requesting new tasks for CPU and intel_gpu
11/20/13 19:04:39 | upper | Scheduler request completed: got 0 new tasks
11/20/13 19:04:39 | upper | No tasks sent
11/20/13 19:39:05 | | Resuming computation
11/20/13 19:39:50 | upper | Sending scheduler request: To fetch work.
11/20/13 19:39:50 | upper | Requesting new tasks for CPU and intel_gpu
11/20/13 19:39:52 | upper | Scheduler request completed: got 0 new tasks
11/20/13 19:39:52 | upper | No tasks sent
ID: 51386 · Report as offensive
ChristianB
Volunteer developer
Volunteer tester

Send message
Joined: 4 Jul 12
Posts: 321
Germany
Message 51389 - Posted: 20 Nov 2013, 14:35:10 UTC

The reason why no task was send can only be found in the scheduler.log and only if you have increased the loglevel and restarted the project.

Are you sure there are tasks available? On the server you also have the tool "./bin/show_shmem" to see what the scheduler knows about.

P.S.: I whoised the Server-IP in the log. Is this an in house project for the company that owns this subnet?
ID: 51389 · Report as offensive
shruti

Send message
Joined: 25 Oct 13
Posts: 14
United States
Message 51392 - Posted: 20 Nov 2013, 15:15:53 UTC - in response to Message 51389.  

The debug level has been set to 3 (right from the beginning).
And this is the only information I have from scheduler.log:

2013-11-20 19:04:32.7363 [PID=10396] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28
2013-11-20 19:04:32.7941 [PID=10396] Sending reply to [HOST#2]: 0 results, delay req 7.00
2013-11-20 19:04:32.7942 [PID=10396] Scheduler ran 0.060 seconds
2013-11-20 19:39:46.2352 [PID=11786] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28
2013-11-20 19:39:46.3377 [PID=11786] Sending reply to [HOST#2]: 0 results, delay req 7.00
2013-11-20 19:39:46.3378 [PID=11786] Scheduler ran 0.106 seconds
2013-11-20 19:47:33.7966 [PID=12143] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28
2013-11-20 19:47:33.8446 [PID=12143] Sending reply to [HOST#2]: 0 results, delay req 7.00
2013-11-20 19:47:33.8446 [PID=12143] Scheduler ran 0.052 seconds
2013-11-20 19:47:47.5364 [PID=12144] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28
2013-11-20 19:47:47.6150 [PID=12144] Sending reply to [HOST#2]: 0 results, delay req 7.00
2013-11-20 19:47:47.6151 [PID=12144] Scheduler ran 0.082 seconds
2013-11-20 19:53:00.6629 [PID=12361] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28
2013-11-20 19:53:00.7505 [PID=12361] Sending reply to [HOST#2]: 0 results, delay req 7.00
2013-11-20 19:53:00.7506 [PID=12361] Scheduler ran 0.091 seconds
2013-11-20 19:55:14.0748 [PID=12486] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28
2013-11-20 19:55:14.1584 [PID=12486] Sending reply to [HOST#2]: 0 results, delay req 7.00
2013-11-20 19:55:14.1584 [PID=12486] Scheduler ran 0.087 seconds
2013-11-20 20:12:28.4779 [PID=13114] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28
2013-11-20 20:12:28.6164 [PID=13114] Sending reply to [HOST#2]: 0 results, delay req 7.00
2013-11-20 20:12:28.6164 [PID=13114] Scheduler ran 0.142 seconds
2013-11-20 20:21:41.8161 [PID=13473] Request: [USER#2] [HOST#2] [IP 13.XX.XX.XX] client 7.2.28
2013-11-20 20:21:41.8996 [PID=13473] Sending reply to [HOST#2]: 0 results, delay req 7.00
2013-11-20 20:21:41.8997 [PID=13473] Scheduler ran 0.087 seconds


Also here is the info upon running this command ./bin/show_shmem (It shows the workunit).

appid: 4 platformid: 3 version_num: 100 plan_class:
have CPU apps: yes
have NVIDIA GPU apps: no
have AMD/ATI GPU apps: no
have Intel GPU apps: no
Jobs; key:
ap: app ID
ic: infeasible count
wu: workunit ID
rs: result ID
hr: HR class
nr: need reliable
host fpops mean 2200000000.000000 stddev 700000000.000000
host fpops 50th pctile 3300000000.000000 95th pctile 3300000000.000000
ready: 1
max_wu_results: 100
slot app WU ID result ID batch HR class priority in shmem size (stdev) need reliable inf count
0 uppercase 1 1 0 0 0 453554s 0.000000 no 0
1: ---
ID: 51392 · Report as offensive
ChristianB
Volunteer developer
Volunteer tester

Send message
Joined: 4 Jul 12
Posts: 321
Germany
Message 51394 - Posted: 20 Nov 2013, 15:45:54 UTC

I have no further idea. I will contact the main developer and ask if he knows another way to debug work send.
ID: 51394 · Report as offensive
shruti

Send message
Joined: 25 Oct 13
Posts: 14
United States
Message 51395 - Posted: 20 Nov 2013, 15:53:53 UTC - in response to Message 51394.  

Thanks. Let me know if there are any additional inputs from the main developer.
ID: 51395 · Report as offensive
Juha
Volunteer developer
Volunteer tester
Help desk expert

Send message
Joined: 20 Nov 12
Posts: 801
Finland
Message 51400 - Posted: 20 Nov 2013, 19:11:52 UTC

I played a little bit with my test server. Even with debug level=3 there isn't much debugging information written to the log.

You really need <debug_send> and maybe <debug_version_select> and <debug_quota> too to get anything useful information.
ID: 51400 · Report as offensive
mynotos

Send message
Joined: 18 Dec 13
Posts: 8
Germany
Message 51786 - Posted: 18 Dec 2013, 22:15:06 UTC

i have the same problem , i get no tasks :) but work units are there and jobs files are created i don't know whats the problem :(( i and have the same output as shruti
ID: 51786 · Report as offensive
ChristianB
Volunteer developer
Volunteer tester

Send message
Joined: 4 Jul 12
Posts: 321
Germany
Message 51877 - Posted: 4 Jan 2014, 13:00:09 UTC

I recently also had the problem with unsend tasks and solved it by enabling the <debug_array/> logging for the scheduler. For everyone who also suffers this problem please enable the following options in your config.xml:
<debug_send/>
<debug_version_select/>
<debug_array/>
<sched_debug_level> 3 </sched_debug_level>

Than restart the project (bin/stop; bin/start), update the client and post the part of the scheduler.log that corresponds to this RPC. Best way I found is to use grep as follows:
cat scheduler.log | grep HOST#2354

This will give you the latest RPC of this host (change to suit your host) extract the PID and run another grep:
cat scheduler.log | grep PID=26407

This will give you the whole request and messages.
ID: 51877 · Report as offensive
Previous · 1 · 2

Message boards : Server programs : Scheduler request completed: got 0 new tasks

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.