Thread 'Scheduler request completed: got 0 new tasks'

shruti
Joined: 25 Oct 13
Posts: 14
United States
Message 51016 - Posted: 25 Oct 2013, 10:41:20 UTC

Each time I run a BOINC client for a simple "uppercase" application (one that reads the contents of a text file and converts them to uppercase), I get the following error:

25-Oct-2013 15:44:17 [uc] Scheduler request completed: got 0 new tasks
25-Oct-2013 15:44:17 [uc] No tasks sent

I have created workunits.
The same project runs fine if I set up the client on the same machine as the server (i.e. the client and server reside on the same machine).

Any ideas?

-S
ID: 51016

ChristianB
Volunteer developer
Volunteer tester
Joined: 4 Jul 12
Posts: 321
Germany
Message 51017 - Posted: 25 Oct 2013, 13:19:44 UTC

You can check scheduler.log on the server; there should be more information there on why no tasks were sent.
ID: 51017

shruti
Joined: 25 Oct 13
Posts: 14
United States
Message 51094 - Posted: 30 Oct 2013, 7:55:42 UTC - in response to Message 51017.  

I have the BOINC server running on machine 1. Initially I installed a client under a different user account on machine 1, and it ran fine.
Now I have installed a client on machine 2, and this is where the problem appears. The scheduler.log has the following record in it:

2013-10-22 16:07:19.3813 [PID=28516] [HOST#1] [USER#1] Found similar existing host for this user - assigned.
2013-10-22 16:07:19.3818 [PID=28516] Request: [USER#1] [HOST#1] [IP 13.2XX.XXX.XX] client 7.1.0
2013-10-22 16:07:19.4642 [PID=28516] Sending reply to [HOST#1]: 0 results, delay req 7.00


Could this be the reason: BOINC is still looking at the client on the local machine (machine 1) instead of the one on machine 2 (the remote machine)?
Any ideas on how this can be resolved?

Thanks.
ID: 51094

ChristianB
Volunteer developer
Volunteer tester
Joined: 4 Jul 12
Posts: 321
Germany
Message 51097 - Posted: 30 Oct 2013, 15:22:58 UTC

I don't know what's going on there. This is the first time I have seen this kind of scheduler message. The second machine should show up as HOST#2. You can force this by creating a second user and attaching the second machine with this user.
ID: 51097

shruti
Joined: 25 Oct 13
Posts: 14
United States
Message 51099 - Posted: 30 Oct 2013, 15:58:11 UTC - in response to Message 51097.  

My set up is as follows :
Machine 1 : Boinc Server
Login : user 1


Machine 2: Client
Login : user 2

I think I already have the setup you are mentioning or am I missing something ?
ID: 51099

Jord
Volunteer tester
Help desk expert
Joined: 29 Aug 05
Posts: 15563
Netherlands
Message 51100 - Posted: 30 Oct 2013, 15:58:50 UTC - in response to Message 51097.  

See http://boinc.berkeley.edu/trac/browser/boinc-v2/sched/handle_request.cpp, line 410 and further.

Is the second machine really a physical second machine? Or a VM running on the same machine as host1?
Was the second BOINC really installed from scratch, or was it cloned? If cloned, was it done so from the existing BOINC with its work and client_state.xml file?
ID: 51100

ChristianB
Volunteer developer
Volunteer tester
Joined: 4 Jul 12
Posts: 321
Germany
Message 51101 - Posted: 30 Oct 2013, 16:08:17 UTC - in response to Message 51099.  

My set up is as follows :
Machine 1 : Boinc Server
Login : user 1


Machine 2: Client
Login : user 2

I think I already have the setup you are mentioning or am I missing something ?

I don't mean the user account used to log in to the machine; I mean the project user (account) used to attach the client to the project.
ID: 51101

shruti
Joined: 25 Oct 13
Posts: 14
United States
Message 51102 - Posted: 30 Oct 2013, 17:03:14 UTC - in response to Message 51101.  

I have tried creating a new project user account and am using the newly generated key to attach to the project.
ID: 51102

shruti
Joined: 25 Oct 13
Posts: 14
United States
Message 51103 - Posted: 30 Oct 2013, 17:05:21 UTC - in response to Message 51100.  

1) Second machine is a different physical machine. Not a VM running on the same physical machine on which the BOINC server is running.

2) I had copied the BOINC folder to the second machine and then installed the client.
ID: 51103

Jord
Volunteer tester
Help desk expert
Joined: 29 Aug 05
Posts: 15563
Netherlands
Message 51105 - Posted: 30 Oct 2013, 17:25:09 UTC - in response to Message 51103.  

2) I had copied the BOINC folder to the second machine and then installed the client.

Well, there you have it.
Copying the BOINC directory, presumably with everything in it (sub-directories, files), means you now have two computers with the same hostID.

Can as far as I know only be fixed by detaching (Remove) and reattaching (Add) the second BOINC. And next time you want to 'easily add your project to a BOINC', only copy the account_*.xml file.
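
As a rough illustration of that last suggestion, here is a minimal Python sketch that copies only the account_*.xml files from one BOINC data directory to another and leaves everything else (including client_state.xml) behind. The function name copy_account_files and any paths you pass to it are hypothetical, not part of BOINC; adjust them to your own installation.

```python
import shutil
from pathlib import Path


def copy_account_files(src_dir, dst_dir):
    """Copy only account_*.xml from src_dir to dst_dir; return the names copied.

    Hypothetical helper for illustration; it is not a BOINC tool.
    """
    src = Path(src_dir)
    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    copied = []
    for path in sorted(src.glob("account_*.xml")):
        # copy2 keeps timestamps; client_state.xml and work files are not touched.
        shutil.copy2(path, dst / path.name)
        copied.append(path.name)
    return copied
```

Calling copy_account_files(old_dir, new_dir) with your own two directories returns the list of file names it copied, so you can check that nothing except the account files was transferred.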
ID: 51105

shruti
Joined: 25 Oct 13
Posts: 14
United States
Message 51280 - Posted: 12 Nov 2013, 10:20:45 UTC - in response to Message 51105.  

I created a new user (generated a new key). And I ran the following commands:

./boinc_client --detach_project http://13.XX.XX.XX/uc/
./boinc_client --attach_project http://13.XX.XX.XX/uc/ ae2687667fd0446d16298a91b61d01eb


I still get the same error.
When you say, "Can as far as I know only be fixed by detaching (Remove) and reattaching (Add) the second BOINC", are you referring to the project or to the BOINC folder?

What changes do I need to make to the BOINC client I currently have so that the host ID conflict is resolved?
ID: 51280

Jord
Volunteer tester
Help desk expert
Joined: 29 Aug 05
Posts: 15563
Netherlands
Message 51286 - Posted: 12 Nov 2013, 16:29:32 UTC - in response to Message 51280.  

When you say, "Can as far as I know only be fixed by detaching (Remove) and reattaching (Add) the second BOINC" are you referring to the project or to the BOINC folder ?

It is best to remove the client_state.xml file after you have detached from the project. (You can also use boinccmd --project URL detach; that way you don't start the client twice.)
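
The clean-up step above can be sketched in Python. This is only an illustration under some assumptions: the client is stopped before the files are touched, the files are renamed to a .bak copy rather than deleted (a cautious variation, not what was literally advised), and the function name set_aside_state_files is hypothetical.

```python
from pathlib import Path


def set_aside_state_files(data_dir, names=("client_state.xml", "client_state_prev.xml")):
    """Rename the client state files in data_dir to <name>.bak; return the names moved.

    Hypothetical helper; run it only while the BOINC client is not running.
    """
    data = Path(data_dir)
    moved = []
    for name in names:
        path = data / name
        if path.exists():
            # replace() overwrites an older .bak file if one is already there.
            path.replace(data / (name + ".bak"))
            moved.append(name)
    return moved
```

Keeping a .bak copy means the old state can be inspected later, for example to look at the host ID it recorded.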
ID: 51286

shruti
Joined: 25 Oct 13
Posts: 14
United States
Message 51316 - Posted: 14 Nov 2013, 6:38:51 UTC - in response to Message 51286.  

I created a new project, with new work units.
Also removed the client_state.xml and client_state_prev.xml of the previous project.
I created a new user and generated new keys. I still get the same error:

14-Nov-2013 12:07:10 [http://13.XX.XX.XX/upper/] Sending scheduler request: Project initialization.
14-Nov-2013 12:07:10 [http://13.XX.XX.XX/upper/] Requesting new tasks for CPU
14-Nov-2013 12:07:13 [upper] Scheduler request completed: got 0 new tasks
14-Nov-2013 12:07:13 [upper] No tasks sent

Any ideas ?
ID: 51316

Jord
Volunteer tester
Help desk expert
Joined: 29 Aug 05
Posts: 15563
Netherlands
Message 51318 - Posted: 14 Nov 2013, 9:05:12 UTC - in response to Message 51316.  

Not without more information, no.

So, make a client configuration file (cc_config.xml) in your BOINC data directory (depending on which BOINC client you use, that's either home/BOINC/ for the Berkeley installer, or /etc/boinc-client/ or /var/lib/boinc/ for the various repository installations) and add the following to it:

<cc_config>
  <log_flags>
    <work_fetch_debug>1</work_fetch_debug>
    <file_xfer_debug>1</file_xfer_debug>
    <sched_op_debug>1</sched_op_debug>
    <task_debug>1</task_debug>
  </log_flags>
  <options>
  </options>
</cc_config>

Save the file, then exit and restart the BOINC client.
Next, log a work request and post it here.
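
If you would rather generate the file than type it, the following Python sketch builds the same cc_config.xml content with the standard library. The function name build_cc_config is hypothetical; the flag names are the four listed above.

```python
import xml.etree.ElementTree as ET


def build_cc_config(flags):
    """Return cc_config.xml text with each named log flag set to 1."""
    root = ET.Element("cc_config")
    log_flags = ET.SubElement(root, "log_flags")
    for name in flags:
        ET.SubElement(log_flags, name).text = "1"
    # Empty <options> section, as in the hand-written example.
    ET.SubElement(root, "options")
    return ET.tostring(root, encoding="unicode")


# The four debug flags suggested in this post.
DEBUG_FLAGS = ["work_fetch_debug", "file_xfer_debug", "sched_op_debug", "task_debug"]
config_text = build_cc_config(DEBUG_FLAGS)
```

Write config_text to cc_config.xml in your BOINC data directory and restart the client as described.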
ID: 51318

shruti
Joined: 25 Oct 13
Posts: 14
United States
Message 51319 - Posted: 14 Nov 2013, 9:29:52 UTC - in response to Message 51318.  

Here is the log for a work request being sent.

14-Nov-2013 14:54:00 [http://13.XX.XX.XX/upper/] URL http://13.XX.XX.XX/upper/; Computer ID not assigned yet; resource share 100
14-Nov-2013 14:54:00 [---] No general preferences found - using defaults
14-Nov-2013 14:54:00 [---] Preferences:
14-Nov-2013 14:54:00 [---] max memory usage when active: 16065.26MB
14-Nov-2013 14:54:00 [---] max memory usage when idle: 28917.46MB
14-Nov-2013 14:54:00 [---] max disk usage: 1000.00GB
14-Nov-2013 14:54:00 [---] don't use GPU while active
14-Nov-2013 14:54:00 [---] suspend work if non-BOINC CPU load exceeds 25 %
14-Nov-2013 14:54:00 [---] (to change preferences, visit a project web site or select Preferences in the Manager)
14-Nov-2013 14:54:00 [---] [work_fetch] Request work fetch: Prefs update
14-Nov-2013 14:54:00 [---] [work_fetch] Request work fetch: Startup
14-Nov-2013 14:54:00 [---] Not using a proxy
Initialization completed
14-Nov-2013 14:54:02 [---] [work_fetch] work fetch start
14-Nov-2013 14:54:02 [---] [work_fetch] choose_project() for CPU: buffer_low: yes; sim_excluded_instances 0
14-Nov-2013 14:54:02 [http://13.XX.XX.XX/upper/] [work_fetch] set_request() for CPU: ninst 32 nused_total 0.000000 nidle_now 32.000000 fetch share 1.000000 req_inst 32.000000 req_secs 1658880.000000
14-Nov-2013 14:54:02 [---] [work_fetch] ------- start work fetch state -------
14-Nov-2013 14:54:02 [---] [work_fetch] target work buffer: 8640.00 + 43200.00 sec
14-Nov-2013 14:54:02 [---] [work_fetch] --- project states ---
14-Nov-2013 14:54:02 [http://http:/13.XX.XX.XX/mci/] [work_fetch] REC 0.000 prio -0.000000 can't req work: master URL fetch pending (backoff: 76645.23 sec)
14-Nov-2013 14:54:02 [http://13.XX.XX.XX/upper/] [work_fetch] REC 0.000 prio -0.000000 can req work
14-Nov-2013 14:54:02 [---] [work_fetch] --- state for CPU ---
14-Nov-2013 14:54:02 [---] [work_fetch] shortfall 1658880.00 nidle 32.00 saturated 0.00 busy 0.00
14-Nov-2013 14:54:02 [http://http:/13.XX.XX.XX/mci/] [work_fetch] fetch share 0.000
14-Nov-2013 14:54:02 [http://13.XX.XX.XX/upper/] [work_fetch] fetch share 1.000
14-Nov-2013 14:54:02 [---] [work_fetch] ------- end work fetch state -------
14-Nov-2013 14:54:02 [http://13.XX.XX.XX/upper/] [sched_op] Starting scheduler request
14-Nov-2013 14:54:02 [http://13.XX.XX.XX/upper/] [sched_op] Fetching master file
14-Nov-2013 14:54:45 [http://13.XX.XX.XX/upper/] [sched_op] Got master file; parsing
14-Nov-2013 14:54:45 [http://13.XX.XX.XX/upper/] [sched_op] Found 1 scheduler URLs in master file
14-Nov-2013 14:54:45 [http://13.XX.XX.XX/upper/] Master file download succeeded
14-Nov-2013 14:54:45 [---] [work_fetch] Request work fetch: Master fetch complete
14-Nov-2013 14:54:50 [---] [work_fetch] work fetch start
14-Nov-2013 14:54:50 [---] [work_fetch] choose_project() for CPU: buffer_low: yes; sim_excluded_instances 0
14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [work_fetch] set_request() for CPU: ninst 32 nused_total 0.000000 nidle_now 32.000000 fetch share 1.000000 req_inst 32.000000 req_secs 1658880.000000
14-Nov-2013 14:54:50 [---] [work_fetch] ------- start work fetch state -------
14-Nov-2013 14:54:50 [---] [work_fetch] target work buffer: 8640.00 + 43200.00 sec
14-Nov-2013 14:54:50 [---] [work_fetch] --- project states ---
14-Nov-2013 14:54:50 [http://http:/13.XX.XX.XX/mci/] [work_fetch] REC 0.000 prio -0.000000 can't req work: master URL fetch pending (backoff: 76596.69 sec)
14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [work_fetch] REC 0.000 prio -0.000000 can req work
14-Nov-2013 14:54:50 [---] [work_fetch] --- state for CPU ---
14-Nov-2013 14:54:50 [---] [work_fetch] shortfall 1658880.00 nidle 32.00 saturated 0.00 busy 0.00
14-Nov-2013 14:54:50 [http://http:/13.XX.XX.XX/mci/] [work_fetch] fetch share 0.000
14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [work_fetch] fetch share 1.000
14-Nov-2013 14:54:50 [---] [work_fetch] ------- end work fetch state -------
14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [sched_op] Starting scheduler request
14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [work_fetch] request: CPU (1.00 sec, 0.00 inst)
14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] Sending scheduler request: Project initialization.
14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] Requesting new tasks for CPU
14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [sched_op] CPU work request: 1.00 seconds; 0.00 devices
14-Nov-2013 14:54:53 [upper] Scheduler request completed: got 0 new tasks
14-Nov-2013 14:54:53 [upper] [sched_op] Server version 701
14-Nov-2013 14:54:53 [upper] No tasks sent
14-Nov-2013 14:54:53 [upper] Project requested delay of 7 seconds
14-Nov-2013 14:54:53 [upper] [work_fetch] backing off CPU 579 sec
14-Nov-2013 14:54:53 [upper] [sched_op] Deferring communication for 7 sec
14-Nov-2013 14:54:53 [upper] [sched_op] Reason: requested by project
14-Nov-2013 14:54:53 [---] [work_fetch] Request work fetch: RPC complete
14-Nov-2013 14:54:58 [---] [work_fetch] work fetch start
14-Nov-2013 14:54:58 [---] [work_fetch] choose_project() for CPU: buffer_low: yes; sim_excluded_instances 0
14-Nov-2013 14:54:58 [---] [work_fetch] no eligible project for CPU
14-Nov-2013 14:54:58 [---] [work_fetch] ------- start work fetch state -------
14-Nov-2013 14:54:58 [---] [work_fetch] target work buffer: 8640.00 + 43200.00 sec
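
A log this long is easier to read once it is reduced to what was requested and what came back. The Python sketch below pairs each "[sched_op] CPU work request" line with the following "Scheduler request completed" line. The regular expressions are written against the two message formats visible in this log only; whether other client versions use the same wording is an assumption. The function name is hypothetical.

```python
import re

# Patterns taken from the message formats that appear in the log above.
REQUEST_RE = re.compile(r"\[sched_op\] CPU work request: ([\d.]+) seconds; ([\d.]+) devices")
REPLY_RE = re.compile(r"Scheduler request completed: got (\d+) new tasks")


def pair_requests_with_replies(lines):
    """Return a list of (requested_seconds, tasks_received) tuples."""
    pairs = []
    pending = None  # seconds asked for in the most recent unanswered request
    for line in lines:
        match = REQUEST_RE.search(line)
        if match:
            pending = float(match.group(1))
            continue
        match = REPLY_RE.search(line)
        if match and pending is not None:
            pairs.append((pending, int(match.group(1))))
            pending = None
    return pairs


# Two lines copied from the log in this post.
SAMPLE_CLIENT_LOG = [
    "14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [sched_op] CPU work request: 1.00 seconds; 0.00 devices",
    "14-Nov-2013 14:54:53 [upper] Scheduler request completed: got 0 new tasks",
]
print(pair_requests_with_replies(SAMPLE_CLIENT_LOG))
```

For the two sample lines this reports one request for 1.00 seconds that was answered with 0 tasks.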
ID: 51319

rebirther
Joined: 21 Jun 06
Posts: 156
Germany
Message 51320 - Posted: 14 Nov 2013, 11:24:41 UTC

Which daemons are running? I suspect the feeder daemon is missing.
ID: 51320

ChristianB
Volunteer developer
Volunteer tester
Joined: 4 Jul 12
Posts: 321
Germany
Message 51321 - Posted: 14 Nov 2013, 15:29:17 UTC
Last modified: 14 Nov 2013, 15:29:31 UTC

You may also take a look at the scheduler.log on the server. I think you have to enable scheduler debugging for this. See: http://boinc.berkeley.edu/trac/wiki/ProjectOptions#Logging (you may start with <debug_send/>)

This will produce more verbose output, with some explanations of why no work was feasible for your host.

Regards
Christian
ID: 51321

shruti
Joined: 25 Oct 13
Posts: 14
United States
Message 51326 - Posted: 15 Nov 2013, 7:35:59 UTC - in response to Message 51321.  

Here is what I observe in the scheduler.log file.
One observation here is that the IP to which the request is being sent does not match the IP of the machine on which my BOINC server is running. I am not sure why it is picking up a random IP each time.

2013-11-14 12:05:39.3363 [PID=32351] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0
2013-11-14 12:05:39.3769 [PID=32351] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 12:05:39.3770 [PID=32351] Scheduler ran 0.206 seconds
2013-11-14 12:12:24.0633 [PID=32637] Request: [USER#1] [HOST#1] [IP 13.110.2.1] client 7.1.0
2013-11-14 12:12:24.1348 [PID=32637] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 12:12:24.1349 [PID=32637] Scheduler ran 0.076 seconds
2013-11-14 12:29:49.0843 [PID=803 ] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0
2013-11-14 12:29:49.1508 [PID=803 ] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 12:29:49.1509 [PID=803 ] Scheduler ran 0.070 seconds
2013-11-14 12:52:58.4942 [PID=2062 ] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0
2013-11-14 12:52:58.5671 [PID=2062 ] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 12:52:58.5672 [PID=2062 ] Scheduler ran 0.076 seconds
2013-11-14 14:47:16.4072 [PID=7071 ] Request: [USER#1] [HOST#1] [IP 13.163.4.9] client 7.1.0
2013-11-14 14:47:16.5072 [PID=7071 ] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 14:47:16.5073 [PID=7071 ] Scheduler ran 0.103 seconds
2013-11-14 14:53:19.3211 [PID=7321 ] [HOST#1] [USER#1] Found similar existing host for this user - assigned.
2013-11-14 14:53:19.3216 [PID=7321 ] Request: [USER#1] [HOST#1] [IP 13.163.4.9] client 7.1.0
2013-11-14 14:53:19.3972 [PID=7321 ] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 14:53:19.3973 [PID=7321 ] Scheduler ran 0.080 seconds
2013-11-14 15:03:34.5202 [PID=7718 ] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0
2013-11-14 15:03:34.5828 [PID=7718 ] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 15:03:34.5829 [PID=7718 ] Scheduler ran 0.066 seconds
2013-11-14 15:30:52.3559 [PID=8831 ] Request: [USER#1] [HOST#1] [IP 13.110.2.1] client 7.1.0
2013-11-14 15:30:52.4439 [PID=8831 ] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 15:30:52.4440 [PID=8831 ] Scheduler ran 0.091 seconds
2013-11-14 16:33:04.0699 [PID=11686] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0
2013-11-14 16:33:04.1730 [PID=11686] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 16:33:04.1731 [PID=11686] Scheduler ran 0.106 seconds
2013-11-14 16:38:51.6167 [PID=11906] [HOST#1] [USER#1] Found similar existing host for this user - assigned.
2013-11-14 16:38:51.6173 [PID=11906] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0
2013-11-14 16:38:51.7082 [PID=11906] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 16:38:51.7083 [PID=11906] Scheduler ran 0.096 seconds
2013-11-14 16:45:17.4583 [PID=12220] [HOST#1] [USER#1] Found similar existing host for this user - assigned.
2013-11-14 16:45:17.4585 [PID=12220] Request: [USER#1] [HOST#1] [IP 13.163.4.9] client 7.1.0
2013-11-14 16:45:17.5689 [PID=12220] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 16:45:17.5690 [PID=12220] Scheduler ran 0.114 seconds
2013-11-14 17:00:34.0785 [PID=12829] Request: [USER#1] [HOST#1] [IP 13.110.2.1] client 7.1.0
2013-11-14 17:00:34.1462 [PID=12829] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 17:00:34.1463 [PID=12829] Scheduler ran 0.071 seconds
2013-11-14 17:28:51.1941 [PID=13874] Request: [USER#1] [HOST#1] [IP 13.163.4.9] client 7.1.0
2013-11-14 17:28:51.3115 [PID=13874] Sending reply to [HOST#1]: 0 results, delay req 7.00
2013-11-14 17:28:51.3116 [PID=13874] Scheduler ran 0.121 seconds
2013-11-14 18:00:08.3190 [PID=15163] Request: [USER#1] [HOST#1] [IP 13.163.4.9] client 7.1.0
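
To see at a glance how many different client IPs are being mapped to the same host ID, the scheduler.log lines can be tallied per host. The following Python sketch does that; the regular expressions match the "Request:" and "Sending reply to" line formats shown in this log and are not guaranteed for other server versions, and the function name is hypothetical.

```python
import re
from collections import defaultdict

# Patterns based on the scheduler.log lines quoted in this post.
REQUEST_RE = re.compile(r"Request: \[USER#(\d+)\] \[HOST#(\d+)\] \[IP ([^\]]+)\]")
REPLY_RE = re.compile(r"Sending reply to \[HOST#(\d+)\]: (\d+) results")


def summarize_scheduler_log(lines):
    """Return {host_id: {"requests": n, "ips": set_of_ips, "results": total_sent}}."""
    summary = defaultdict(lambda: {"requests": 0, "ips": set(), "results": 0})
    for line in lines:
        match = REQUEST_RE.search(line)
        if match:
            host = int(match.group(2))
            summary[host]["requests"] += 1
            summary[host]["ips"].add(match.group(3))
            continue
        match = REPLY_RE.search(line)
        if match:
            summary[int(match.group(1))]["results"] += int(match.group(2))
    return dict(summary)


# Four lines copied from the log above.
SAMPLE_LOG = [
    "2013-11-14 12:05:39.3363 [PID=32351] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0",
    "2013-11-14 12:05:39.3769 [PID=32351] Sending reply to [HOST#1]: 0 results, delay req 7.00",
    "2013-11-14 12:12:24.0633 [PID=32637] Request: [USER#1] [HOST#1] [IP 13.110.2.1] client 7.1.0",
    "2013-11-14 12:12:24.1348 [PID=32637] Sending reply to [HOST#1]: 0 results, delay req 7.00",
]
print(summarize_scheduler_log(SAMPLE_LOG))
```

In the sample, both requests are recorded against HOST#1 even though they arrive from two different IP addresses, and no results are sent in either reply.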
ID: 51326

shruti
Joined: 25 Oct 13
Posts: 14
United States
Message 51327 - Posted: 15 Nov 2013, 7:37:57 UTC - in response to Message 51320.  

Here is the output on the BOINC server:
Starting daemons
Starting daemon: feeder -d 3
Starting daemon: transitioner -d 3
Starting daemon: file_deleter -d 3
Starting daemon: sample_trivial_validator -d 3 app uppercase
Starting daemon: sample_dummy_assimilator -d 3 app uppercase
ID: 51327

Jord
Volunteer tester
Help desk expert
Joined: 29 Aug 05
Posts: 15563
Netherlands
Message 51328 - Posted: 15 Nov 2013, 7:42:26 UTC - in response to Message 51319.  

Can you post the first lines of the start-up messages, from "Starting BOINC client version..." up to the "14-Nov-2013 14:54:00 [---] No general preferences found - using defaults" line, please?
ID: 51328

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.