Message boards : Server programs : Scheduler request completed: got 0 new tasks
Message board moderation
Author | Message |
---|---|
Send message Joined: 25 Oct 13 Posts: 14 |
Each time I run a boinc client for a simple "uppercase" application - that reads contents of a text file and converts it to uppercase I get the following error: 25-Oct-2013 15:44:17 [uc] Scheduler request completed: got 0 new tasks 25-Oct-2013 15:44:17 [uc] No tasks sent I have created workunits. The same project executes fine if I create the client on the same machine (i.e. the client and server reside on the same machine). Any ideas ? -S |
Send message Joined: 4 Jul 12 Posts: 321 |
You can check the scheduler.log on the server there should be some more information on why no tasks where send. |
Send message Joined: 25 Oct 13 Posts: 14 |
I have boinc-server running on machine 1. Initially I installed a client in a different user on machine 1 it ran fine. Now I have installed a client on machine 2. And this creates a problem. The scheduler.log has the following record in it: 2013-10-22 16:07:19.3813 [PID=28516] [HOST#1] [USER#1] Found similar existing host for this user - assigned. 2013-10-22 16:07:19.3818 [PID=28516] Request: [USER#1] [HOST#1] [IP 13.2XX.XXX.XX] client 7.1.0 2013-10-22 16:07:19.4642 [PID=28516] Sending reply to [HOST#1]: 0 results, delay req 7.00 Could this be the reason - BOINC is still looking at the client on the local machine (machine 1) instead of machine 2 (remote machine) ? Any ideas on how this can be resolved ? Thanks. |
Send message Joined: 4 Jul 12 Posts: 321 |
Id on't know what's going on there. This is the first time I saw this kind of scheduler message. The second machine should show up as host#2. You can force this by creating a second user and attaching the second machine with this user. |
Send message Joined: 25 Oct 13 Posts: 14 |
My set up is as follows : Machine 1 : Boinc Server Login : user 1 Machine 2: Client Login : user 2 I think I already have the setup you are mentioning or am I missing something ? |
Send message Joined: 29 Aug 05 Posts: 15565 |
See http://boinc.berkeley.edu/trac/browser/boinc-v2/sched/handle_request.cpp, line 410 and further. Is the second machine really a physical second machine? Or a VM running on the same machine as host1? Was the second BOINC really installed from scratch, or was it cloned? If cloned, was it done so from the existing BOINC with its work and client_state.xml file? |
Send message Joined: 4 Jul 12 Posts: 321 |
My set up is as follows : I don't mean the user to login to the machine I mean the Project user used to attach the client to the project. |
Send message Joined: 25 Oct 13 Posts: 14 |
I have tried creating a new project user account and am using the new generated key to attach the project. |
Send message Joined: 25 Oct 13 Posts: 14 |
1) Second machine is a different physical machine. Not a VM running on the same physical machine on which the BOINC server is running. 2) I had copied the BOINC folder to the second machine and then installed the client. |
Send message Joined: 29 Aug 05 Posts: 15565 |
2) I had copied the BOINC folder to the second machine and then installed the client. Well, there you have it. Copying the BOINC directory, presumably with everything in it (sub-directories, files), means you now have two computers with the same hostID. Can as far as I know only be fixed by detaching (Remove) and reattaching (Add) the second BOINC. And next time you want to 'easily add your project to a BOINC', only copy the account_*.xml file. |
Send message Joined: 25 Oct 13 Posts: 14 |
I created a new user (generated a new key). And I ran the following commands: ./boinc_client --detach_project http://13.XX.XX.XX/uc/ ./boinc_client --attach_project http://13.XX.XX.XX/uc/ ae2687667fd0446d16298a91b61d01eb I still get the same error. When you say, "Can as far as I know only be fixed by detaching (Remove) and reattaching (Add) the second BOINC" are you referring to the project or to the BOINC folder ? What modifications do I need to do to the BOINC client I currently have so that it resolves the hostid conflict error ? |
Send message Joined: 29 Aug 05 Posts: 15565 |
When you say, "Can as far as I know only be fixed by detaching (Remove) and reattaching (Add) the second BOINC" are you referring to the project or to the BOINC folder ? Best is to remove the client_state.xml file after you did a project detach. (And you can also use BOINCCMD --project URL detach .. that way you don't start the client twice) |
Send message Joined: 25 Oct 13 Posts: 14 |
I created a new project, with new work units. Also removed the client_state.xml and client_state_prev.xml of the previous project. I created a new user and generated new keys. I still get the same error: 14-Nov-2013 12:07:10 [http://13.XX.XX.XX/upper/] Sending scheduler request: Project initialization. 14-Nov-2013 12:07:10 [http://13.XX.XX.XX/upper/] Requesting new tasks for CPU 14-Nov-2013 12:07:13 [upper] Scheduler request completed: got 0 new tasks 14-Nov-2013 12:07:13 [upper] No tasks sent Any ideas ? |
Send message Joined: 29 Aug 05 Posts: 15565 |
Not without more information, no. So, make a client configuration file (cc_config.xml) in your BOINC Data directory (depending on which BOINC client you use, that's either home/BOINC/ for the Berkeley installer or /etc/boinc-client/ or /var/lib/boinc/ for the various repository installations) and add into it: <cc_config> <log_flags> <work_fetch_debug>1</work_fetch_debug> <file_xfer_debug>1</file_xfer_debug> <sched_op_debug>1</sched_op_debug> <task_debug>1</task_debug> </log_flags> <options> </options> </cc_config> Save and exit & restart the BOINC client. Next log a work request and post that here. |
Send message Joined: 25 Oct 13 Posts: 14 |
Here is the log for a work request being sent. 14-Nov-2013 14:54:00 [http://13.XX.XX.XX/upper/] URL http://13.XX.XX.XX/upper/; Computer ID not assigned yet; resource share 100 14-Nov-2013 14:54:00 [---] No general preferences found - using defaults 14-Nov-2013 14:54:00 [---] Preferences: 14-Nov-2013 14:54:00 [---] max memory usage when active: 16065.26MB 14-Nov-2013 14:54:00 [---] max memory usage when idle: 28917.46MB 14-Nov-2013 14:54:00 [---] max disk usage: 1000.00GB 14-Nov-2013 14:54:00 [---] don't use GPU while active 14-Nov-2013 14:54:00 [---] suspend work if non-BOINC CPU load exceeds 25 % 14-Nov-2013 14:54:00 [---] (to change preferences, visit a project web site or select Preferences in the Manager) 14-Nov-2013 14:54:00 [---] [work_fetch] Request work fetch: Prefs update 14-Nov-2013 14:54:00 [---] [work_fetch] Request work fetch: Startup 14-Nov-2013 14:54:00 [---] Not using a proxy Initialization completed 14-Nov-2013 14:54:02 [---] [work_fetch] work fetch start 14-Nov-2013 14:54:02 [---] [work_fetch] choose_project() for CPU: buffer_low: yes; sim_excluded_instances 0 14-Nov-2013 14:54:02 [http://13.XX.XX.XX/upper/] [work_fetch] set_request() for CPU: ninst 32 nused_total 0.000000 nidle_now 32.000000 fetch share 1.000000 req_inst 32.000000 req_secs 1658880.000000 14-Nov-2013 14:54:02 [---] [work_fetch] ------- start work fetch state ------- 14-Nov-2013 14:54:02 [---] [work_fetch] target work buffer: 8640.00 + 43200.00 sec 14-Nov-2013 14:54:02 [---] [work_fetch] --- project states --- 14-Nov-2013 14:54:02 [http://http:/13.XX.XX.XX/mci/] [work_fetch] REC 0.000 prio -0.000000 can't req work: master URL fetch pending (backoff: 76645.23 sec) 14-Nov-2013 14:54:02 [http://13.XX.XX.XX/upper/] [work_fetch] REC 0.000 prio -0.000000 can req work 14-Nov-2013 14:54:02 [---] [work_fetch] --- state for CPU --- 14-Nov-2013 14:54:02 [---] [work_fetch] shortfall 1658880.00 nidle 32.00 saturated 0.00 busy 0.00 14-Nov-2013 14:54:02 [http://http:/13.XX.XX.XX/mci/] [work_fetch] fetch share 0.000 14-Nov-2013 14:54:02 [http://13.XX.XX.XX/upper/] [work_fetch] fetch share 1.000 14-Nov-2013 14:54:02 [---] [work_fetch] ------- end work fetch state ------- 14-Nov-2013 14:54:02 [http://13.XX.XX.XX/upper/] [sched_op] Starting scheduler request 14-Nov-2013 14:54:02 [http://13.XX.XX.XX/upper/] [sched_op] Fetching master file 14-Nov-2013 14:54:45 [http://13.XX.XX.XX/upper/] [sched_op] Got master file; parsing 14-Nov-2013 14:54:45 [http://13.XX.XX.XX/upper/] [sched_op] Found 1 scheduler URLs in master file 14-Nov-2013 14:54:45 [http://13.XX.XX.XX/upper/] Master file download succeeded 14-Nov-2013 14:54:45 [---] [work_fetch] Request work fetch: Master fetch complete 14-Nov-2013 14:54:50 [---] [work_fetch] work fetch start 14-Nov-2013 14:54:50 [---] [work_fetch] choose_project() for CPU: buffer_low: yes; sim_excluded_instances 0 14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [work_fetch] set_request() for CPU: ninst 32 nused_total 0.000000 nidle_now 32.000000 fetch share 1.000000 req_inst 32.000000 req_secs 1658880.000000 14-Nov-2013 14:54:50 [---] [work_fetch] ------- start work fetch state ------- 14-Nov-2013 14:54:50 [---] [work_fetch] target work buffer: 8640.00 + 43200.00 sec 14-Nov-2013 14:54:50 [---] [work_fetch] --- project states --- 14-Nov-2013 14:54:50 [http://http:/13.XX.XX.XX/mci/] [work_fetch] REC 0.000 prio -0.000000 can't req work: master URL fetch pending (backoff: 76596.69 sec) 14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [work_fetch] REC 0.000 prio -0.000000 can req work 14-Nov-2013 14:54:50 [---] [work_fetch] --- state for CPU --- 14-Nov-2013 14:54:50 [---] [work_fetch] shortfall 1658880.00 nidle 32.00 saturated 0.00 busy 0.00 14-Nov-2013 14:54:50 [http://http:/13.XX.XX.XX/mci/] [work_fetch] fetch share 0.000 14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [work_fetch] fetch share 1.000 14-Nov-2013 14:54:50 [---] [work_fetch] ------- end work fetch state ------- 14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [sched_op] Starting scheduler request 14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [work_fetch] request: CPU (1.00 sec, 0.00 inst) 14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] Sending scheduler request: Project initialization. 14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] Requesting new tasks for CPU 14-Nov-2013 14:54:50 [http://13.XX.XX.XX/upper/] [sched_op] CPU work request: 1.00 seconds; 0.00 devices 14-Nov-2013 14:54:53 [upper] Scheduler request completed: got 0 new tasks 14-Nov-2013 14:54:53 [upper] [sched_op] Server version 701 14-Nov-2013 14:54:53 [upper] No tasks sent 14-Nov-2013 14:54:53 [upper] Project requested delay of 7 seconds 14-Nov-2013 14:54:53 [upper] [work_fetch] backing off CPU 579 sec 14-Nov-2013 14:54:53 [upper] [sched_op] Deferring communication for 7 sec 14-Nov-2013 14:54:53 [upper] [sched_op] Reason: requested by project 14-Nov-2013 14:54:53 [---] [work_fetch] Request work fetch: RPC complete 14-Nov-2013 14:54:58 [---] [work_fetch] work fetch start 14-Nov-2013 14:54:58 [---] [work_fetch] choose_project() for CPU: buffer_low: yes; sim_excluded_instances 0 14-Nov-2013 14:54:58 [---] [work_fetch] no eligible project for CPU 14-Nov-2013 14:54:58 [---] [work_fetch] ------- start work fetch state ------- 14-Nov-2013 14:54:58 [---] [work_fetch] target work buffer: 8640.00 + 43200.00 sec |
Send message Joined: 21 Jun 06 Posts: 156 |
What daemons are running? I think there is a missing feeder daemon? |
Send message Joined: 4 Jul 12 Posts: 321 |
You may also take a look in the scheduler.log on the server. I think you have to enable scheduler debugging for this. See: http://boinc.berkeley.edu/trac/wiki/ProjectOptions#Logging (you may start with <debug_send/>) This will produce a more verbose output with some axplanations why no work was feasible for your host. Regards Christian |
Send message Joined: 25 Oct 13 Posts: 14 |
Here is what I observer in the scheduler.log file. One observation here is that the IP to which the request is being sent does not match the IP of the machine on which my BOINC server is running. I am not sure why it is picking up a random IP each time. 2013-11-14 12:05:39.3363 [PID=32351] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0 2013-11-14 12:05:39.3769 [PID=32351] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 12:05:39.3770 [PID=32351] Scheduler ran 0.206 seconds 2013-11-14 12:12:24.0633 [PID=32637] Request: [USER#1] [HOST#1] [IP 13.110.2.1] client 7.1.0 2013-11-14 12:12:24.1348 [PID=32637] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 12:12:24.1349 [PID=32637] Scheduler ran 0.076 seconds 2013-11-14 12:29:49.0843 [PID=803 ] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0 2013-11-14 12:29:49.1508 [PID=803 ] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 12:29:49.1509 [PID=803 ] Scheduler ran 0.070 seconds 2013-11-14 12:52:58.4942 [PID=2062 ] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0 2013-11-14 12:52:58.5671 [PID=2062 ] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 12:52:58.5672 [PID=2062 ] Scheduler ran 0.076 seconds 2013-11-14 14:47:16.4072 [PID=7071 ] Request: [USER#1] [HOST#1] [IP 13.163.4.9] client 7.1.0 2013-11-14 14:47:16.5072 [PID=7071 ] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 14:47:16.5073 [PID=7071 ] Scheduler ran 0.103 seconds 2013-11-14 14:53:19.3211 [PID=7321 ] [HOST#1] [USER#1] Found similar existing host for this user - assigned. 2013-11-14 14:53:19.3216 [PID=7321 ] Request: [USER#1] [HOST#1] [IP 13.163.4.9] client 7.1.0 2013-11-14 14:53:19.3972 [PID=7321 ] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 14:53:19.3973 [PID=7321 ] Scheduler ran 0.080 seconds 2013-11-14 15:03:34.5202 [PID=7718 ] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0 2013-11-14 15:03:34.5828 [PID=7718 ] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 15:03:34.5829 [PID=7718 ] Scheduler ran 0.066 seconds 2013-11-14 15:30:52.3559 [PID=8831 ] Request: [USER#1] [HOST#1] [IP 13.110.2.1] client 7.1.0 2013-11-14 15:30:52.4439 [PID=8831 ] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 15:30:52.4440 [PID=8831 ] Scheduler ran 0.091 seconds 2013-11-14 16:33:04.0699 [PID=11686] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0 2013-11-14 16:33:04.1730 [PID=11686] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 16:33:04.1731 [PID=11686] Scheduler ran 0.106 seconds 2013-11-14 16:38:51.6167 [PID=11906] [HOST#1] [USER#1] Found similar existing host for this user - assigned. 2013-11-14 16:38:51.6173 [PID=11906] Request: [USER#1] [HOST#1] [IP 13.111.1.2] client 7.1.0 2013-11-14 16:38:51.7082 [PID=11906] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 16:38:51.7083 [PID=11906] Scheduler ran 0.096 seconds 2013-11-14 16:45:17.4583 [PID=12220] [HOST#1] [USER#1] Found similar existing host for this user - assigned. 2013-11-14 16:45:17.4585 [PID=12220] Request: [USER#1] [HOST#1] [IP 13.163.4.9] client 7.1.0 2013-11-14 16:45:17.5689 [PID=12220] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 16:45:17.5690 [PID=12220] Scheduler ran 0.114 seconds 2013-11-14 17:00:34.0785 [PID=12829] Request: [USER#1] [HOST#1] [IP 13.110.2.1] client 7.1.0 2013-11-14 17:00:34.1462 [PID=12829] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 17:00:34.1463 [PID=12829] Scheduler ran 0.071 seconds 2013-11-14 17:28:51.1941 [PID=13874] Request: [USER#1] [HOST#1] [IP 13.163.4.9] client 7.1.0 2013-11-14 17:28:51.3115 [PID=13874] Sending reply to [HOST#1]: 0 results, delay req 7.00 2013-11-14 17:28:51.3116 [PID=13874] Scheduler ran 0.121 seconds 2013-11-14 18:00:08.3190 [PID=15163] Request: [USER#1] [HOST#1] [IP 13.163.4.9] client 7.1.0 |
Send message Joined: 25 Oct 13 Posts: 14 |
Here is the o/p at the boinc server: Starting daemons Starting daemon: feeder -d 3 Starting daemon: transitioner -d 3 Starting daemon: file_deleter -d 3 Starting daemon: sample_trivial_validator -d 3 app uppercase Starting daemon: sample_dummy_assimilator -d 3 app uppercase |
Send message Joined: 29 Aug 05 Posts: 15565 |
Can you post the first lines of the start-up messages, starting from "Starting BOINC client version..." and up till the 14-Nov-2013 14:54:00 [---] No general preferences found - using defaults line, please? |
Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software Foundation.