Message boards : Questions and problems : Cannot Get Work -Climate Prediction
Message board moderation
Author | Message |
---|---|
Send message Joined: 31 Dec 18 Posts: 313 ![]() |
I have a long standing problem with climate.prediction.net whereby I cannot get the server to send me new work. The shed_ops_debug shows that I am requesting n seconds and m units of work but no work (and no error message) is returned. All other projects are working fine. My machines are :- Ubuntu 20.04, Boinc 7.16.14, Ryzen 5 2600, no GPU. This machine has 850,000 credits with CP before it stopped working, no obvious change to trigger the problem. Ubuntu 20.04, Boinc 7.16.6, Ryzen 5 3600, Nvidea GT710 1mb GPU. This machine was added after the problem started and has never managed to get work. I have sent many logs to the CP forums and they cannot see any problems with the work fetch. I can attach some here if you think it would help. I have detached, rebooted, reattached several times, with and without setting nnt for all other projects and/or increasing the buffer size to 10+10 before reattaching. I have not set nnt and then run the existing buffers down to zero as I am loath to do that and waste good processing time. I have set up a new user within CP and attached to that, no joy. So, to my question. I have spent a short while looking through the code in github for the server side scheduler and it seems to me that there are two conditions where it blocks the send of work and logs the fact but does not appear to return an error message to the user. During a work request it sets a lock file on the host id. During the next work request it finds the lock file still exists so exits. It receives an unrecognised code sign key. Now, obviously, I cannot check the server for an uncleared lock file but is there any way I can change my host id and is there any way I can resync my code sign key? |
![]() ![]() Send message Joined: 17 Nov 16 Posts: 904 ![]() |
It receives an unrecognised code sign key. Not familiar with Climate Prediction. Does it sign its applications? I assume you are not talking about SSL certs or something as I think you have implied you can contact the project and get a reply from the scheduler. You are just not getting work Do you have this cc_config.xml parameter set in the Proxy Info section? <unsigned_apps_ok>0</unsigned_apps_ok> Does CP use this parameter and do you have it set wrong? |
![]() Send message Joined: 28 Jun 10 Posts: 2829 ![]() |
Does CP use this parameter and do you have it set wrong? Having followed and been part of the discussion over on the CPDN boards I am pretty certain that this is with a default installation of BOINC. Attach to project and request work. No fiddling with anything apart from resource shares and suspending other projects etc. to try and kick things into action. |
Send message Joined: 25 Nov 05 Posts: 1654 ![]() |
Finally find where I posted about this: Not downloading tasks Sat 11 Jan 2020 22:54:25 AEDT | climateprediction.net | No tasks sent |
Send message Joined: 31 Dec 18 Posts: 313 ![]() |
It receives an unrecognised code sign key. I’m guessing it’s a mechanism Boinc uses to ensure that the applications have not been tampered with. I’m sure it’s not specific to CP. The parameter is one I’ve not seen before, I’ll check on the setting in the morning and report back. |
Send message Joined: 31 Dec 18 Posts: 313 ![]() |
Finally find where I posted about this: Not downloading tasks So all I need is a convenient thunderstorm? :-) I’d guess it’s not a hard bug in any of the newer versions or they’d be swamped with error reports and I’m fairly certain the client is actually making the request which is why I looked at the server side - either for a bug there or a mismatch between the two sides. |
Send message Joined: 8 Nov 10 Posts: 310 ![]() |
I have detached, rebooted, reattached several times, with and without setting nnt for all other projects and/or increasing the buffer size to 10+10 before reattaching. I have not set nnt and then run the existing buffers down to zero as I am loath to do that and waste good processing time. The only problem I have had with the Linux version (other than the 32-bit libraries) is the buffer being too short. It depends on what other projects you are running too. For example, Rosetta tends to be rather long also. Often the default 0.1 + 0.5 day buffer is not enough. The world won't end (just yet) if you let it run dry, and set it to 0.5 + 1.0 days. That should work. |
Send message Joined: 31 Dec 18 Posts: 313 ![]() |
I have detached, rebooted, reattached several times, with and without setting nnt for all other projects and/or increasing the buffer size to 10+10 before reattaching. I have not set nnt and then run the existing buffers down to zero as I am loath to do that and waste good processing time. My default is 0.1 + 0.1 and I assume that setting nnt then changing the buffer to 10 + 10 before requesting work would have the same effect without requiring me to run down the existing WUs first? |
Send message Joined: 25 Nov 05 Posts: 1654 ![]() |
It stopped again, and I gave up on it. |
Send message Joined: 31 Dec 18 Posts: 313 ![]() |
It stopped again, and I gave up on it. It’s like a loose tooth - I keep on going back to it! |
Send message Joined: 31 Dec 18 Posts: 313 ![]() |
It receives an unrecognised code sign key. I checked the parameter on both machines and it is unset (0). I did notice, however, that no alt platforms was still set from the days when I was getting 32 bit apps from ?WCG? that were crashing. I’ve reset it just in case. |
Send message Joined: 31 Dec 18 Posts: 313 ![]() |
It was worth a try, made no difference though. |
Send message Joined: 8 Nov 10 Posts: 310 ![]() |
My default is 0.1 + 0.1 and I assume that setting nnt then changing the buffer to 10 + 10 before requesting work would have the same effect without requiring me to run down the existing WUs first?If the BOINC scheduler behaved the way we thought it should, we wouldn't be here discussing it. |
Send message Joined: 31 Dec 18 Posts: 313 ![]() |
My default is 0.1 + 0.1 and I assume that setting nnt then changing the buffer to 10 + 10 before requesting work would have the same effect without requiring me to run down the existing WUs first?If the BOINC scheduler behaved the way we thought it should, we wouldn't be here discussing it. Touché, I offer no defence :-) |
Send message Joined: 25 May 09 Posts: 1325 ![]() |
I've been running 1 +0.01 for some time, but CPDN does not follow the convention on sending out work as the majority of its tasks are of extremely long duration. As far as I can see it sends out enough work to populate all allowed CPU cores (in my case four out of eight), then only restocks as cores become available until work balance is restored as far as CPDN. Also, on my system at least, BONC is pretty good at "bumping" CPDN out of the way of other work that is due to expire and not refilling the cache until CPDN is out of the way. |
Send message Joined: 31 Dec 18 Posts: 313 ![]() |
So back to the original questions :- Is there any way I can force a change of host id? Is there any way I can resync the code sign key? |
Send message Joined: 8 Nov 10 Posts: 310 ![]() |
Is there any way I can force a change of host id?It is easy enough on Ubuntu by just updating. This is the same machine with different Linux kernels: https://www.cpdn.org/show_host_detail.php?hostid=1507626 https://www.cpdn.org/show_host_detail.php?hostid=1508717 But you could probably do it by just changing the PC name if you are on Windows. And you could then change the name back again I suppose. |
Send message Joined: 31 Dec 18 Posts: 313 ![]() |
Is there any way I can force a change of host id?It is easy enough on Ubuntu by just updating. This is the same machine with different Linux kernels: Hmm. I’m on Ubuntu 20.04 and the next update is 20.10 which, as far as I understand, gives problems with Boinc so I’d rather not update from the LTS version yet. I’ve just tried a name change and it registered with CP but stayed as the same machine with the same host id |
Send message Joined: 25 May 09 Posts: 1325 ![]() |
Changing computer name is not one of the triggers used to get a new user computer ID, basically BOINC ignores the name you give to a computer. A change in operating system, bigger than just a Linux version change, but a more substantial type (Linux to Windows) will get you a new computer ID. Changing Linux family may count, but I'm not sure about that. BIG hardware changes (e.g. changing from an Intel to an AMD CPU) will almost certainly get you a new computer ID. There have been times when changing BOINC version has triggered a mass computer ID change, but that was very much the luck of the draw. edit to add: Linux kernel version sometimes works, but sometimes doesn't. |
![]() Send message Joined: 28 Jun 10 Posts: 2829 ![]() |
Hmm. I’m on Ubuntu 20.04 and the next update is 20.10 which, as far as I understand, gives problems with Boinc so I’d rather not update from the LTS version yet. I have a fresh install of 20.10 on both my Ryzen and the old Laptop and both seem to be working OK with BOINC. I have compiled from source however rather than using the version supplied by repository. |
Copyright © 2025 University of California.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software Foundation.