Thread 'News on Project Outages'

Message boards : Projects : News on Project Outages
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 16 · 17 · 18 · 19 · 20 · 21 · Next

AuthorMessage
fjpod

Send message
Joined: 17 Dec 07
Posts: 2
United States
Message 60691 - Posted: 5 Mar 2015, 20:09:58 UTC

I am not able to log on to Collatz either. It is not accepting my completed work.
ID: 60691 · Report as offensive
Dr Who Fan
Avatar

Send message
Joined: 10 May 07
Posts: 1444
United States
Message 60695 - Posted: 6 Mar 2015, 0:08:35 UTC - in response to Message 60681.  

Collatz is back up* with a News item from Slicker:
Corrupt Index Caused Authentication Errors

Summary:
An index on the user table in the database was corrupt and has been fixed. If you have issues or notice something that is out of sync somehow, please let me know.

Details:
The email address index on the user table was corrupted. This kept people from being able to authenticate when connecting. Ever heard that the definition of lunacy is doing the same thing over and over and expecting different results? If that is true, some of you need some serious counselling. ;-)

When people couldn't connect, they attempted to authenticate and BOINC's logic is that if it can't find you, you must be new. So, it created another record because the index used to look up your existing record was corrupt -- or for some lunatics, another and another and another and another... That resulted in multiple user records with the same email address and because of that, I couldn't just drop and re-create the corrupt index because there were now duplicate data values that needed to first be removed.

So, after cleaning up hundreds of records each of which had to be manually evaluated to determine whether any other data was associated with the user record (since BOINC doesn't have any data integrity between tables because it uses no foreign key relationships in the database it will allow me to delete records that do have data associated with them which means I have to be very careful when fixing things). Once they were all removed, I was able to rebuild the index and allow everyone to access the project again.

*anyone want to put a guess on how long it stays up this time?
ID: 60695 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5129
United Kingdom
Message 60703 - Posted: 6 Mar 2015, 9:18:39 UTC

Climate Prediction (CPDN)

It's not currently possible to download the data files for new CPDN tasks.

Staff are investigating a possible hardware failure. More news later.
ID: 60703 · Report as offensive
David Ball

Send message
Joined: 2 Dec 06
Posts: 69
United States
Message 60704 - Posted: 6 Mar 2015, 11:35:59 UTC
Last modified: 6 Mar 2015, 11:37:41 UTC

Haven't gotten any work from Convector for a while. For the last 2 or 3 days, I can't even connect to the server either through boinc or through a web browser. Boincsynergy says it hasn't been able to get stats for 3.55 days. Does anyone know if the project is going away or just having problems?

http://convector.fsv.cvut.cz/

edit: Add project URL
ID: 60704 · Report as offensive
ProfileTigers Dave

Send message
Joined: 24 Dec 05
Posts: 52
United States
Message 60716 - Posted: 6 Mar 2015, 20:46:35 UTC - in response to Message 60688.  
Last modified: 6 Mar 2015, 20:47:19 UTC

Update on my previous message 60688.

Now that Collatz is back up, my backlog of completed tasks has been uploaded and I received plenty of new tasks to crunch. I did not lose any completed work units, either; my crunching activity (~1.1 M/day) for the last 9 days (2/26-3/07) is within 3% of my activity for the preceding two weeks (2/12 to 2/26).

I am going to try to continue to treat Collatz and Slicker with patience. I think they deserve it.
ID: 60716 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 60776 - Posted: 9 Mar 2015, 18:12:52 UTC

Collatz is back down as it so frequently is.

I count 12 outage events of various lengths over the past 5 months.
ID: 60776 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 60799 - Posted: 9 Mar 2015, 22:57:27 UTC - in response to Message 60776.  

GPUGrid is offline
ID: 60799 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 60860 - Posted: 11 Mar 2015, 20:01:26 UTC - in response to Message 60799.  

GPUGrid was only offline for a couple of hours.

Collatz is offline again.
ID: 60860 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 60873 - Posted: 12 Mar 2015, 4:33:13 UTC - in response to Message 60860.  

GPUGrid went offline again about an hour ago.

Collatz remains offline.
ID: 60873 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 60912 - Posted: 13 Mar 2015, 5:42:03 UTC - in response to Message 60873.  

The GPUGrid outage was only for a few hours -- they were tweaking things to make the server more responsive -- seems to have helped.

Collatz was temporarily running earlier to day, but is down yet again this evening.
ID: 60912 · Report as offensive
WezH

Send message
Joined: 1 Oct 12
Posts: 90
Finland
Message 61083 - Posted: 19 Mar 2015, 18:10:23 UTC
Last modified: 19 Mar 2015, 18:10:52 UTC

WUProp@Home is down
ID: 61083 · Report as offensive
ProfileBlurf

Send message
Joined: 18 Jul 11
Posts: 217
United States
Message 61084 - Posted: 19 Mar 2015, 18:23:30 UTC

Can't upload files to GPUGrid...website is accessible
ID: 61084 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5129
United Kingdom
Message 61086 - Posted: 19 Mar 2015, 18:45:10 UTC - in response to Message 61084.  

Can't upload files to GPUGrid...website is accessible

There's a verified workround at https://www.gpugrid.net/forum_thread.php?id=3846&nowrap=true#40528 - use the certificate bundle from v7.4.36

Or simply update to the current recommended BOINC version for your operating system.
ID: 61086 · Report as offensive
Dr Who Fan
Avatar

Send message
Joined: 10 May 07
Posts: 1444
United States
Message 61088 - Posted: 19 Mar 2015, 19:45:57 UTC - in response to Message 61083.  

WuProp is into the 2nd hour of being down... appears to be another Database crash:
Server Status was last updated as of 19 Mar 2015, 17:38:40 UTC
Warning: mysqli::mysqli(): (08004/1040): Too many connections in /home/sebastien/projects/wuproj/html/inc/db_conn.inc on line 39

Warning: mysqli::query(): Couldn't fetch mysqli in /home/sebastien/projects/wuproj/html/inc/db_conn.inc on line 63
Database Error

Warning: mysqli::escape_string(): Couldn't fetch mysqli in /home/sebastien/projects/wuproj/html/inc/db_conn.inc on line 262
ID: 61088 · Report as offensive
ProfileBlurf

Send message
Joined: 18 Jul 11
Posts: 217
United States
Message 61089 - Posted: 19 Mar 2015, 20:02:32 UTC - in response to Message 61086.  

There's a verified workround at https://www.gpugrid.net/forum_thread.php?id=3846&nowrap=true#40528 - use the certificate bundle from v7.4.36

Or simply update to the current recommended BOINC version for your operating system.


Thank you Richard. I like my older Boinc version so I'll try the workaround when I get home
ID: 61089 · Report as offensive
Dr Who Fan
Avatar

Send message
Joined: 10 May 07
Posts: 1444
United States
Message 61090 - Posted: 19 Mar 2015, 21:38:21 UTC - in response to Message 61088.  

WuProp is into the 4th hour of being down... appears to be another Database crash:
Server Status was last updated as of 19 Mar 2015, 17:38:40 UTC
Warning: mysqli::mysqli(): (08004/1040): Too many connections in /home/sebastien/projects/wuproj/html/inc/db_conn.inc on line 39

Warning: mysqli::query(): Couldn't fetch mysqli in /home/sebastien/projects/wuproj/html/inc/db_conn.inc on line 63
Database Error

Warning: mysqli::escape_string(): Couldn't fetch mysqli in /home/sebastien/projects/wuproj/html/inc/db_conn.inc on line 262
ID: 61090 · Report as offensive
Dr Who Fan
Avatar

Send message
Joined: 10 May 07
Posts: 1444
United States
Message 61093 - Posted: 19 Mar 2015, 23:05:23 UTC - in response to Message 61090.  

WuProp is BACK UP as of 19 Mar 2015, 22:51:04 UTC
ID: 61093 · Report as offensive
ProfileBlurf

Send message
Joined: 18 Jul 11
Posts: 217
United States
Message 61100 - Posted: 20 Mar 2015, 13:38:14 UTC

GPUGRID: Didn't need to do the workaround-my files uploaded but wouldn't report.

3/20/2015 9:29:33 AM Project communication failed: attempting access to reference site
3/20/2015 9:29:34 AM Internet access OK - project servers may be temporarily down.
ID: 61100 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5129
United Kingdom
Message 61104 - Posted: 20 Mar 2015, 14:41:14 UTC - in response to Message 61100.  

It might still be worth doing. Prior to my testing the workround, my older computers failed both to contact the server to be allocated new work (which would include reporting old work), and to upload data files for completed work. With the workround in place, both functions are operating normally - a task is uploading as I type, and will probably report before I've finished.

[especially since I just broke off to report a bug here]

Back to GPUGrid - it's a recent change which has caused the problem, and it may still be a work-in-progress on the server: be prepared to the situation to be somewhat fluid.

Yes, automatic report complete - though it was from one of my newer BOINC installations.
ID: 61104 · Report as offensive
ProfileTigers Dave

Send message
Joined: 24 Dec 05
Posts: 52
United States
Message 61116 - Posted: 21 Mar 2015, 16:39:30 UTC
Last modified: 21 Mar 2015, 16:40:01 UTC

Collatz came back up on 3/16, but went back off on 3/21. No big deal, as I have 4-6 days of C@H tasks remaining in the queues.
ID: 61116 · Report as offensive
Previous · 1 . . . 16 · 17 · 18 · 19 · 20 · 21 · Next

Message boards : Projects : News on Project Outages

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.