Thread 'News on Project Outages'

Yavanius
Joined: 19 May 15
Posts: 123
Antarctica
Message 111270 - Posted: 12 Mar 2023, 21:03:11 UTC - in response to Message 111215.  

DENIS is currently telling me it has no work available.


(Wonders if anybody ever reads anything on the projects or just connects blindly...)


DENIS is releasing work in large batches as they fine-tune their models. They just finished the last batch and posted the results to News.


Ironically, it's one main researcher who is overseeing the project, and he is a professor at the University (there seems to be a team in the background analyzing things, though). He posts and communicates more than the whole Krembil team... I do wonder whether the communications intern doesn't know what to post or they aren't letting her post.

Someday I'll see in an interview: I was a communications intern at Krembil, but they never wanted to let me post updates about failures occurring at the project...
ID: 111270
Yavanius
Joined: 19 May 15
Posts: 123
Antarctica
Message 111271 - Posted: 12 Mar 2023, 21:06:57 UTC - in response to Message 111261.  
Last modified: 12 Mar 2023, 21:20:57 UTC

Asteroids@home is back online.


Asteroids@home periodically runs out of work. They came back online only recently, after a hiatus of a few years that began when their old hardware bit the dust. It's one person who is running the project, probably on a shoestring budget. I'm sure he'd be ecstatic if he got even a rounding error of the budget LHC has. ^_^
ID: 111271
Steven Gaber
Joined: 28 Jun 20
Posts: 69
United States
Message 111272 - Posted: 13 Mar 2023, 2:06:12 UTC - in response to Message 111261.  
Last modified: 13 Mar 2023, 2:07:50 UTC

Asteroids@home is back online.


Maybe.

But I have tasks stuck in uploading, can't access my account, and can't get to their message boards or the Home Page.

S. Gaber
Oldsmar, FL
ID: 111272
Dr Who Fan
Joined: 10 May 07
Posts: 1450
United States
Message 111273 - Posted: 13 Mar 2023, 3:26:54 UTC - in response to Message 111272.  

Since the new certificate was installed, I have not had any problems accessing the website from the DFW metro area in Texas, nor any problems with server access to send or receive tasks.

Restart your web browser and/or empty the browser cache to clean out old information it might contain. Then try accessing the forums.

For stuck tasks in BOINC, go to the Transfers tab in Advanced view, select 4 to 6 Asteroids tasks at a time, and retry the upload. Repeat until all of them have transferred successfully.
ID: 111273
Steven Gaber
Joined: 28 Jun 20
Posts: 69
United States
Message 111274 - Posted: 13 Mar 2023, 6:28:35 UTC - in response to Message 111273.  

Still getting downloads from Universe. But all 26 of my tasks in Transfers say "Upload pending: project backoff."
ID: 111274
Dave
Help desk expert
Joined: 28 Jun 10
Posts: 2718
United Kingdom
Message 111276 - Posted: 13 Mar 2023, 9:07:47 UTC

(Wonders if anybody ever reads anything on the projects or just connects blindly...)
I did have a look at their forums but obviously not carefully enough!
ID: 111276
Dr Who Fan
Joined: 10 May 07
Posts: 1450
United States
Message 111279 - Posted: 13 Mar 2023, 16:19:33 UTC - in response to Message 111259.  

It has been nearly 2 weeks since WCG crashed & burned into the ether.

Another Monday is half gone, and there is nothing but crickets from Krembil about what, if anything, is happening with the RAID storage failure at WCG.

WCG Facebook page: https://facebook.com/197379135651/
ID: 111279
Grumpy Swede
Joined: 30 Mar 20
Posts: 423
Sweden
Message 111281 - Posted: 13 Mar 2023, 18:44:04 UTC
Last modified: 13 Mar 2023, 18:44:42 UTC

Yup, the last update from WCG, according to the timestamp of the tweet on Twitter, was March 10 at 19:19 UTC.
Now, it's March 13, 18:44 UTC.
ID: 111281
Grumpy Swede
Joined: 30 Mar 20
Posts: 423
Sweden
Message 111283 - Posted: 13 Mar 2023, 21:30:06 UTC
Last modified: 13 Mar 2023, 21:43:33 UTC

New WCG Update, 20 minutes ago:

"The web pages and forums are back online, but the recovery process continues.
As a result, performance is slower than usual, and not all functionality is there.
Until we can restart the science database and BOINC, stats/contributions are not
accurate. We will provide further updates as we progress. Thank you for your patience."


Edit, added: Well, to say that the website is back was going a bit too far. It is not possible to log in: any attempt gets "System Error" or "503 Service Unavailable" in response.
ID: 111283
Phillip Spencer
Joined: 3 Mar 23
Posts: 10
France
Message 111288 - Posted: 14 Mar 2023, 9:28:35 UTC
Last modified: 14 Mar 2023, 9:29:04 UTC

WCG website back and forums working (but, sadly, no official communication update yesterday)
ID: 111288
Phillip Spencer
Joined: 3 Mar 23
Posts: 10
France
Message 111289 - Posted: 14 Mar 2023, 13:47:02 UTC - in response to Message 111288.  

WCG website back and forums working (but, sadly, no official communication update yesterday)

It looks like I spoke too soon. Website and forums down once more with the "System Error" message again. This does not bode well for the overall recovery.
ID: 111289
Warped
Joined: 25 Aug 08
Posts: 40
South Africa
Message 111290 - Posted: 14 Mar 2023, 13:50:24 UTC - in response to Message 111288.  

Our website is currently down and we are looking into the root cause and a method to fix it. We will post a follow-up when it has been resolved.

The latest news on WCG from their Twitter feed. Seems to be going backwards!
ID: 111290
Grumpy Swede
Joined: 30 Mar 20
Posts: 423
Sweden
Message 111291 - Posted: 14 Mar 2023, 13:52:50 UTC
Last modified: 14 Mar 2023, 13:54:29 UTC

New WCG Update, on FB and Twitter, 15 minutes ago:

Our website is currently down and we are looking into the root cause and a method to fix it.
We will post a follow-up when it has been resolved.
ID: 111291
Grumpy Swede
Joined: 30 Mar 20
Posts: 423
Sweden
Message 111292 - Posted: 14 Mar 2023, 15:36:26 UTC
Last modified: 14 Mar 2023, 15:53:08 UTC

First the problem was with the RAID card. Then a card borrowed from the data centre was installed, and they managed to rebuild the RAID array successfully.
That didn't help, so now they say the problem was the PCI bus. (How could they successfully rebuild the RAID array with a broken PCI bus?)

So the data centre installed another storage system (DSS 7000), and the RAID array was rebuilt again: "The 'new' system did recognize the data hardware RAIDs. All have been rebuilt, and the data center is attempting to repair the OS drives/RAID." Later on: "The storage server was revived yesterday late afternoon. Both database filesystems mounted as before, but the science filesystem did not. It needs a repair; erasing the old log first." So yesterday the website came back, but then took a dive again some hours later. BOINC is still MIA, of course.

I think they are chasing ghosts and looking in the wrong direction. As said before: how could they successfully rebuild the RAID array the first time (after changing only the RAID card) with a broken PCI bus?
ID: 111292
Grumpy Swede
Joined: 30 Mar 20
Posts: 423
Sweden
Message 111293 - Posted: 14 Mar 2023, 16:55:24 UTC
Last modified: 14 Mar 2023, 16:59:45 UTC

New WCG Update, on FB and Twitter. 30 minutes ago:

Update: The system error has been resolved and all users should regain access to the website. Thank you for your patience.

I doubt the website will stay up for long. Still no BOINC...
ID: 111293
Grumpy Swede
Joined: 30 Mar 20
Posts: 423
Sweden
Message 111294 - Posted: 14 Mar 2023, 18:38:15 UTC
Last modified: 14 Mar 2023, 18:40:11 UTC

New WCG Update 5 minutes ago:

"The website has been restarted and we are working on rebuilding the science database so BOINC can restart soon.
Read more here:"
https://worldcommunitygrid.org/about_us/article.s?articleId=780
ID: 111294
Dr Who Fan
Joined: 10 May 07
Posts: 1450
United States
Message 111295 - Posted: 14 Mar 2023, 22:19:14 UTC - in response to Message 111294.  

Best quote from WCG/Krembil's sob story today:
We are immensely grateful for the positivity that we received during the process.


They ask that users post all "recovery" comments in this WCG forum.

I see someone from Krembil posted this message in the above linked forum:
... once we recover, backup and restart - we will start moving to a new storage system (with warranty). We will be able to use it for some months to come. That should give us improved performance (SSD drives) but also more reliability
igor
ID: 111295
JeromeC
Joined: 13 Oct 10
Posts: 120
France
Message 111296 - Posted: 15 Mar 2023, 10:00:17 UTC - in response to Message 111295.  

I'm not sure the positivity they mention is mainly coming from this topic ;)
ID: 111296
Dr Who Fan
Joined: 10 May 07
Posts: 1450
United States
Message 111299 - Posted: 15 Mar 2023, 22:01:58 UTC
Last modified: 15 Mar 2023, 22:02:50 UTC

Another day just about gone and still no WCG BOINC upload or download.

It is now over TWO WEEKS since the crash, and Krembil has posted nothing today about restoration status. I checked Twitter, Facebook and the project forum.
ID: 111299
Grumpy Swede
Joined: 30 Mar 20
Posts: 423
Sweden
Message 111306 - Posted: 16 Mar 2023, 11:03:45 UTC
Last modified: 16 Mar 2023, 11:05:40 UTC

WCG website down again. No surprise there......
WCG BOINC still dead as a doornail.
ID: 111306

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.