Thread 'News on Project Outages'

Message boards : Projects : News on Project Outages
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 13 · 14 · 15 · 16 · 17 · 18 · 19 . . . 21 · Next

AuthorMessage
ProfileTigers Dave

Send message
Joined: 24 Dec 05
Posts: 52
United States
Message 59786 - Posted: 18 Jan 2015, 1:38:43 UTC - in response to Message 59711.  

BarryAZ, I am an old-timer, too - I began crunching for Seti in 2000, long before Berkeley developed the BOINC framework. I still crunch a small amount for Seti, but have shifted focus to other projects; it became too burdensome to troubleshoot the Seti optimized clients, particularly since the Mac OS is not a focal point for Seti. So, once, GPU apps were released by Einstein and Collatz, I shifted a lot of my effort there.
ID: 59786 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 59838 - Posted: 19 Jan 2015, 20:52:07 UTC - in response to Message 59786.  

I'd note that Collatz is still offline. It as almost he stopped by two weeks ago, said, 'oh heck, I'll give more shot with a server restart' -- and left.

He might simply be getting on with life. 6 years is a long run for a single operator project.

At one point I was running ATI 4550 and 4650's. I still have one workstation reporting work (on moo) that way.

Over time I've cycled out the older cards -- first to ATI 4850's and then to ATI 6750's and 7850's.

These days, when I replace an ATI 4850 it typically has been with a GTX 750ti.

Those systems are running mostly GPUGrid.
ID: 59838 · Report as offensive
ProfileTigers Dave

Send message
Joined: 24 Dec 05
Posts: 52
United States
Message 59839 - Posted: 19 Jan 2015, 21:05:40 UTC - in response to Message 59838.  

Again, it would be a shame if Slicker pulled the plug on the project. Once we get passed the Martin Luther King Jr holiday, I will look to see if Slicker gives us some sign that the project is coming back online.
ID: 59839 · Report as offensive
Thyme Lawn

Send message
Joined: 2 Sep 05
Posts: 103
United Kingdom
Message 59891 - Posted: 21 Jan 2015, 13:43:21 UTC
Last modified: 21 Jan 2015, 13:44:06 UTC

WUProp@home is currently down.

The web site pages say that "The project's database server is down" and (unsurprisingly) scheduler requests are failing:
21/01/2015 12:39:25 | WUProp@Home | Reporting 1 completed tasks, requesting new tasks for CPU
21/01/2015 12:39:27 | WUProp@Home | Scheduler request completed: got 0 new tasks
21/01/2015 12:39:27 | WUProp@Home | Server error: feeder not running
ID: 59891 · Report as offensive
Stick

Send message
Joined: 10 Oct 09
Posts: 34
United States
Message 59896 - Posted: 21 Jan 2015, 18:17:23 UTC
Last modified: 21 Jan 2015, 18:22:27 UTC

Collatz was up for a short time last night but is down again. There was a very short note about the problem on the home page which I will try to paraphrase here. Unfortunately, I did not read it as carefully as I now wish.

Basically, it has been a recurring problem with DB indexing that is somehow related to the nightly back-up process. They rebuild the indexes and attempt to replicate the problem, but haven't been able to. Then the problem comes back later and crashes the server again. In other words, they are still scratching their heads.
ID: 59896 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 59897 - Posted: 21 Jan 2015, 18:50:05 UTC - in response to Message 59896.  

Stick -- note, it is not a 'they' Slicker is a one man shop there.



Collatz was up for a short time last night but is down again. There was a very short note about the problem on the home page which I will try to paraphrase here. Unfortunately, I did not read it as carefully as I now wish.

Basically, it has been a recurring problem with DB indexing that is somehow related to the nightly back-up process. They rebuild the indexes and attempt to replicate the problem, but haven't been able to. Then the problem comes back later and crashes the server again. In other words, they are still scratching their heads.
ID: 59897 · Report as offensive
Artem Vorotnikov

Send message
Joined: 19 Nov 14
Posts: 4
Russia
Message 59898 - Posted: 21 Jan 2015, 19:19:36 UTC - in response to Message 59897.  

And this basically boils down to:
"Open the source code and build a community or die"

Switched to Einstein.
ID: 59898 · Report as offensive
ProfileBlurf

Send message
Joined: 18 Jul 11
Posts: 217
United States
Message 59902 - Posted: 21 Jan 2015, 20:21:39 UTC

WuProp is up
ID: 59902 · Report as offensive
Stick

Send message
Joined: 10 Oct 09
Posts: 34
United States
Message 59903 - Posted: 21 Jan 2015, 20:33:30 UTC - in response to Message 59897.  

Stick -- note, it is not a 'they' Slicker is a one man shop there.

Thanks! I imagine he is a very busy man right now.

I've been a Collatz member for several years now and until recently it has run like clockwork. Hope he can figure things out!
ID: 59903 · Report as offensive
ProfileBlurf

Send message
Joined: 18 Jul 11
Posts: 217
United States
Message 59907 - Posted: 21 Jan 2015, 23:45:36 UTC

WuProp is down again
ID: 59907 · Report as offensive
ProfileBlurf

Send message
Joined: 18 Jul 11
Posts: 217
United States
Message 59910 - Posted: 22 Jan 2015, 7:10:00 UTC - in response to Message 59907.  

WuProp is down again


Back up
ID: 59910 · Report as offensive
ProfileTigers Dave

Send message
Joined: 24 Dec 05
Posts: 52
United States
Message 59925 - Posted: 22 Jan 2015, 22:32:51 UTC - in response to Message 59898.  
Last modified: 22 Jan 2015, 22:33:51 UTC

Collatz is back up again!
ID: 59925 · Report as offensive
BarryAZ

Send message
Joined: 4 Sep 09
Posts: 381
United States
Message 59962 - Posted: 26 Jan 2015, 0:53:47 UTC - in response to Message 59925.  

And back down -- I suspect he's got a hardware intermittent -- since his logs don't provide any enlightenment.


Collatz is back up again!
ID: 59962 · Report as offensive
Co
Avatar

Send message
Joined: 17 Apr 12
Posts: 13
Australia
Message 59963 - Posted: 26 Jan 2015, 1:01:47 UTC

LHC@home doesn't supply new scores to statistics servers for last 5 days.
ID: 59963 · Report as offensive
ProfileGary Charpentier
Avatar

Send message
Joined: 23 Feb 08
Posts: 2493
United States
Message 59981 - Posted: 27 Jan 2015, 1:57:57 UTC - in response to Message 59963.  

LHC@home doesn't supply new scores to statistics servers for last 5 days.

It could be worse http://setiweb.ssl.berkeley.edu/beta/forum_thread.php?id=2206
ID: 59981 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5129
United Kingdom
Message 59986 - Posted: 27 Jan 2015, 10:19:47 UTC

CPDN Scheduled downtime 29-30 January 2015

Jonathan Miller says:

I have to schedule some project downtime this week, 29 & 30 Jan 2015.

This is so that we can configure the underlying hardware to accept a tape backup system as part of the 'near-line' storage.

I will taking the opportunity to move the database backup to a different server, to make us more resilient in case of failure of the above hardware.

The downtime ought to be no more than a few hours on Thursday, but I have said that many times before!
ID: 59986 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5129
United Kingdom
Message 60004 - Posted: 27 Jan 2015, 18:28:56 UTC - in response to Message 59986.  

As in the CPDN thread:

The scheduled [CPDN] downtime has been pushed back a week and will now be on Thursday 5th and Friday 6th February.
ID: 60004 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5129
United Kingdom
Message 60025 - Posted: 28 Jan 2015, 15:05:07 UTC

Einstein@Home

"[Later] today we're going to shutdown the project for at most 1-2 hours to deploy important security fixes on our infrastructure. 28 Jan 2015"

Looks like the shutdown has already taken place, but they're having difficulty bringing it back up - all services are disabled or showing errors.
ID: 60025 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5129
United Kingdom
Message 60029 - Posted: 28 Jan 2015, 19:31:14 UTC - in response to Message 60025.  

Einstein@Home is back up - Oliver says:

I updated 19 servers and gateways in two data centers in the past seven hours, fixing the Ghost CVE, upgraded kernels, updated our master/slave databases and deployed pending package updates.
ID: 60029 · Report as offensive
boboviz
Help desk expert

Send message
Joined: 12 Feb 11
Posts: 419
Italy
Message 60038 - Posted: 29 Jan 2015, 13:10:02 UTC

CSG@Home is down
ID: 60038 · Report as offensive
Previous · 1 . . . 13 · 14 · 15 · 16 · 17 · 18 · 19 . . . 21 · Next

Message boards : Projects : News on Project Outages

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.