Thread 'World Community Grid has announced an extended outage from Feb 14 to April 22, 2022'

Message boards : Projects : World Community Grid has announced an extended outage from Feb 14 to April 22, 2022
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 10 · Next

AuthorMessage
Sir LanDroid

Send message
Joined: 7 Apr 13
Posts: 64
United States
Message 108975 - Posted: 6 Jul 2022, 3:51:21 UTC

World Community Grid: Notice from BOINC
This project seems to have changed its URL. When convenient, remove the project, then add https://master.worldcommunitygrid.org/
7/5/2022 11:49:12 PM

Still no work, but get set up for it...
ID: 108975 · Report as offensive
Bryn Mawr
Help desk expert

Send message
Joined: 31 Dec 18
Posts: 296
United Kingdom
Message 108976 - Posted: 6 Jul 2022, 8:30:03 UTC - in response to Message 108975.  

World Community Grid: Notice from BOINC
This project seems to have changed its URL. When convenient, remove the project, then add https://master.worldcommunitygrid.org/
7/5/2022 11:49:12 PM

Still no work, but get set up for it...


That message appears to be spurious, there is no need to change URL.

Good news is that there’s been a lot of activity over the past day trying to get the server on-line and some files have been sent out.
ID: 108976 · Report as offensive
ProfileDave
Help desk expert

Send message
Joined: 28 Jun 10
Posts: 2701
United Kingdom
Message 109011 - Posted: 6 Jul 2022, 19:01:14 UTC

Just seen on BOINC processing for Science Facebook page a post about getting first tasks from WCG since outage.
ID: 109011 · Report as offensive
Grumpy Swede
Avatar

Send message
Joined: 30 Mar 20
Posts: 419
Sweden
Message 109016 - Posted: 6 Jul 2022, 20:01:39 UTC - in response to Message 109011.  
Last modified: 6 Jul 2022, 20:03:11 UTC

Just seen on BOINC processing for Science Facebook page a post about getting first tasks from WCG since outage.

Still only a test. Full restart will take some time, probably several weeks.
There are still tons of unresolved issues to fix.
This migration became more of a total rebuild than a migration.
ID: 109016 · Report as offensive
Sir LanDroid

Send message
Joined: 7 Apr 13
Posts: 64
United States
Message 109204 - Posted: 13 Jul 2022, 16:01:33 UTC
Last modified: 13 Jul 2022, 16:08:53 UTC

Dear volunteers,

Over the past week, we entered a new stage of our testing phase where we gave a few work units to our volunteers to crunch. The goal was to see if any unforeseen errors occurred, which we were able to find thanks to the feedback of our volunteers. For specific questions you may have on the limited work units, check this thread on our forums.

Here’s a quick update from our tech team:

We have been working to assess how well the system responds to load and whether configuration is correct. As many have noted on the forums, there are some issues with the BOINC backend configuration. Users will have noticed off-putting notices in the BOINC manager about incorrect project URLs and attaching twice to the same project. We are aware of these issues and nearing a resolution.

In addition, we are working to resolve the lack of synchronization between the website profile page and the BOINC manager. We are also working to fix a blocking issue with the GPU work units for Open Pandemics. Once this is resolved, we should be able to increase the quantity of work units we can send out to volunteers.

Our backlog of non-blocking issues has continued to grow as we observe load on the system, but overall we are nearing a stable state of affairs and look forward to updating users with statistics and more details as we continue the roll out of our production system.

Thanks for understanding and stay tuned for more updates, coming soon!

Sincerely,
The World Community Grid team

7/13/22
https://www.worldcommunitygrid.org/about_us/article.s?articleId=773
ID: 109204 · Report as offensive
Sir LanDroid

Send message
Joined: 7 Apr 13
Posts: 64
United States
Message 109354 - Posted: 18 Jul 2022, 20:30:12 UTC
Last modified: 18 Jul 2022, 20:46:51 UTC

I just snagged a couple of Africa Rainfall Project work units. Anyone else gittin' 'em sum WCG? 😉
ID: 109354 · Report as offensive
Dr Who Fan
Avatar

Send message
Joined: 10 May 07
Posts: 1443
United States
Message 109356 - Posted: 18 Jul 2022, 21:06:15 UTC - in response to Message 109354.  

Received & processed about 10 or so OPN (Open Pandemics) and one or two ARP (African Rainfall) last week. All completed without errors on my end and most validated except a few OPN tasks.

WCG sill a long way from being 100%. I suspect that it's many months before we can call it 'back to normal'.
ID: 109356 · Report as offensive
ProfileCthulhu

Send message
Joined: 18 Nov 20
Posts: 14
Message 109359 - Posted: 18 Jul 2022, 23:32:09 UTC - in response to Message 108680.  

They're at least two weeks away from a fix.
ID: 109359 · Report as offensive
Grumpy Swede
Avatar

Send message
Joined: 30 Mar 20
Posts: 419
Sweden
Message 109435 - Posted: 25 Jul 2022, 22:57:34 UTC
Last modified: 25 Jul 2022, 23:03:37 UTC

WCG fell off the Internet totally it seems.

downforeveryoneorjustme.com says:

Is Worldcommunitygrid.org down?
Checking if worldcommunitygrid.org is down or it is just you...
It's not just you! worldcommunitygrid.org is down.

BOINC says:

2696 World Community Grid 2022-07-26 00:45:48 Sending scheduler request: Requested by user.
2697 World Community Grid 2022-07-26 00:45:48 Requesting new tasks for CPU
2698 World Community Grid 2022-07-26 00:45:50 Scheduler request failed: Couldn't resolve host name
2699 2022-07-26 00:45:51 Project communication failed: attempting access to reference site
2700 2022-07-26 00:45:53 Internet access OK - project servers may be temporarily down.
ID: 109435 · Report as offensive
Grumpy Swede
Avatar

Send message
Joined: 30 Mar 20
Posts: 419
Sweden
Message 109436 - Posted: 26 Jul 2022, 11:03:08 UTC
Last modified: 26 Jul 2022, 11:07:04 UTC

Still totally down here. Both the website, and the BOINC part of WCG. No updates on Facebook or Twitter at all.
They really have to work on their communication skills. It's not OK to leave thousands of volunteers without any
information at all, when things like this happens.

I'm beginning to seriously doubt if Jurisica Lab, really will be able to make WCG work. Krembil doesn't seem to
help them much at all. I've been told by a person within WCG, "We run from Krembil - but the institution is not
running it (they did help with the legal discussions with IBM during transfer)."


So Jurisica Lab, seems to be on their own running the project, and by the looks of things they do not have enough resources,
or any support from Krembil Research Institute. They are using sharcnet.ca as their Internet and cloud provider, but Krembil as such,
is not involved in running the project at all.
ID: 109436 · Report as offensive
Dr Who Fan
Avatar

Send message
Joined: 10 May 07
Posts: 1443
United States
Message 109438 - Posted: 26 Jul 2022, 21:16:08 UTC - in response to Message 109436.  

WCG web site is back after some "complications" that knocked them off the ether.
ID: 109438 · Report as offensive
Sir LanDroid

Send message
Joined: 7 Apr 13
Posts: 64
United States
Message 109467 - Posted: 31 Jul 2022, 0:45:36 UTC
Last modified: 31 Jul 2022, 0:48:07 UTC

I had not heard of Jurisica Lab before. Trying to make sense of this, I read the "Who Are We?" section at the following link several times.
https://www.worldcommunitygrid.org/about/about.s

Here's what I get out of that, let us know what's off base...

  • IBM transferred WCG assets to Krembil Research Institute, part of the Canadian University Health Network (UHN).
  • UHN has Canada’s largest hospital-based research program, comprising four major teaching hospitals (Toronto Western Hospital, Toronto General Hospital, Princess Margaret Cancer Centre, Toronto Rehabilitation Institute, and The Michener Institute of Education).
  • Jurisica Lab is connected to the Toronto Western Hospital and the hospital’s research arm, the Krembil Research Institute; a non-profit academic biomedical research institute.
  • Therefore WCG moved from IBM to two departments within one hospital in Canada.
  • Jurisca Lab appears to be associated mostly with the Mapping Cancer Markers Project. Unclear how they work with Krembil or how both operate within all the competing priorities of a teaching hospital.
  • Grumpy Swede indicates Jurisica Lab may be on their own...
  • Conclusion: Not surprising we are seeing many problems...

ID: 109467 · Report as offensive
Grumpy Swede
Avatar

Send message
Joined: 30 Mar 20
Posts: 419
Sweden
Message 109498 - Posted: 5 Aug 2022, 10:15:28 UTC

Maybe useless stats:
According to BoincStats, only 712 WCG volunteers had any credits for the last day.
That tells a lot about how little work that is sent out during this test period.
So, the chance of getting any tasks at all, is slim to none, at the moment.
Unless you have an entire computer farm of course.
ID: 109498 · Report as offensive
ProfileDave
Help desk expert

Send message
Joined: 28 Jun 10
Posts: 2701
United Kingdom
Message 109502 - Posted: 5 Aug 2022, 13:41:39 UTC - in response to Message 109498.  

Maybe useless stats:
According to BoincStats, only 712 WCG volunteers had any credits for the last day.
That tells a lot about how little work that is sent out during this test period.
So, the chance of getting any tasks at all, is slim to none, at the moment.
Unless you have an entire computer farm of course.


I suspect useless stats. I have been able to get ARP work from WCG without issue.
ID: 109502 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5129
United Kingdom
Message 109506 - Posted: 5 Aug 2022, 15:10:25 UTC - in response to Message 109502.  

I haven't been getting any for GPUs. I'm thinking of mothballing the GPUs at the end of next month, when the next electricity price hike hits in the UK. That'll free a lot of cores for CPU work, so I might have another look then.
ID: 109506 · Report as offensive
Grumpy Swede
Avatar

Send message
Joined: 30 Mar 20
Posts: 419
Sweden
Message 109507 - Posted: 5 Aug 2022, 15:21:23 UTC - in response to Message 109506.  
Last modified: 5 Aug 2022, 15:34:13 UTC

No OPNG tasks here either, only OPN1, even though Cyclops posted yesterday:
"We'll likely send out more workunits now that we fixed a minor issue with the OPNG workunits which might be elaborated upon in a future forum post."
https://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,44139_offset,120#674689

So, I guess the OPNG issue isn't fixed yet after all.

Edit: And when it comes to the high electricity cost nowadays, I have ordered one of these. I'll be able to sell electricity to all my neighbours too.
Problem solved, no more electricity bills
Yeah, sure I have ordered it :-)

ID: 109507 · Report as offensive
Sir LanDroid

Send message
Joined: 7 Apr 13
Posts: 64
United States
Message 109581 - Posted: 11 Aug 2022, 15:33:04 UTC
Last modified: 11 Aug 2022, 15:33:20 UTC

Here's a list of resolved and unresolved issues dated 8/9/22.

https://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,44246
ID: 109581 · Report as offensive
Sir LanDroid

Send message
Joined: 7 Apr 13
Posts: 64
United States
Message 109709 - Posted: 27 Aug 2022, 14:45:59 UTC

Excerpts from an update posted early this morning.

We have taken additional measures to increase the quantity of WUs we can send out, and we have been able to increase the quantity of WUs in flight at any given time. Volunteers should see this reflected on their devices now, and perhaps even over this past week.

We are also relieved to share that the hosting data centre has assigned additional personnel on site to resolve our networking issues, meaning a fix is imminent. We will share with you any further updates we receive from the data centre. The network fix will allow us to bring our remaining servers online, stabilizing and further increasing the WU supply. Thus, until we are able to deploy all dedicated servers, we must continuously adjust and monitor tasks scheduled in Aurora/Mesos to keep the tasks balanced and the workunits flowing, and so far this process is unduly intensive and sporadic.

...Last week, we mentioned that we have begun to investigate concerns over statistics, credit, streaks, and database dumps raised by volunteers. We will have an update on some of these issues next week. We also plan to release a more structured breakdown from the tech team similar to a CHANGELOG starting next week or the week after so that we can increase the frequency and clarity of updates.

https://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,44273

[/quote]
ID: 109709 · Report as offensive
Richard Haselgrove
Volunteer tester
Help desk expert

Send message
Joined: 5 Oct 06
Posts: 5129
United Kingdom
Message 109710 - Posted: 27 Aug 2022, 15:26:34 UTC

I have certainly seen a vastly increased supply of WUs in the last 30 hours, but the network congestion - especially on task data file downloads - is still an inhibiting factor.
ID: 109710 · Report as offensive
ProfileDave
Help desk expert

Send message
Joined: 28 Jun 10
Posts: 2701
United Kingdom
Message 109765 - Posted: 7 Sep 2022, 8:00:06 UTC - in response to Message 109710.  

I have certainly seen a vastly increased supply of WUs in the last 30 hours, but the network congestion - especially on task data file downloads - is still an inhibiting factor.


Yes. even on my bored band connection, downloads very slow compared with other projects. Repeated clicking of, "retry now" has after the best part of two and a half hours on and off just got four ARP tasks running. One file for a fifth is still being stubborn and refusing to download. I don't know if this is related but the tasks didn't show up in the Tasks tab till quite a few files had downloaded whereas with other projects they seem to appear at once. Is that just because I don't see the initial small files downloading when they appear at once?
ID: 109765 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 . . . 10 · Next

Message boards : Projects : World Community Grid has announced an extended outage from Feb 14 to April 22, 2022

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.