Advanced search

Message boards : Number crunching : GPUGRID active users are falling dramatically!

Author Message
Betting Slip
Send message
Joined: 5 Jan 09
Posts: 589
Credit: 2,044,904,025
RAC: 1,465,479
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47358 - Posted: 3 Jun 2017 | 9:24:46 UTC
Last modified: 3 Jun 2017 | 10:13:46 UTC

Why are active users falling on this project?

Lost over 33% of active users since April. That's bad and doesn't bode well for the future.

Only faster cards are holding flops/s up.

Time I think for this project to get proactive on how to attract and retain high end users, communicate more and stop taking users for granted.



Source: https://boincstats.com/en/stats/45/project/detail/user

I posted this over a year ago https://gpugrid.net/forum_thread.php?id=4304#43468

Jim1348
Send message
Joined: 28 Jul 12
Posts: 463
Credit: 1,131,150,255
RAC: 42,983
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwat
Message 47359 - Posted: 3 Jun 2017 | 15:55:19 UTC - in response to Message 47358.

Maybe the occasional very long work units are discouraging some people? I don't think the bonuses are that important, but it is a psychological thing. Also, the reduced output of the app under Windows may be a problem, though I am mainly on Linux now and am glad to have the work.

kain
Send message
Joined: 3 Sep 14
Posts: 110
Credit: 133,389,891
RAC: 82,677
Level
Cys
Scientific publications
watwatwatwat
Message 47361 - Posted: 3 Jun 2017 | 20:58:02 UTC

It is called summer in the north hemisphere. High temps = air conditioning and air conditioning doesn't work well with high end computers crunching for the better future...

Betting Slip
Send message
Joined: 5 Jan 09
Posts: 589
Credit: 2,044,904,025
RAC: 1,465,479
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47362 - Posted: 3 Jun 2017 | 21:45:22 UTC - in response to Message 47361.

It is called summer in the north hemisphere. High temps = air conditioning and air conditioning doesn't work well with high end computers crunching for the better future...


I have had ambient temperatures up to 32c in summer and still run computers without air conditioning. Are you trying to tell me that's responsible for a 33% drop in users that started in April?

I would like to believe you but I can't just yet.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 1844
Credit: 10,686,789,844
RAC: 10,151,546
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47363 - Posted: 4 Jun 2017 | 0:49:26 UTC - in response to Message 47362.
Last modified: 4 Jun 2017 | 0:50:59 UTC

What happened between 21st & 22nd of April? (actually a month before this date)
21. April: 3133 users
22. April: 2840 users
That's 293 less (~10%) in a day.
There's a "thing" called charityengine.
It actually installs BOINC manager and connects it to different projects (at least the last time I've encountered this it did. I did not have the guts to install it on my computers, as they are selling the computing power their users submit for them). I think there's a lot of users on every project thanks to charityengine, but I think these users could quit after a short period if they don't win a fortune.
Maybe it's not them, but another similar "thing".

Erich56
Send message
Joined: 1 Jan 15
Posts: 372
Credit: 1,686,904,002
RAC: 2,943,712
Level
His
Scientific publications
watwatwat
Message 47364 - Posted: 4 Jun 2017 | 6:20:05 UTC - in response to Message 47358.

Why are active users falling on this project?

Lost over 33% of active users since April. That's bad and doesn't bode well for the future.

Only faster cards are holding flops/s up.

Time I think for this project to get proactive on how to attract and retain high end users, communicate more and stop taking users for granted.


really surprised? I am not!

It all started out around mid-April when all of a sudden the crunching software became invalid and stopped working (which should never ever happen that way).

So, a new software needed to be put together, and since this had to be done in a hurry, it was rather buggy. Here just a few examples of what one could read in various threads of the forum, and what I myself have been experiencing since:

- the new software is around 30% slower :-(

- GPU overclocking is much less possible than before (at least for Maxwell cards; no idea how it is with Pascals).

- tasks stop for unknown reasons, and only continue if they are switched off (suspended) and switched on again manually (so, if a cruncher does not notice such a stop for say 10 hours, the system runs idle for 10 hours).

- the new software does not work well with BOINC: when pushing the "suspense" button in the BOINC manager (either in "Tasks" or in "Projects"), it takes several minutes until the task reacts and stops.

In the recent past, GPUGRID tasks have become even more GPU-straining and long-lasting; for example "ADRIA_FOLDGREED10_crystal_ss_contacts_100_ubiquitin" (also the _50_ubiquitin) - on a GTX750ti (with some unvoluntary stops inbetween, as mentioned above), it can take 3 days or more until this task gets finished.
That's why I had suggested that on the Project Preference page, besides "short runs" (which virtually don't exist any more) and "long runs", a third category like "extra long runs" (or whatever wording suits) is being implemented, so that the many GTX750Ti crunchers can exclude such long tasks from download.

And here we are at the next problem:
back at GPUGRID, no-one really seems to care which problems the crunchers have and which suggestens they are presenting.
Reading much in the forum, I can think of so many other people writing about all kinds of problems, making useful suggestions and also putting questions now and then. However: NO REACTION AT ALL !

So, coming back to the beginning of this posting: I am NOT surprised that people are turning away from this project. Sorry to say this :-(

By accident, in the forum of another BOINC project I participate, yesterday I read a statement from the project people there:
"Of course having happy volunteers is very important for the health of a project; so it is something that should be addressed ..."
Why is this different with GPUGRID?

randi
Send message
Joined: 9 Nov 16
Posts: 2
Credit: 2,319,600
RAC: 1,614
Level
Ala
Scientific publications
wat
Message 47367 - Posted: 5 Jun 2017 | 16:10:18 UTC
Last modified: 5 Jun 2017 | 16:11:07 UTC

I have been waiting a long time to get a task.

Normally I say no to long tasks because they take a VERY long time on my computer.
Recently I changed that to yes, but I am still not getting any tasks.

6/5/2017 12:07:49 | GPUGRID | update requested by user
6/5/2017 12:07:54 | GPUGRID | Sending scheduler request: Requested by user.
6/5/2017 12:07:54 | GPUGRID | Requesting new tasks for NVIDIA GPU
6/5/2017 12:07:56 | GPUGRID | Scheduler request completed: got 0 new tasks
6/5/2017 12:07:56 | GPUGRID | No tasks sent

Erich56
Send message
Joined: 1 Jan 15
Posts: 372
Credit: 1,686,904,002
RAC: 2,943,712
Level
His
Scientific publications
watwatwat
Message 47368 - Posted: 5 Jun 2017 | 16:30:02 UTC - in response to Message 47367.

I have been waiting a long time to get a task.

I guess this is kind of not quite the right thread to post your problem.

A lot of statements and opinions about the problem of not getting tasks are contained in this thread here:

http://gpugrid.net/forum_thread.php?id=4574

you may look this up, perhaps you get an idea what's wrong.

Erich56
Send message
Joined: 1 Jan 15
Posts: 372
Credit: 1,686,904,002
RAC: 2,943,712
Level
His
Scientific publications
watwatwat
Message 47404 - Posted: 12 Jun 2017 | 9:45:50 UTC - in response to Message 47364.

...
It all started out around mid-April when all of a sudden the crunching software became invalid and stopped working (which should never ever happen that way).

So, a new software needed to be put together, and since this had to be done in a hurry, it was rather buggy. Here just a few examples of what one could read in various threads of the forum, and what I myself have been experiencing since:

- the new software is around 30% slower :-(

- GPU overclocking is much less possible than before (at least for Maxwell cards; no idea how it is with Pascals).

- tasks stop for unknown reasons, and only continue if they are switched off (suspended) and switched on again manually (so, if a cruncher does not notice such a stop for say 10 hours, the system runs idle for 10 hours).

- the new software does not work well with BOINC: when pushing the "suspense" button in the BOINC manager (either in "Tasks" or in "Projects"), it takes several minutes until the task reacts and stops.

I am curious how much longer it will take the GPUGRID people to acknowledge that the current software is buggy and needs to be repaired!
More or less every day, I get annoyed by these bugs cited above :-(

Stefan
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 5 Mar 13
Posts: 258
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 47407 - Posted: 12 Jun 2017 | 10:07:17 UTC - in response to Message 47404.

Fixing bugs with BOINC is relatively pointless from our perspective (and time-intensive). We are considering rather other options like moving out of it, but don't ask when or how as it's more an idea than a scheduled plan.

I am sorry for those inconveniences that this causes.

The reason we cannot address technical issues with BOINC is that we don't have anyone in the lab anymore who knows his way around it and that priorities are higher on getting scientific work done. Of course you have a point that this will eventually bite us in the ass since we won't be able to do scientific work without crunchers but it's a tricky thing to manage.

Jacob Klein
Send message
Joined: 11 Oct 08
Posts: 1068
Credit: 1,151,203,264
RAC: 1,008,903
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47411 - Posted: 12 Jun 2017 | 13:39:13 UTC - in response to Message 47407.

Thanks for replying.

It is unclear to me, which bugs are actual BOINC bugs (if any?), which bugs are NVIDIA bugs (if any?), and which bugs are GPUGrid app bugs.

I hope you guys can get the resources you need to triage and solve the bugs. If you think any are BOINC or NVIDIA bugs, please let the community know, so we can (continue) to offer help in solving those. I have been asking for a while, with no response.

Jacob

Erich56
Send message
Joined: 1 Jan 15
Posts: 372
Credit: 1,686,904,002
RAC: 2,943,712
Level
His
Scientific publications
watwatwat
Message 47433 - Posted: 14 Jun 2017 | 13:50:53 UTC - in response to Message 47407.

Fixing bugs with BOINC is relatively pointless from our perspective (and time-intensive). We are considering rather other options like moving out of it, but don't ask when or how as it's more an idea than a scheduled plan.

I am sorry for those inconveniences that this causes.

The reason we cannot address technical issues with BOINC is that we don't have anyone in the lab anymore who knows his way around it and that priorities are higher on getting scientific work done. Of course you have a point that this will eventually bite us in the ass since we won't be able to do scientific work without crunchers but it's a tricky thing to manage.

Sorry Stefan for contradicting.
I don't think that any of the deficits in the crunching software 9.18 have to do with BOINC. So blaming BOINC, at least the way I see it, is simply wrong.

As said before, this software was obviously compiled in a hurry, overnight so to speak, without much (thorough) testing.
All the bugs had not existed with the previous software.

The content of the second paragraph of your postings makes me worry even more.
Again, as I said in another posting, a project of the magnitude of GPUGRID definitely needs a certain amount of infrastructure expertise. Just having the scientits there is not enough.

If, for example, no one at GPUGRID is able to reply to my posting
http://gpugrid.net/forum_thread.php?id=4561&nowrap=true#47204
from a month ago, then something needs to be improved. Definitely so.
Otherwise, GPUGRID really risks to loose more and more crunchers. Which would be too bad - I personally feel that GPUGRID is a fantastic project! And that's why I am participating :-)
So, please put your heads together to come up with a solution!

Jim1348
Send message
Joined: 28 Jul 12
Posts: 463
Credit: 1,131,150,255
RAC: 42,983
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwat
Message 47434 - Posted: 14 Jun 2017 | 14:20:21 UTC - in response to Message 47433.

I don't think that any of the deficits in the crunching software 9.18 have to do with BOINC. So blaming BOINC, at least the way I see it, is simply wrong.

FWIW, I have always advocated optimizing the apps for the latest hardware, since I think you get more bang for the crunching buck that way. If it leaves the older cards behind, so be it. You avoid precisely the type of problems that we are seeing here.

I usually have fairly new cards, and you will get a lot of complaints from people with older cards that they are being abandoned, or that they are being "forced" to buy new cards (I love that one).

So you have a choice. Make the one that is best for the science.

Erich56
Send message
Joined: 1 Jan 15
Posts: 372
Credit: 1,686,904,002
RAC: 2,943,712
Level
His
Scientific publications
watwatwat
Message 47435 - Posted: 14 Jun 2017 | 15:13:16 UTC - in response to Message 47434.
Last modified: 14 Jun 2017 | 15:50:18 UTC

If it leaves the older cards behind, so be it. You avoid precisely the type of problems that we are seeing here.

one thing that's interesing:

the GTX750Ti in the host with Windows10 now shows problems with the new software.
the GTX750Ti in the host with WindowsXP does NOT show any problems - although this software is also new, but not the same as for Windows10.

I guess there won't be many crunchers using WindowsXP; so, many of the crunchers using their GTX750Ti with Windows10 might have problems now. And I also guess that there are many crunchers with a GTX750Ti. What can be done: throw the GTX750Ti's away? :-(

Last year, I bought two GTX780Ti just for GPUGRID crunching, Euro 700 each. So far, they work perfectly with WindowsXP. When GPUGRID support will end in April of next year, I'll need to change to Windows10. And then all the problems will begin.
However, I don't think that I will exchange them for two new Pascals. Paying some 1400 Euros every two years just to have the latest generation of cards in order to have GPUGRID running smoothly?

Jim1348
Send message
Joined: 28 Jul 12
Posts: 463
Credit: 1,131,150,255
RAC: 42,983
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwat
Message 47436 - Posted: 14 Jun 2017 | 16:02:42 UTC - in response to Message 47435.
Last modified: 14 Jun 2017 | 16:22:56 UTC

the GTX750Ti in the host with Windows10 now shows problems with the new software.
the GTX750Ti in the host with WindowsXP does NOT show any problems - although this software is also new, but not the same as for Windows10.

I guess that says something about WDDM, but I don't know what. It would be fun to trace it down, but GPUGrid just does not have the staff it seems. That is why they have to avoid unnecessary risks if they can. It is not a perfect solution, but seems to be the best under the circumstances.

I was planning to wait for Volta, but that will be a long time, so I migrated out of the lower-end cards into a few Pascals for higher efficiency in the warmer months, though it is still a mix. The prices are much more reasonable in the U.S., especially on sales. But everything has gone through the roof now, apparently with high demand for AMD cards even spilling over into Nvidia.

mikey
Send message
Joined: 2 Jan 09
Posts: 269
Credit: 247,832,990
RAC: 31,701
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 47470 - Posted: 18 Jun 2017 | 11:17:29 UTC - in response to Message 47435.

If it leaves the older cards behind, so be it. You avoid precisely the type of problems that we are seeing here.

one thing that's interesing:

the GTX750Ti in the host with Windows10 now shows problems with the new software.
the GTX750Ti in the host with WindowsXP does NOT show any problems - although this software is also new, but not the same as for Windows10.

I guess there won't be many crunchers using WindowsXP; so, many of the crunchers using their GTX750Ti with Windows10 might have problems now. And I also guess that there are many crunchers with a GTX750Ti. What can be done: throw the GTX750Ti's away? :-(

Last year, I bought two GTX780Ti just for GPUGRID crunching, Euro 700 each. So far, they work perfectly with WindowsXP. When GPUGRID support will end in April of next year, I'll need to change to Windows10. And then all the problems will begin.
However, I don't think that I will exchange them for two new Pascals. Paying some 1400 Euros every two years just to have the latest generation of cards in order to have GPUGRID running smoothly?


On other projects some people have gone back to older drivers for their older gpu's and that brings back the gpu's under Win10 again. In short try older drivers and see if your Win10 machine can crunch again, it may just work for you too.

ChristianVirtual
Send message
Joined: 16 Aug 14
Posts: 13
Credit: 281,790,075
RAC: 18,821
Level
Asn
Scientific publications
watwat
Message 47471 - Posted: 18 Jun 2017 | 11:33:31 UTC
Last modified: 18 Jun 2017 | 11:55:59 UTC

For what it is worth: no issues with Linux / CentOS on my 980Ti,1080 and 1080Ti ... come over to the bright side of life ;-)

Update: opps, sorry, just saw the version difference ... kind of still learning the technical details here ...

Erich56
Send message
Joined: 1 Jan 15
Posts: 372
Credit: 1,686,904,002
RAC: 2,943,712
Level
His
Scientific publications
watwatwat
Message 47473 - Posted: 18 Jun 2017 | 16:13:12 UTC - in response to Message 47470.

On other projects some people have gone back to older drivers for their older gpu's and that brings back the gpu's under Win10 again. In short try older drivers and see if your Win10 machine can crunch again, it may just work for you too.

The new crunching software acemd 918.80 only works with the latest drivers.
My two Windows 10 machines had run with 376.53 before, and with the new crunching software I had to update to 381.65 to get GPUGRID run.
Furthermore, Matt was pointing out clearly that the new software requires the newest drivers.

In other words: no way to install older drivers for getting problems solved :-(

Profile Zarck
Send message
Joined: 16 Aug 08
Posts: 135
Credit: 238,817,093
RAC: 6,310
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47505 - Posted: 29 Jun 2017 | 11:59:28 UTC
Last modified: 29 Jun 2017 | 11:59:46 UTC

I stopped your project, for my part far too many units end up in error ... especially after 12 hours of calculations (Titan 2013) is it not possible to still have points for calculated time? More I have no problems with Asteroids, Folding, milkyway etc.

J'ai arrêté votre projet, pour ma part beaucoup trop d'unités finissent en erreur... surtout après 12 heures de calculs (Titan 2013) n'est-il pas possible d'avoir quand même des points pour le temps calculé ? de plus je n'ai pas de problèmes avec Asteroids, Folding, milkyway etc.

@+
*_*
____________

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 1844
Credit: 10,686,789,844
RAC: 10,151,546
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47506 - Posted: 29 Jun 2017 | 13:13:50 UTC - in response to Message 47505.

I stopped your project, for my part far too many units end up in error ...
Your GPU is too hot (88°C), that's the reason for the too many errors. (see your stderr.txt output I've attached at the end of this post.)
You should increase the cooling of your card: increase the airflow by increasing the RPM of the GPU's fan, or install extra fans in your PC case, or remove its side panel.
Alternatively you can reduce the clock speed (or the power target) of your card to decrease its power consumption (=its heat output).

especially after 12 hours of calculations (Titan 2013) is it not possible to still have points for calculated time?
No. A partial result is useless for GPUGrid, as it can't be used for generating the next step of the simulation, so it has to be calculated again (on another host).

More I have no problems with Asteroids, Folding, milkyway etc.
Those projects do not stress the GPU as much as GPUGrid does.

<core_client_version>7.6.33</core_client_version> <![CDATA[ <message> (unknown error) - exit code -55 (0xffffffc9) </message> <stderr_txt> # GPU [GeForce GTX TITAN] Platform [Windows] Rev [3212] VERSION [80] # SWAN Device 0 : # Name : GeForce GTX TITAN # ECC : Disabled # Global mem : 6144MB # Capability : 3.5 # PCI ID : 0000:02:00.0 # Device clock : 875MHz # Memory clock : 3004MHz # Memory width : 384bit # Driver version : r382_48 : 38253 # GPU 0 : 70C # GPU 0 : 77C # GPU 0 : 80C # GPU 0 : 81C # GPU 0 : 82C # GPU 0 : 83C # GPU 0 : 86C # GPU 0 : 87C # GPU [GeForce GTX TITAN] Platform [Windows] Rev [3212] VERSION [80] # SWAN Device 0 : # Name : GeForce GTX TITAN # ECC : Disabled # Global mem : 6144MB # Capability : 3.5 # PCI ID : 0000:02:00.0 # Device clock : 875MHz # Memory clock : 3004MHz # Memory width : 384bit # Driver version : r382_48 : 38253 # GPU 0 : 77C # GPU 0 : 80C # GPU 0 : 82C # GPU 0 : 83C # GPU 0 : 84C # GPU 0 : 87C # GPU 0 : 88C SWAN : FATAL : Cuda driver error 702 in file 'swanlibnv2.cpp' in line 1965. # SWAN swan_assert 0 </stderr_txt> ]]>

Profile Zarck
Send message
Joined: 16 Aug 08
Posts: 135
Credit: 238,817,093
RAC: 6,310
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47507 - Posted: 29 Jun 2017 | 14:03:47 UTC

Thank you for your reply, I am considering changing the case for a Corsair Obsidian 900D (15 fans), or waiting for the outside temperature to return to normal (30 ° presently).
____________

PappaLitto
Send message
Joined: 21 Mar 16
Posts: 271
Credit: 1,333,565,731
RAC: 5,244,364
Level
Met
Scientific publications
watwat
Message 47508 - Posted: 29 Jun 2017 | 15:04:54 UTC - in response to Message 47507.

Thank you for your reply, I am considering changing the case for a Corsair Obsidian 900D (15 fans), or waiting for the outside temperature to return to normal (30 ° presently).

The case with better airflow will do a lot more than a few degree ambient celcius change

Jim1348
Send message
Joined: 28 Jul 12
Posts: 463
Credit: 1,131,150,255
RAC: 42,983
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwat
Message 47509 - Posted: 29 Jun 2017 | 16:46:58 UTC

All good advice. It is possible to run them without error.
http://www.gpugrid.net/results.php?hostid=358185
http://www.gpugrid.net/results.php?hostid=375823
http://www.gpugrid.net/results.php?hostid=428242

Profile Zarck
Send message
Joined: 16 Aug 08
Posts: 135
Credit: 238,817,093
RAC: 6,310
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47519 - Posted: 1 Jul 2017 | 16:22:16 UTC
Last modified: 1 Jul 2017 | 16:22:28 UTC

The outside temperature dropped by 10 ° Celsius and I managed to finish a Gpugrid unit.

But your project to another problem, within 5 hours the GpuGrid unit that is being computed on my machine will be terminated, and the GpuGrid server is empty.

As I prefer medical research projects, I will move on to Folding, for an indeterminate period of time.

Maybe I'll come back to your project. But how many users switch to another project, and do not come back to yours?
____________

Erich56
Send message
Joined: 1 Jan 15
Posts: 372
Credit: 1,686,904,002
RAC: 2,943,712
Level
His
Scientific publications
watwatwat
Message 47531 - Posted: 5 Jul 2017 | 9:07:31 UTC - in response to Message 47519.

As I prefer medical research projects, I will move on to Folding ...

This I did several days ago with one of my host - the PC with Windows10 and a GTX750Ti inside.

As I wrote in some other postings in the forum, the problem was that since the new crunching software (acmed 918.80) was implemented beginning of April, GPUGRID crunching on this host did not work well any longer.
Probably for several reasons:

- the crunching software is buggy: one in a while, crunching stops, and can only be resumed by manually pausing and then resuming the task (if the task stops in the late evening or during the night, the host runs idle till next morning).

- overclocking is a lot less possible than before. In fact, with the GTX750Ti I even had to underclock - and still crunching stopped once in a while.

- this happens very often with one of the newer, extremely demanding WU's like
ADRIA_FOLDGREED50_crystal_ss_contacts_100_ubiquitin_ (and some others)
Crunching such a WU, with numerous interruptions inbetween, takes up to 3 days, which does not make a lot of sense.

Hence, some time ago, I suggested that a third tier of tasks (like "extra longruns" or so) be introduced in the personal settings page, so that people with cards like a GTX750Ti can exclude such heavy tasks from being downlaoaded.
I also suggested that the buggy software (which had to be put together in a hurry, and was probably optimized for Pascal cards) be improved.
So far, there was NO REACTION AT ALL from the GPUGRID people - no comment, nothing.
As I and others here stated several times: they don't listen to their crunchers; they are not interested in receiving any useful suggestions, in learning about problems the crunchers may face, etc. - which is really too bad. I am wondering how this project will survive longterm.

So, I was simply curious, and with this host (Windows10 and GTX750Ti) I changed to Folding@Home. Just to see whether the same problems show up there as well.
They die NOT! Crunching works smoothly, no interruptions, nothing ...
And questions to the Folding@Home team are being answered within half a day or shorter (BTW - by reading in their forums, I realized that quite a number of people there had changed from GPUGRID to Folding@Home in the past weeks and months - I guess, they wouldn't have done that without reason).

GPUGRID crunching still runs well on my two Windows XP hosts with two GTX980Ti and one GTX750Ti - most likely as the XP crunching software (acemd 849.65) did not need to be optimized for Pascal cards (they don't work with XP).
However, it was already officially announced that this crunching software will only be available until April 2018. This date will then be the end of GPUGRID crunching on XP (which is too bad in a sense as crunching with XP is some 15-20% faster due to the non existing WDDM overhead).
So I am curious what will happen when with my two hosts which now run on XP I change to Windows10. Maybe there will exist an improved software by then. Or maybe not.
In any case, should I face all these problems I am having now with the Windows10 hosts, I can always change to Folding@Home.

At the end, I'd like to repeat what I posted before, after having read it in the forum of another BOINC project:
Of course have happy volunteers is very important for the health of a project so this is something that should be addressed

ChristianVirtual
Send message
Joined: 16 Aug 14
Posts: 13
Credit: 281,790,075
RAC: 18,821
Level
Asn
Scientific publications
watwat
Message 47532 - Posted: 5 Jul 2017 | 10:31:53 UTC
Last modified: 5 Jul 2017 | 10:32:36 UTC

I do both FAH and GPUgrid these days; but seems no more work needed here so can also switch all cards back to FAH (there are problems too; at times)

PS: keep in mind that FAH has an upcoming server maintenance day July 6/July 7; given bunkering is not possible you might have some time to cleanup the dust in the systems

Erich56
Send message
Joined: 1 Jan 15
Posts: 372
Credit: 1,686,904,002
RAC: 2,943,712
Level
His
Scientific publications
watwatwat
Message 47534 - Posted: 5 Jul 2017 | 12:36:47 UTC - in response to Message 47532.

... FAH (there are problems too; at times)

I guess any project has problems now and then.
However, it makes a difference whether they are being solved within short time or not at all.

I am using part of my CPUs for LHC@Home - and yes, they have (all kinds of) problems about once or even twice a week. However, these problems get solved within a few hours. And when bringing things up in their forum, the team reacts within a few hours.

Stefan
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 5 Mar 13
Posts: 258
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 47562 - Posted: 10 Jul 2017 | 9:47:08 UTC

We are trying to fix the issue with the WUs now

Jacob Klein
Send message
Joined: 11 Oct 08
Posts: 1068
Credit: 1,151,203,264
RAC: 1,008,903
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47563 - Posted: 10 Jul 2017 | 10:38:18 UTC - in response to Message 47562.
Last modified: 10 Jul 2017 | 10:39:12 UTC

We are trying to fix the issue with the WUs now

Thanks for replying. Which issue are you trying to fix?

Profile Logan Carr
Send message
Joined: 12 Aug 15
Posts: 207
Credit: 31,560,850
RAC: 91,597
Level
Val
Scientific publications
wat
Message 47568 - Posted: 10 Jul 2017 | 18:03:08 UTC - in response to Message 47563.
Last modified: 10 Jul 2017 | 18:04:31 UTC

Erich56,

Perhaps try linux. No wddm there!
____________
Cruncher/Learner in progress.

ChristianVirtual
Send message
Joined: 16 Aug 14
Posts: 13
Credit: 281,790,075
RAC: 18,821
Level
Asn
Scientific publications
watwat
Message 47569 - Posted: 10 Jul 2017 | 19:43:08 UTC

ahhh, the dry period is over; I got again assignments on my linux-based GPUs; thanks for fixing and giving my GPU meaningful work

Erich56
Send message
Joined: 1 Jan 15
Posts: 372
Credit: 1,686,904,002
RAC: 2,943,712
Level
His
Scientific publications
watwatwat
Message 47570 - Posted: 10 Jul 2017 | 19:45:50 UTC - in response to Message 47568.

Erich56,
Perhaps try linux. No wddm there!

but, as far as I understand, no SWAN_SYNC possible with Linux :-(

Profile Logan Carr
Send message
Joined: 12 Aug 15
Posts: 207
Credit: 31,560,850
RAC: 91,597
Level
Val
Scientific publications
wat
Message 47571 - Posted: 10 Jul 2017 | 21:06:31 UTC - in response to Message 47570.

Erich56,
Perhaps try linux. No wddm there!

but, as far as I understand, no SWAN_SYNC possible with Linux :-(



Yes that's true. You should try it after support for XP with gpugrid stops. Even with swan_sync on Vista, 7, 8 ,10, I notice it doesn't use as much of the gpu as Linux did for me. For windows 7 I personally get in the 80's WITH swan_sync. Linux, I get in the low to mid 90's. Percent I mean.
____________
Cruncher/Learner in progress.

mmonnin
Send message
Joined: 2 Jul 16
Posts: 47
Credit: 57,131,919
RAC: 1,422,030
Level
Thr
Scientific publications
wat
Message 47574 - Posted: 10 Jul 2017 | 22:38:02 UTC - in response to Message 47570.

Erich56,
Perhaps try linux. No wddm there!

but, as far as I understand, no SWAN_SYNC possible with Linux :-(


That's because it's not needed in Linux. ;)

liderbug
Send message
Joined: 29 Jul 16
Posts: 22
Credit: 57,680,162
RAC: 381
Level
Thr
Scientific publications
watwat
Message 47614 - Posted: 15 Jul 2017 | 15:39:28 UTC

Some time back (couple of months) I was processing WU's like crazy. I received notice that GPUGRD was moving to a new server - we'll be right back. I've never received another WU.
Linux, GTX770

tks

Betting Slip
Send message
Joined: 5 Jan 09
Posts: 589
Credit: 2,044,904,025
RAC: 1,465,479
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47615 - Posted: 15 Jul 2017 | 20:48:53 UTC - in response to Message 47614.
Last modified: 15 Jul 2017 | 20:49:24 UTC

Some time back (couple of months) I was processing WU's like crazy. I received notice that GPUGRD was moving to a new server - we'll be right back. I've never received another WU.
Linux, GTX770

tks


You need to upgrade to latest drivers and then you should get WU's

AgentB
Send message
Joined: 13 Aug 15
Posts: 2
Credit: 105,000
RAC: 0
Level

Scientific publications
wat
Message 47666 - Posted: 23 Jul 2017 | 15:23:18 UTC - in response to Message 47363.

What happened between 21st & 22nd of April? (actually a month before this date)


Apologies for being late to the party. Assuming boincstats are accurate (and noticing other projects are losing active hosts and users as well) the Win10 Creator's Edition update must have occurred around that time. That certainly broke some hosts.

Number continues to fall :(

Jacob Klein
Send message
Joined: 11 Oct 08
Posts: 1068
Credit: 1,151,203,264
RAC: 1,008,903
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47667 - Posted: 23 Jul 2017 | 15:27:06 UTC - in response to Message 47666.

Let's not blame a Windows Update unless we have proof, please.
I'm a Windows Insider, and routinely run Windows 10 Fast Ring, with no problems regarding GPUGrid.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 1844
Credit: 10,686,789,844
RAC: 10,151,546
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47673 - Posted: 23 Jul 2017 | 17:17:16 UTC - in response to Message 47666.

What happened between 21st & 22nd of April? (actually a month before this date)

Apologies for being late to the party. Assuming boincstats are accurate (and noticing other projects are losing active hosts and users as well) the Win10 Creator's Edition update must have occurred around that time. That certainly broke some hosts.
The Creator's update is not a mandatory update (yet), and it is distributed over a long period of time. I think that those who forced this update could also manage those problems which could prevent GPUGrid from working correctly; if the problems persist they could even revert back to the previous version. The only issue I've faced after updating to the Creator's update is that the Windows Search broke down in Outlook; but it turned out later that it wasn't the Creator's update (as the error showed up on not updated PCs). So my experiences do not support this explanation.

Number continues to fall :(
No, the number of active GPUGrid users is fluctuates around 1800 since two weeks. I think it will rise again in September.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 791
Credit: 1,427,941,620
RAC: 1,315,908
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47676 - Posted: 23 Jul 2017 | 21:57:54 UTC - in response to Message 47673.

When I installed the Creator's update, I had to re-install NVidia's driver to get CUDA and OpenCL support back. Just re-ran the installer for the driver I'd downloaded already and had been using before the update.

Erich56
Send message
Joined: 1 Jan 15
Posts: 372
Credit: 1,686,904,002
RAC: 2,943,712
Level
His
Scientific publications
watwatwat
Message 47681 - Posted: 24 Jul 2017 | 4:38:53 UTC - in response to Message 47676.

When I installed the Creator's update, I had to re-install NVidia's driver to get CUDA and OpenCL support back.

Same happened to me after the recent large Windows10 Updates. The reason is that the graphic driver that comes with the update is crippled in order to keep the update size to a minimum. Only re-installing the original driver from NVIDIA helps.

liderbug
Send message
Joined: 29 Jul 16
Posts: 22
Credit: 57,680,162
RAC: 381
Level
Thr
Scientific publications
watwat
Message 47699 - Posted: 27 Jul 2017 | 21:25:19 UTC - in response to Message 47615.
Last modified: 27 Jul 2017 | 21:27:05 UTC

I did: OS FC25, NVIDIA 381.22. I have 3 other projects that use my GPU but GpuGrid keeps saying "got 0 new tasks".

For what it's worth, it seems ps3grid.net and gpugrid.net are the same? are different? And on ps3 there was a page showing photos of half a dozen people working on the project with links to their ?bio? - problem is that none of the links ever returns, doesn't ping (IP), nslookup returns an IP. It's like everybody just walked. Also emails to "contact us" go unanswered.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 791
Credit: 1,427,941,620
RAC: 1,315,908
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47701 - Posted: 27 Jul 2017 | 21:36:34 UTC - in response to Message 47699.

PS3s were used for crunching here, before Sony locked down third party operating systems.

So they switched to using GPUs instead.

Erich56
Send message
Joined: 1 Jan 15
Posts: 372
Credit: 1,686,904,002
RAC: 2,943,712
Level
His
Scientific publications
watwatwat
Message 47704 - Posted: 28 Jul 2017 | 4:45:50 UTC - in response to Message 47699.

problem is that none of the links ever returns, doesn't ping (IP), nslookup returns an IP. It's like everybody just walked. Also emails to "contact us" go unanswered.

looks very much like GPUGRID :-(

Post to thread

Message boards : Number crunching : GPUGRID active users are falling dramatically!