21) Message boards : Number crunching : ADRIA extremely slow - not checkpointing (Message 56396)
Posted 1173 days ago by lohphat
I just got my first WU and the clock is climbing. It's taken 1:40:00 for 6%, the estimated time is now at 26h and climbing.

GTX 980 Ti
22) Message boards : Server and website : New computer and restored BOINC directory (Message 55141)
Posted 1367 days ago by lohphat
My last computer died a few months back and I was just able to build a new one.

The problem is that I restored the old BOINC directory from a backup.

So now new results (reported this week on the new hardware) are being reported on the old computer ID but it's updated the hardware and computer name. ID: 392370

Is this a problem? I don't want results skewed w/o having the results segregated properly.

Should I be concerned about this?
23) Message boards : Number crunching : GForce 980 Ti only using 30% with acemd 2.10 (Message 53334)
Posted 1601 days ago by lohphat
Shouldn't the app scale to use more of the GPU? I have resources set to use 100%.
24) Message boards : News : Windows apps restored (Message 50159)
Posted 2099 days ago by lohphat
The WU I'm getting have HUGH estimated completing times of 12d+ and are counting down at 1m50s time remaining per second.

So something's overestimating the time pretty poorly.

And yes, benchmarks were run during the WU suspension period so they're current.
25) Message boards : Number crunching : Stalled WUs? (Message 48732)
Posted 2287 days ago by lohphat
I had to roll back driver to 385.41 which is the latest driver not to have issues with Firefox browser. It is on Nvidia forums, I had driver "stopped responding and recovered" while browsing with Firefox.


That did it.

However I have in my notes that 385.41 caused WU errors with Einstein@Home -- I'm awaiting for the project to issue me new WUs to verify.

But as for GPUGRID, it fixed the problem.

FWIW, it never crashed the driver or FFox -- it just caused GPUGRID WUs to stall but not error out.

26) Message boards : Number crunching : Stalled WUs? (Message 48718)
Posted 2288 days ago by lohphat
It seems related to running Firefox (knowing it has h/w acceleration options) -- I'm still playing woth the settings but I can get GPUGRID work units to stall simply by opening up YouTube and playing a video.

The WU percentage stops increasing but it still shows active.

After 10hours I exit BOINC and restart and the hours worked drops back down to the point where it stalled.

So it's still happening, and I can recreate the failure consistently.

I've restarted the project to refresh resources.
27) Message boards : Number crunching : Stalled WUs? (Message 48309)
Posted 2333 days ago by lohphat
Two more work units stalled which I had to abort.

Methinks there's a systemic problem managing WUs.
28) Message boards : Number crunching : Stalled WUs? (Message 48295)
Posted 2334 days ago by lohphat
I have a WU which has been running for several days and seems to get stuck at a percentage complete.

Then I shutdown BOINC and relaunch and the accumulated work disappears and it restarts from a much lower percentage.

Rinse repeat.

The WU is crunching but with no percentage progress.
29) Message boards : Number crunching : Why do I have a bunch of old WU in my history which haven't been purged? (Message 48071)
Posted 2373 days ago by lohphat
But these are on the GPUGRID website, don't they control what's purged from their local db?
30) Message boards : Number crunching : Why do I have a bunch of old WU in my history which haven't been purged? (Message 48067)
Posted 2373 days ago by lohphat
13266067 10174700 392370 24 Oct 2014 | 2:06:43 UTC 24 Oct 2014 | 2:08:25 UTC Error while downloading 0.00 0.00 --- Test application for CPU MD v9.02 (mtavx)

13046953 10045085 392370 4 Sep 2014 | 14:08:45 UTC 4 Sep 2014 | 14:59:47 UTC Error while computing 3.28 0.13 --- Long runs (8-12 hours on fastest card) v8.41 (cuda60)

10318414 7496871 392370 14 May 2014 | 17:03:28 UTC 22 May 2014 | 8:02:17 UTC Aborted by user 4,603.37 1,038.16 --- Long runs (8-12 hours on fastest card) v8.41 (cuda42)

7418915 4883327 392370 31 Oct 2013 | 14:49:38 UTC 31 Oct 2013 | 16:40:40 UTC Error while computing 2.13 0.34 --- Short runs (2-3 hours on fastest card) v8.14 (cuda55)

7226751 4735022 392370 30 Aug 2013 | 23:35:42 UTC 4 Sep 2013 | 23:35:16 UTC Not started by deadline - canceled 0.00 0.00 --- ACEMD beta version v8.04 (cuda42)

7218170 4728998 392370 28 Aug 2013 | 16:58:44 UTC 23 Sep 2013 | 7:07:14 UTC Aborted by user 3,328.91 4.89 --- Long runs (8-12 hours on fastest card) v8.00 (cuda42)


Previous 10 | Next 10
//