Advanced search

Message boards : Number crunching : Task....continues? Zero Status but no finished File

Author Message
Dayle Diamond
Send message
Joined: 5 Dec 12
Posts: 69
Credit: 1,066,941,815
RAC: 998,480
Level
Met
Scientific publications
watwatwatwatwatwatwatwat
Message 47422 - Posted: 13 Jun 2017 | 7:27:10 UTC

Came back to my computer, turned on the screen. Saw graphical artifacts, a few seconds later they fade to black, and then the machine whirrs to life like nothing happened.

I check MSI afterburner, and the chart shows both GPUs briefly drew less power.

GPUgrid progress seems unaffected.
Log shows these messages, stretching back to a few hours prior.

6/12/2017 9:56:19 PM | GPUGRID | Task e103s21_e56s4p0f314-PABLO_Q15004_0_IDP-0-1-RND1111_1 exited with zero status but no 'finished' file
6/12/2017 9:56:19 PM | GPUGRID | Task e8s3_e6s27p0f178-PABLO_P01106_4_IDP-0-1-RND8778_0 exited with zero status but no 'finished' file

Anybody else experiencing this?

Betting Slip
Send message
Joined: 5 Jan 09
Posts: 588
Credit: 2,037,147,925
RAC: 1,495,102
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 47423 - Posted: 13 Jun 2017 | 8:57:50 UTC - in response to Message 47422.
Last modified: 13 Jun 2017 | 9:02:01 UTC

Came back to my computer, turned on the screen. Saw graphical artifacts, a few seconds later they fade to black, and then the machine whirrs to life like nothing happened.

I check MSI afterburner, and the chart shows both GPUs briefly drew less power.

GPUgrid progress seems unaffected.
Log shows these messages, stretching back to a few hours prior.

6/12/2017 9:56:19 PM | GPUGRID | Task e103s21_e56s4p0f314-PABLO_Q15004_0_IDP-0-1-RND1111_1 exited with zero status but no 'finished' file
6/12/2017 9:56:19 PM | GPUGRID | Task e8s3_e6s27p0f178-PABLO_P01106_4_IDP-0-1-RND8778_0 exited with zero status but no 'finished' file

Anybody else experiencing this?


This is usually a case of loss of power or being shutdown incorrectly. Has either of these occured?

I can only find 2 tasks on your 970 that say "SWAN : FATAL Unable to load module .mshake_kernel.cu. (999)" Which is usually the reasons I mentioned above.

Dayle Diamond
Send message
Joined: 5 Dec 12
Posts: 69
Credit: 1,066,941,815
RAC: 998,480
Level
Met
Scientific publications
watwatwatwatwatwatwatwat
Message 47431 - Posted: 13 Jun 2017 | 22:27:40 UTC

There was a power outage in my home about a week and a half ago.
With my turnaround time, I don't think there was a causal relationship between the two.

One of the WU's has finished and has been validated.
The other was only at 43% after 21 hours, so it appears to be stuck.

I've paused it but intend to cancel it unless warned that multiday WU's are in the queue.


Same system, same time, same science projects (presumably), same graphics card model (EVGA GTX 970 SSC). Totally different results.

Post to thread

Message boards : Number crunching : Task....continues? Zero Status but no finished File