Advanced search

Message boards : Server and website : Problems or maintenance?

Author Message
ftpd
Send message
Joined: 6 Jun 08
Posts: 152
Credit: 328,250,382
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20170 - Posted: 17 Jan 2011 | 11:02:05 UTC

Good morning,

Since this morning (after restart your servers) this message is comming up! Correct???
17-1-2011 11:57:45 GPUGRID work fetch resumed by user
17-1-2011 11:57:46 GPUGRID update requested by user
17-1-2011 11:57:51 GPUGRID Sending scheduler request: Requested by user.
17-1-2011 11:57:51 GPUGRID Reporting 2 completed tasks, requesting new tasks for CPU and GPU
17-1-2011 11:57:52 GPUGRID Scheduler request completed: got 0 new tasks
17-1-2011 11:57:52 GPUGRID Message from server: Server can't open log file (../log_grosso/scheduler.log)

____________
Ton (ftpd) Netherlands

ignasi
Send message
Joined: 10 Apr 08
Posts: 254
Credit: 16,836,000
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 20172 - Posted: 17 Jan 2011 | 11:36:01 UTC - in response to Message 20170.

Maintenance.

We were dangerously running out of disk space.

All sorted for now,
i

ftpd
Send message
Joined: 6 Jun 08
Posts: 152
Credit: 328,250,382
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20173 - Posted: 17 Jan 2011 | 11:37:04 UTC - in response to Message 20170.

After reporting and download all units cancel!

17-1-2011 12:32:44 GPUGRID Started download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-LICENSE
17-1-2011 12:32:45 GPUGRID Finished download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-LICENSE
17-1-2011 12:32:45 GPUGRID Started download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-COPYRIGHT
17-1-2011 12:32:46 GPUGRID Finished download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-COPYRIGHT
17-1-2011 12:32:46 GPUGRID Started download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-462-KASHIF_HIVPR_n1_unbound_so_ba2-4-100-RND1161_1
17-1-2011 12:32:47 GPUGRID Finished download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-462-KASHIF_HIVPR_n1_unbound_so_ba2-4-100-RND1161_1
17-1-2011 12:32:47 GPUGRID Started download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-462-KASHIF_HIVPR_n1_unbound_so_ba2-4-100-RND1161_2
17-1-2011 12:32:48 GPUGRID work fetch suspended by user
17-1-2011 12:32:49 GPUGRID Finished download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-462-KASHIF_HIVPR_n1_unbound_so_ba2-4-100-RND1161_2
17-1-2011 12:32:49 GPUGRID Started download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-462-KASHIF_HIVPR_n1_unbound_so_ba2-4-100-RND1161_3
17-1-2011 12:32:50 GPUGRID Finished download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-462-KASHIF_HIVPR_n1_unbound_so_ba2-4-100-RND1161_3
17-1-2011 12:32:50 GPUGRID Started download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-pdb_file
17-1-2011 12:32:54 GPUGRID Finished download of 284-KASHIF_HIVPR_n1_bound_so_ba2-16-par_file
17-1-2011 12:32:54 GPUGRID Started download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-psf_file
17-1-2011 12:32:54 GPUGRID Starting 284-KASHIF_HIVPR_n1_bound_so_ba2-16-100-RND3760_1
17-1-2011 12:32:56 GPUGRID Starting task 284-KASHIF_HIVPR_n1_bound_so_ba2-16-100-RND3760_1 using acemd2 version 613
17-1-2011 12:32:57 GPUGRID Finished download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-psf_file
17-1-2011 12:32:57 GPUGRID Started download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-par_file
17-1-2011 12:33:02 GPUGRID update requested by user
17-1-2011 12:33:14 GPUGRID Computation for task 284-KASHIF_HIVPR_n1_bound_so_ba2-16-100-RND3760_1 finished
17-1-2011 12:33:14 GPUGRID Output file 284-KASHIF_HIVPR_n1_bound_so_ba2-16-100-RND3760_1_1 for task 284-KASHIF_HIVPR_n1_bound_so_ba2-16-100-RND3760_1 absent
17-1-2011 12:33:14 GPUGRID Output file 284-KASHIF_HIVPR_n1_bound_so_ba2-16-100-RND3760_1_2 for task 284-KASHIF_HIVPR_n1_bound_so_ba2-16-100-RND3760_1 absent
17-1-2011 12:33:14 GPUGRID Output file 284-KASHIF_HIVPR_n1_bound_so_ba2-16-100-RND3760_1_3 for task 284-KASHIF_HIVPR_n1_bound_so_ba2-16-100-RND3760_1 absent
17-1-2011 12:33:14 GPUGRID Sending scheduler request: Requested by user.
17-1-2011 12:33:14 GPUGRID Reporting 1 completed tasks, not requesting new tasks
17-1-2011 12:33:15 GPUGRID Started upload of 284-KASHIF_HIVPR_n1_bound_so_ba2-16-100-RND3760_1_0
17-1-2011 12:33:15 GPUGRID Started upload of 284-KASHIF_HIVPR_n1_bound_so_ba2-16-100-RND3760_1_7
17-1-2011 12:33:16 GPUGRID Scheduler request completed
17-1-2011 12:33:21 GPUGRID Finished download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-pdb_file
17-1-2011 12:33:21 GPUGRID Started download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-conf_file_enc
17-1-2011 12:33:22 GPUGRID Finished download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-conf_file_enc
17-1-2011 12:33:22 GPUGRID Started download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-metainp_file
17-1-2011 12:33:23 GPUGRID Finished download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-metainp_file
17-1-2011 12:33:23 GPUGRID Started download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-462-KASHIF_HIVPR_n1_unbound_so_ba2-4-100-RND1161_7
17-1-2011 12:33:25 GPUGRID Finished download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-462-KASHIF_HIVPR_n1_unbound_so_ba2-4-100-RND1161_7
17-1-2011 12:33:32 GPUGRID Finished download of 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-par_file
17-1-2011 12:33:32 GPUGRID Starting 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-100-RND1161_1
17-1-2011 12:33:33 GPUGRID Starting task 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-100-RND1161_1 using acemd2 version 613
17-1-2011 12:33:47 GPUGRID Computation for task 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-100-RND1161_1 finished
17-1-2011 12:33:47 GPUGRID Output file 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-100-RND1161_1_1 for task 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-100-RND1161_1 absent
17-1-2011 12:33:47 GPUGRID Output file 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-100-RND1161_1_2 for task 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-100-RND1161_1 absent
17-1-2011 12:33:47 GPUGRID Output file 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-100-RND1161_1_3 for task 462-KASHIF_HIVPR_n1_unbound_so_ba2-5-100-RND1161_1 absent

____________
Ton (ftpd) Netherlands

ftpd
Send message
Joined: 6 Jun 08
Posts: 152
Credit: 328,250,382
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20176 - Posted: 17 Jan 2011 | 12:18:58 UTC - in response to Message 20173.

And now this one:
17-1-2011 12:38:06 GPUGRID work fetch resumed by user
17-1-2011 12:38:08 GPUGRID Sending scheduler request: To fetch work.
17-1-2011 12:38:08 GPUGRID Reporting 1 completed tasks, requesting new tasks for GPU
17-1-2011 12:38:09 GPUGRID Scheduler request completed: got 0 new tasks
17-1-2011 12:38:09 GPUGRID Message from server: No work sent
17-1-2011 12:38:09 GPUGRID Message from server: ACEMD beta version is not available for your type of computer.
17-1-2011 12:38:10 GPUGRID work fetch suspended by user
17-1-2011 12:39:06 GPUGRID work fetch resumed by user
17-1-2011 12:39:09 GPUGRID Sending scheduler request: To fetch work.
17-1-2011 12:39:09 GPUGRID Requesting new tasks for GPU
17-1-2011 12:39:11 GPUGRID Scheduler request completed: got 0 new tasks
17-1-2011 12:39:11 GPUGRID Message from server: No work sent
17-1-2011 12:39:11 GPUGRID Message from server: ACEMD beta version is not available for your type of computer.
17-1-2011 12:39:49 GPUGRID Sending scheduler request: To fetch work.
17-1-2011 12:39:49 GPUGRID Requesting new tasks for CPU
17-1-2011 12:39:51 GPUGRID Scheduler request completed: got 0 new tasks
17-1-2011 12:39:51 GPUGRID Message from server: No work sent
17-1-2011 12:39:51 GPUGRID Message from server: ACEMD beta version is not available for your type of computer.
17-1-2011 12:40:07 GPUGRID work fetch suspended by user
17-1-2011 12:41:01 GPUGRID work fetch resumed by user
17-1-2011 12:41:06 GPUGRID Sending scheduler request: To fetch work.
17-1-2011 12:41:06 GPUGRID Requesting new tasks for CPU and GPU
17-1-2011 12:41:07 GPUGRID Scheduler request completed: got 0 new tasks
17-1-2011 12:41:07 GPUGRID Message from server: No work sent
17-1-2011 12:41:07 GPUGRID Message from server: ACEMD beta version is not available for your type of computer.
17-1-2011 12:41:08 GPUGRID work fetch suspended by user
17-1-2011 12:43:50 Suspending network activity - user request
17-1-2011 13:14:50 Resuming network activity
17-1-2011 13:15:12 GPUGRID work fetch resumed by user
17-1-2011 13:15:13 GPUGRID Sending scheduler request: To fetch work.
17-1-2011 13:15:13 GPUGRID Requesting new tasks for CPU and GPU
17-1-2011 13:15:15 GPUGRID Scheduler request completed: got 0 new tasks
17-1-2011 13:15:15 GPUGRID Message from server: No work sent
17-1-2011 13:15:15 GPUGRID Message from server: ACEMD beta version is not available for your type of computer.
17-1-2011 13:15:16 GPUGRID work fetch suspended by user
17-1-2011 13:15:52 GPUGRID work fetch resumed by user
17-1-2011 13:16:08 GPUGRID update requested by user
17-1-2011 13:16:08 GPUGRID Sending scheduler request: Requested by user.
17-1-2011 13:16:08 GPUGRID Requesting new tasks for CPU and GPU
17-1-2011 13:16:10 GPUGRID Scheduler request completed: got 0 new tasks
17-1-2011 13:16:10 GPUGRID Message from server: No work sent
17-1-2011 13:16:10 GPUGRID Message from server: ACEMD beta version is not available for your type of computer.
17-1-2011 13:16:46 GPUGRID Sending scheduler request: To fetch work.
17-1-2011 13:16:46 GPUGRID Requesting new tasks for GPU
17-1-2011 13:16:47 GPUGRID Scheduler request completed: got 0 new tasks
17-1-2011 13:16:47 GPUGRID Message from server: No work sent
17-1-2011 13:16:47 GPUGRID Message from server: ACEMD beta version is not available for your type of computer.

Good luck!!
____________
Ton (ftpd) Netherlands

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20177 - Posted: 17 Jan 2011 | 14:40:41 UTC - in response to Message 20176.
Last modified: 17 Jan 2011 | 14:48:59 UTC

Are those bincoordfile errors related to task creation?

ERROR: file mdioload.cpp line 80: Unable to read bincoordfile

called boinc_finish

Last seen in Oct

Profile Saenger
Avatar
Send message
Joined: 20 Jul 08
Posts: 134
Credit: 23,657,183
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwat
Message 20178 - Posted: 17 Jan 2011 | 15:22:33 UTC - in response to Message 20177.

Are those bincoordfile errors related to task creation?

ERROR: file mdioload.cpp line 80: Unable to read bincoordfile

called boinc_finish

Last seen in Oct

The fix given there was to restart the computer, which is impossible for me as I have one checkpointless RNAworld running for at least another 200h.

Will a detach/reattach help as well?
____________
Gruesse vom Saenger

For questions about Boinc look in the BOINC-Wiki

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20179 - Posted: 17 Jan 2011 | 17:24:43 UTC - in response to Message 20178.

I'm in the same boat - GPUGrid tasks were failing on a GT240, and I am running RNA tasks, so I can't reboot.
I reset the project, and although I initially just got the usual "ACEMD beta version is not available for your type of computer" message, I now have tasks.
So yes, I suggest you try resetting the project too.
If you don't get any work you could always run some Einstein tasks for a while.

Profile Saenger
Avatar
Send message
Joined: 20 Jul 08
Posts: 134
Credit: 23,657,183
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwat
Message 20180 - Posted: 17 Jan 2011 | 20:39:12 UTC

I fixed itself without any intervention from me, no reset, no restart, just some new WUs.
At least I'm crunching now for 4:30h on the current Gianni.

Ah, and Einstein is not for me now, as it's windoze only.
____________
Gruesse vom Saenger

For questions about Boinc look in the BOINC-Wiki

ignasi
Send message
Joined: 10 Apr 08
Posts: 254
Credit: 16,836,000
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 20195 - Posted: 20 Jan 2011 | 14:33:11 UTC - in response to Message 20177.

They are ghost WUs.

These WUs were long ago finished, retrieved by the scientist and removed from the server. On the 17th of January we had to act on the server due to disk space problems. We ran out of disk space basically. We had Gigs of log files from the scheduler and others that we removed. Which explains these weird transient behaviours with the scheduler.log and -may- have somehow re-spawned these zombie WUs that died after successive error accumulation.

Back to normal now. Sorry for the inconviniences,
ignasi

Post to thread

Message boards : Server and website : Problems or maintenance?

//