Advanced search

Message boards : News : WARNING/CHALLENGE: VERY LONG WU (VERYLONG_CXCL12_confAna)

Author Message
Gerard
Send message
Joined: 26 Mar 14
Posts: 101
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 39575 - Posted: 22 Jan 2015 | 15:33:26 UTC
Last modified: 22 Jan 2015 | 17:15:07 UTC

We just launched 400 very long WU (they will take about 24h in a 780GTX) named VERYLONG_CXCL12_confAna whose results we need as soon as possible (we are in a hurry). They come with a credit+bonus of 400K. Please, if you don't have a good graphic card, reject them. For the brave ones, take it as a challenge and see you on the performance tab ;)

Betting Slip
Send message
Joined: 5 Jan 09
Posts: 670
Credit: 2,498,095,550
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39578 - Posted: 22 Jan 2015 | 16:16:48 UTC - in response to Message 39575.
Last modified: 22 Jan 2015 | 16:32:37 UTC

Please, if you don't have a good credit card, reject them.


Would love to do as you ask but its not always possible as some computers are remote.


I think he means graphic card. :-)

[CSF] Thomas H.V. DUPONT
Send message
Joined: 20 Jul 14
Posts: 732
Credit: 100,630,366
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwat
Message 39579 - Posted: 22 Jan 2015 | 16:23:40 UTC - in response to Message 39575.
Last modified: 22 Jan 2015 | 16:23:57 UTC

Nothing here ! Too bad ! Snif, snif ;)
____________
[CSF] Thomas H.V. Dupont
Founder of the team CRUNCHERS SANS FRONTIERES 2.0
www.crunchersansfrontieres

Gerard
Send message
Joined: 26 Mar 14
Posts: 101
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 39580 - Posted: 22 Jan 2015 | 17:16:00 UTC - in response to Message 39578.

hahaha sorry. what was i thinking about...?

Profile [AF>Amis des Lapins] Phil...
Send message
Joined: 16 Jul 13
Posts: 56
Credit: 1,626,354,890
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39581 - Posted: 22 Jan 2015 | 17:48:34 UTC
Last modified: 22 Jan 2015 | 17:50:37 UTC

If it was possible to select the WU's, I would take some of the VERY LONG.

Or ... is there a way to download them ? Or will you send them with "high priority" ?

NB Will receive my 4th GTX970 tomorrow :D

Purchased only "because" of GPUGRID ;)

[CSF] Thomas H.V. DUPONT
Send message
Joined: 20 Jul 14
Posts: 732
Credit: 100,630,366
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwat
Message 39583 - Posted: 22 Jan 2015 | 17:54:52 UTC - in response to Message 39581.

If it was possible to select the WU's, I would take some of the VERY LONG.

Or ... is there a way to download them ? Or will you send them with "high priority" ?

NB Will receive my 4th GTX970 tomorrow :D

Purchased only "because" of GPUGRID ;)

Je ne pense pas que ce soit possible Phil...
C'est au petit bonheur la chance, IMHO...
Ou sinon je suis preneur aussi ;)
Bonjour à l'Alliance :)
____________
[CSF] Thomas H.V. Dupont
Founder of the team CRUNCHERS SANS FRONTIERES 2.0
www.crunchersansfrontieres

skydivingnerd
Send message
Joined: 26 Feb 13
Posts: 7
Credit: 2,242,660,281
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39585 - Posted: 22 Jan 2015 | 18:05:11 UTC

Just checked my tasks. No joy on the high value target.
____________

Profile [AF>Amis des Lapins] Phil...
Send message
Joined: 16 Jul 13
Posts: 56
Credit: 1,626,354,890
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39586 - Posted: 22 Jan 2015 | 18:22:48 UTC

Hello Thomas,

Ou alors, il faut abandonner les UT jusqu'à ce qu'on reçoive des VeryLong MDR ?

Je transmettrai ton Bonjour ;) :)

Philippe

Profile [AF>Amis des Lapins] Phil...
Send message
Joined: 16 Jul 13
Posts: 56
Credit: 1,626,354,890
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39587 - Posted: 22 Jan 2015 | 18:27:50 UTC
Last modified: 22 Jan 2015 | 18:39:15 UTC

Received 2 : GERARD_VERYLONG_CXCL12

:D :D :D

Luca
Send message
Joined: 7 Jun 13
Posts: 1
Credit: 61,498,187
RAC: 2
Level
Thr
Scientific publications
watwatwatwatwatwatwat
Message 39588 - Posted: 22 Jan 2015 | 18:51:01 UTC

This may be the happiest day of my life.

[CSF] Thomas H.V. DUPONT
Send message
Joined: 20 Jul 14
Posts: 732
Credit: 100,630,366
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwat
Message 39590 - Posted: 22 Jan 2015 | 19:05:22 UTC - in response to Message 39587.

Received 2 : GERARD_VERYLONG_CXCL12

:D :D :D

Veinard ! :) GG !
____________
[CSF] Thomas H.V. Dupont
Founder of the team CRUNCHERS SANS FRONTIERES 2.0
www.crunchersansfrontieres

Edwin
Send message
Joined: 19 Nov 14
Posts: 3
Credit: 857,950
RAC: 0
Level
Gly
Scientific publications
wat
Message 39591 - Posted: 22 Jan 2015 | 19:07:45 UTC
Last modified: 22 Jan 2015 | 19:08:21 UTC

Looking forward to compute some of them on my GTX 970 :3
Must happen ASAP! :)

Will they come Automatically? Cuz in Settings I´ve only selected Short runs :O

Profile taurec
Send message
Joined: 21 Jun 10
Posts: 3
Credit: 351,940,197
RAC: 4,329,764
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39592 - Posted: 22 Jan 2015 | 19:57:20 UTC

Hi, I've received one very long GERARD on my 560ti with 448 shaders. Does it make sense to to crunch it to the end?
Greets :-)

Edwin
Send message
Joined: 19 Nov 14
Posts: 3
Credit: 857,950
RAC: 0
Level
Gly
Scientific publications
wat
Message 39593 - Posted: 22 Jan 2015 | 19:58:29 UTC

I got another short run-.-

RaymondFO*
Send message
Joined: 22 Nov 12
Posts: 72
Credit: 14,040,706,346
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39595 - Posted: 22 Jan 2015 | 20:40:45 UTC
Last modified: 22 Jan 2015 | 21:00:24 UTC

Got one of these tasks on a computer with a NVIDIA GTX 680 factory OC'd video card that rarely has a processing error. I will run it unless you need the work unit assigned to another computer with a stronger/faster video card.

sarge104
Send message
Joined: 21 Nov 13
Posts: 5
Credit: 7,420,264
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 39596 - Posted: 22 Jan 2015 | 20:42:18 UTC

Saw the call for the challenge and was able to download 2 VL's while on lunch. I've got 15 hours from the time this message is being typed until my Nvidia Titan's attention can be turned to them. Excited to see if these use the full processing power of the card or if I have to run them two(or more) at a time like most GPU WU's.

Are there any settings we need to tweak to make sure we get full utilization?

Dayle Diamond
Send message
Joined: 5 Dec 12
Posts: 84
Credit: 1,628,088,415
RAC: 1,457,783
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39598 - Posted: 22 Jan 2015 | 21:25:38 UTC

These are on the, uh, normally long queue right?

Tried to opt in with my preferences, to no avail.

None here yet. Good luck and happy crunching!

sarge104
Send message
Joined: 21 Nov 13
Posts: 5
Credit: 7,420,264
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 39599 - Posted: 22 Jan 2015 | 21:40:25 UTC - in response to Message 39598.

These are on the, uh, normally long queue right?

Tried to opt in with my preferences, to no avail.

None here yet. Good luck and happy crunching!


Dayle,

I did switch my settings from accepting shorts and beta work to just long WU's. During my lunch break was able to get two WU's before being told to fly a kite by Boinc when requesting more. ;)


Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39600 - Posted: 22 Jan 2015 | 23:55:29 UTC - in response to Message 39575.
Last modified: 23 Jan 2015 | 0:06:49 UTC

We just launched 400 very long WU (they will take about 24h in a 780GTX) named VERYLONG_CXCL12_confAna whose results we need as soon as possible (we are in a hurry).

I've got two of them on my GTX 780Ti host.
According to my linear approximation made at 12.3% progress, the total computing time will be about 18 hours and 15 minutes.

They come with a credit+bonus of 400K.

That's nice.

Please, if you don't have a good graphic card, reject them.

That's an inappropriate way to arrange such batches.
You should set up a third queue for these purposes.
I couldn't receive one of these workunits on my GTX980 host. I'm sure I'm not alone with that.
EDIT: I had to abort 10 other workunits to receive one of these on my GTX980 host.
That's why your method is dangerous: it propagates failed workunits (by encouraging user intervention)

For the brave ones, take it as a challenge and see you on the performance tab ;)

Challenge accepted. :)

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39601 - Posted: 23 Jan 2015 | 0:25:38 UTC

Found one already downloaded and running on a GTX 670 - 10% done, and the figures suggest ~36 hours total, or mid-morning Saturday UTC for completion. Plus whatever it takes to upload the result file - most tasks generate ~5MB per hour on this card, so I'm expecting a 180 MB final upload!

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39603 - Posted: 23 Jan 2015 | 0:34:03 UTC

I have two of these running on my GTX 680 host, it will take 28 hours and 40 minutes for the faster (1200MHz) card to finish.

Gerard Audette
Send message
Joined: 19 Oct 14
Posts: 1
Credit: 150,909,822
RAC: 0
Level
Ile
Scientific publications
watwatwatwat
Message 39604 - Posted: 23 Jan 2015 | 1:37:20 UTC

Is a intel i3 with a gtx 660 fast enough?

Wdethomas
Send message
Joined: 6 Feb 10
Posts: 38
Credit: 274,204,838
RAC: 0
Level
Asn
Scientific publications
watwatwatwatwatwat
Message 39605 - Posted: 23 Jan 2015 | 2:13:28 UTC

Frustrating. I have THREE GTX 780ti SC and not even one very long WU. Aborted several long run WU to see if I could get just one and nothing.

VT
Send message
Joined: 28 Nov 13
Posts: 17
Credit: 153,786,987
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 39606 - Posted: 23 Jan 2015 | 2:20:33 UTC

aborted about 30 - no luck in getting the rare ones for 970.

kppom
Send message
Joined: 26 Dec 14
Posts: 1
Credit: 40,180,326
RAC: 0
Level
Val
Scientific publications
watwatwatwatwat
Message 39608 - Posted: 23 Jan 2015 | 4:24:55 UTC - in response to Message 39575.

Is GTX 970M fast enough to compute this project?
____________

Profile caffeineyellow5
Avatar
Send message
Joined: 30 Jul 14
Posts: 225
Credit: 2,658,976,345
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwat
Message 39610 - Posted: 23 Jan 2015 | 7:07:31 UTC - in response to Message 39575.

Looks like 16.5 hours here! YAY!
____________
1 Corinthians 9:16 "For though I preach the gospel, I have nothing to glory of: for necessity is laid upon me; yea, woe is unto me, if I preach not the gospel!"
Ephesians 6:18-20, please ;-)
http://tbc-pa.org

Edwin
Send message
Joined: 19 Nov 14
Posts: 3
Credit: 857,950
RAC: 0
Level
Gly
Scientific publications
wat
Message 39613 - Posted: 23 Jan 2015 | 9:06:28 UTC

Well, I don´t care if I don´t get these WU´s.. It´s ok when I can spend the little things too, it´s for Science. :)

Happy Crunching Guys! :)

Stefan
Project administrator
Project developer
Project tester
Project scientist
Send message
Joined: 5 Mar 13
Posts: 348
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 39616 - Posted: 23 Jan 2015 | 9:25:42 UTC
Last modified: 23 Jan 2015 | 9:36:29 UTC

Please don't cancel any more WUs to get them! I checked and they are already all taken at this point so only if they fail on some machines will they get added to the queue again. They are only a single step so they are not coming back.
I agree with Retvari that it's not a good idea to put them on the long queue. We will discuss about queues and how to organize them better.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39617 - Posted: 23 Jan 2015 | 9:32:16 UTC

It looks like my GTX 980 will finish this very long workunit in 15 hours and 40 minutes. It's 2.5 hours shorter than the estimated running time of the GTX 780Ti. These workunits has low memory controller load (29-31% on my GTX980@3.5GHz, 11% on my GTX780Ti@3.5GHz, 24% on my GTX780Ti@2.9GHz).

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39618 - Posted: 23 Jan 2015 | 9:39:51 UTC - in response to Message 39604.

Is a intel i3 with a gtx 660 fast enough?

According to my estimation your host will finish in 2 days 5 hours and 20 minutes, so it fast enough to finish before the deadline, but not fast enough to earn bonus for returning the result within 2 days.

biodoc
Send message
Joined: 26 Aug 08
Posts: 183
Credit: 6,466,114,375
RAC: 1,393,151
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39619 - Posted: 23 Jan 2015 | 11:53:09 UTC

I have one on a GTX980. 43% complete after 11 hours.

Betting Slip
Send message
Joined: 5 Jan 09
Posts: 670
Credit: 2,498,095,550
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39620 - Posted: 23 Jan 2015 | 12:26:53 UTC - in response to Message 39616.

Please don't cancel any more WUs to get them! I checked and they are already all taken at this point so only if they fail on some machines will they get added to the queue again. They are only a single step so they are not coming back.
I agree with Retvari that it's not a good idea to put them on the long queue. We will discuss about queues and how to organize them better.


Gerard never asked anyone to ABORT WU's to get their hands on one of these long WU's.
While you may agree with Retvari he was the first to post that he had ABORTED other WU's for no other reason than to get one of these on a particular card. His warning was a "Self Fulfilling Prophecy" and others followed.
Maybe Retvari decided that these units gave more credits and a place on the "Performance" tab and were more important than other scientists WU's. In addition, he might of thought he deserved more of them than anyone else.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39621 - Posted: 23 Jan 2015 | 13:25:16 UTC - in response to Message 39620.
Last modified: 23 Jan 2015 | 13:32:47 UTC

Gerard never asked anyone to ABORT WU's to get their hands on one of these long WU's.
While you may agree with Retvari he was the first to post that he had ABORTED other WU's for no other reason than to get one of these on a particular card. His warning was a "Self Fulfilling Prophecy" and others followed.

It is quite insulting to others actually, that you are saying they couldn't figure it out without my "advice".

Maybe Retvari decided that these units gave more credits and a place on the "Performance" tab and were more important than other scientists WU's. In addition, he might of thought he deserved more of them than anyone else.

I'm just a human. Sorry. I made mistakes. But I really like to help other people learn from my mistakes, especially those which were induced by their mistakes.

sarge104
Send message
Joined: 21 Nov 13
Posts: 5
Credit: 7,420,264
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 39622 - Posted: 23 Jan 2015 | 14:07:48 UTC

Started the VL WU's early this morning while grabbing a cup of coffee. One WU only used around 60% of the Titan(May have misread, it was early :)). Currently running them concurrently and achieved 75-80 percent utilization.

Boinc is estimating 7.5 hours. Will check during lunch and see what progress is made in around 5 hrs.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39623 - Posted: 23 Jan 2015 | 14:16:31 UTC - in response to Message 39601.

... so I'm expecting a 180 MB final upload!

My task has now reached 47%, and I'm beginning to get a little bit worried by this.

File 2x23-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3250_0_9 has already reached 84,510 KB, and if it continues to grow linearly with progress, the final size will be very close to the 180 MB I predicted.

But the upload file specification says

<file>
<name>2x23-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3250_0_9</name>
<nbytes>0.000000</nbytes>
<max_nbytes>128000000.000000</max_nbytes>
<status>0</status>
<upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url>
</file>

<max_nbytes> 128,000,000 is 128 MB, no? And we are going to exceed that by nearly 50%?

Fortunately, Retvari's GTX 980 mine-canary should finish in a couple of hours, and the rest of us tortoises will find out whether the hare has crashed and burned - or not.

Dayle Diamond
Send message
Joined: 5 Dec 12
Posts: 84
Credit: 1,628,088,415
RAC: 1,457,783
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39624 - Posted: 23 Jan 2015 | 14:39:35 UTC - in response to Message 39623.

Fortunately, Retvari's GTX 980 mine-canary should finish in a couple of hours, and the rest of us tortoises will find out whether the hare has crashed and burned - or not.


Ahem. We are all working on the same side.
If Retvari Zoltan's work units fail, ours will also fail...just a little slower.

Profile [AF>Amis des Lapins] Phil...
Send message
Joined: 16 Jul 13
Posts: 56
Credit: 1,626,354,890
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39626 - Posted: 23 Jan 2015 | 14:59:15 UTC - in response to Message 39620.
Last modified: 23 Jan 2015 | 14:59:54 UTC

Please don't cancel any more WUs to get them! I checked and they are already all taken at this point so only if they fail on some machines will they get added to the queue again. They are only a single step so they are not coming back.
I agree with Retvari that it's not a good idea to put them on the long queue. We will discuss about queues and how to organize them better.


Gerard never asked anyone to ABORT WU's to get their hands on one of these long WU's.
While you may agree with Retvari he was the first to post that he had ABORTED other WU's for no other reason than to get one of these on a particular card. His warning was a "Self Fulfilling Prophecy" and others followed.
Maybe Retvari decided that these units gave more credits and a place on the "Performance" tab and were more important than other scientists WU's. In addition, he might of thought he deserved more of them than anyone else.


Strongly Disagree with you. I also asked if one should abort WU's to obtain the "VERY LONG". But in French, so I guess you did not understand. Furthermore, it was a question only, not a suggestion.

http://www.gpugrid.net/forum_thread.php?id=3988&nowrap=true#39586

I don't know Retvari Zoltan in the real life, but your post is insulting, as he is very often posting here in order to HELP crunchers + to give very good advices.

We are here to help science, not to denigrate a particular person.

Retvari Zoltan, I want to thank you for the help and advices you give on this forum.

Best Regards,

Phil1966

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39627 - Posted: 23 Jan 2015 | 15:01:44 UTC - in response to Message 39623.
Last modified: 23 Jan 2015 | 15:03:41 UTC

Good points, both of you! (Richard Haselgrove & Dayle)

My GTX780Ti has 55 minutes left, so we have to act fast.
The 4x14-GERARD_VERYLONG_CXCL12_confAna-0-1-RND7430_0_9 file on that host has grown to 177639400 bytes, which is larger than the specified size in the client_state.xml file, but if you look for the previous file section, it's upload size is doubled. So I think that the wrong file size is doubled by accident.

<file> <name>4x14-GERARD_VERYLONG_CXCL12_confAna-0-1-RND7430_0_8</name> <nbytes>0.000000</nbytes> <max_nbytes>256000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file> <file> <name>4x14-GERARD_VERYLONG_CXCL12_confAna-0-1-RND7430_0_9</name> <nbytes>0.000000</nbytes> <max_nbytes>128000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file>

However, it could be easily fixed by manually editing these numbers with a text editor (it is advised to exit BOINC manager & the scientific applications before doing so).

Profile [AF>Amis des Lapins] Phil...
Send message
Joined: 16 Jul 13
Posts: 56
Credit: 1,626,354,890
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39628 - Posted: 23 Jan 2015 | 15:02:21 UTC

It looks like the 970 PNY will take about 30 hours to complete these WU's while the 970 Gaming G1 will need about 24 hours.

(The PNY are connected to an old PCI E 2.0 MB ... :/)

Profile [AF>Amis des Lapins] Phil...
Send message
Joined: 16 Jul 13
Posts: 56
Credit: 1,626,354,890
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39629 - Posted: 23 Jan 2015 | 15:05:37 UTC - in response to Message 39627.
Last modified: 23 Jan 2015 | 15:07:38 UTC

Good points, both of you! (Richard Haselgrove & Dayle)

My GTX780Ti has 55 minutes left, so we have to act fast.
The 4x14-GERARD_VERYLONG_CXCL12_confAna-0-1-RND7430_0_9 file on that host has grown to 177639400 bytes, which is larger than the specified size in the client_state.xml file, but if you look for the previous file section, it's upload size is doubled. So I think that the wrong file size is doubled by accident.

However, it could be easily fixed by manually editing these numbers with a text editor (it is advised to exit BOINC manager & the scientific applications before doing so).



Are we supposed to change something manually ?
Do we need to increase the <max_nbytes> ?
Or do we need to check if this is necessary before doing so ?

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39630 - Posted: 23 Jan 2015 | 15:22:39 UTC - in response to Message 39627.

However, it could be easily fixed by manually editing these numbers with a text editor (it is advised to exit BOINC manager & the scientific applications before doing so).

Agreed. It would be required to exit BOINC first (stopping the client - the Manager doesn't matter), before making the edit - very carefully, and using only a plain-text editor. Client_state.xml is only ever read by BOINC at start-up - it's effectively a hard-copy backup file, so any edits made while the BOINC client is running are overwritten by the next dump from memory.

I think we should wait for confirmation that a real problem exists (or doesn't, as the case may be) before advocating wholesale file editing. I did send a PM to Gerard after my last post here, suggesting that he monitors the early returns, but I haven't received any feedback yet.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39631 - Posted: 23 Jan 2015 | 15:53:05 UTC - in response to Message 39630.

However, it could be easily fixed by manually editing these numbers with a text editor (it is advised to exit BOINC manager & the scientific applications before doing so).

Agreed. It would be required to exit BOINC first (stopping the client - the Manager doesn't matter), before making the edit - very carefully, and using only a plain-text editor. Client_state.xml is only ever read by BOINC at start-up - it's effectively a hard-copy backup file, so any edits made while the BOINC client is running are overwritten by the next dump from memory.

I think we should wait for confirmation that a real problem exists (or doesn't, as the case may be) before advocating wholesale file editing. I did send a PM to Gerard after my last post here, suggesting that he monitors the early returns, but I haven't received any feedback yet.

It is proven:
23/01/2015 16:47:23 | GPUGRID | Computation for task 4x14-GERARD_VERYLONG_CXCL12_confAna-0-1-RND7430_0 finished 23/01/2015 16:47:23 | GPUGRID | Output file 4x14-GERARD_VERYLONG_CXCL12_confAna-0-1-RND7430_0_9 for task 4x14-GERARD_VERYLONG_CXCL12_confAna-0-1-RND7430_0 exceeds size limit. 23/01/2015 16:47:23 | GPUGRID | File size: 186931932.000000 bytes. Limit: 128000000.000000 bytes

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39632 - Posted: 23 Jan 2015 | 16:03:23 UTC - in response to Message 39631.

I was afraid so. What's more, the error is declared immediately after the task finishes: I was hoping that possibly disabling networking as the task approaches completion would buy enough time for an edit, but evidently not.

OK, time for those who read the forums to get out their editors, and a supply of resent tasks (which will themselves need editing) from those who don't.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39634 - Posted: 23 Jan 2015 | 16:18:28 UTC - in response to Message 39632.
Last modified: 23 Jan 2015 | 16:18:52 UTC

This is deadly serious! Every workunit will fail to upload, so the whole "challenge" will come to naught without user intervention.
Those who can't edit their client_state.xml should abort these workunits immediately.
Also, if you've received a fresh one of them, you should edit the client_state.xml again!

Jacob Klein
Send message
Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39635 - Posted: 23 Jan 2015 | 16:24:35 UTC

It sounds like the batch is bad.
The cancellation, and rebatching, should all be done server-side.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39636 - Posted: 23 Jan 2015 | 16:46:58 UTC

OK, so here are the KISS ('Keep it simple, stupid') instructions.

1) Check if you are running a GPUGrid task with 'GERARD_VERYLONG_CXCL12_confAna' in the task name - or if you have one waiting to start. If you can't see one anywhere, relax and do nothing (except maybe open a beer).

2) If you have one of these tasks, read to the end of these instructions. If you feel confident about following them and carrying out the (very simple) edit required - carry on. If you don't feel confident, abort the task - running without editing it is a waste of time.

3) OK, you've decided to edit. First, shut down BOINC, making sure that the science applications shut down as well.

4) Find the file 'client_state.xml' in your BOINC Data folder. Under Windows, this is likely - if you accepted the default installation setting - to be C:\Programdata\BOINC: under Linux, it might be /var/lib/boinc

5) Open client_state.xml for editing, using a plain-text editor. Under Windows, NotePad or one of its replacements like NotePad++ is recommended - linux users are on their own here, but probably know their own toolsets.

6) Search the file for that GERARD_VERYLONG_CXCL12_confAna text we started with. There will be many search hits: we are looking for a <file>...</file> section like this:

<file> <name>2x23-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3250_0_9</name> <nbytes>0.000000</nbytes> <max_nbytes>128000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file>

7) Make sure you have exactly the right section: the last number before </name> should be _9, and there should be an <upload_url> line.

8) Change the first three numbers after <max_nbytes> from 128 to 256. Just those three numbers - don't accidentally delete any punctuation, change the number of zeroes, or make any other change.

9) Repeat steps (6), (7) and (8) for each separate VERYLONG task that you have on the system.

10) Save the file, restart BOINC, and relax. All done.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39637 - Posted: 23 Jan 2015 | 16:52:25 UTC - in response to Message 39635.

It sounds like the batch is bad.
The cancellation, and rebatching, should all be done server-side.

And they possibly will be. But in the meantime, we can do something to help.

Look at the opening post in this thread: these are an urgent research challenge, "whose results we need as soon as possible (we are in a hurry)". That was over 24 hours ago, and many tasks will be approaching completion. We can save them, and get good data back today or tomorrow. If we have to go through the batch recall process (with the weekend already started in Barcelona), it'll probably be Tuesday before any results are returned. Why wait?

Jacob Klein
Send message
Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39638 - Posted: 23 Jan 2015 | 16:59:02 UTC
Last modified: 23 Jan 2015 | 17:01:04 UTC

While your instructions are very well laid out, manual intervention is something that a lot of people probably won't be doing, in my estimation.

Thus, I would think the majority of the tasks out there, will go to completion, then fail, even if a handful of experts attempt client state manipulation.

This means that:
a) GPUGrid won't get the "task failed" notice until after the task ran
b) GPUGrid won't get the full batch of results that they are requesting
c) GPUGrid will be relying on user intervention to save the day.

Instead, it seems most appropriate, to me, for them to immediately cancel the batch and re-issue a corrected one. Why wait? :)

I'm not trying to pick an argument. Really. I'm just pointing out the option that, if I were admin'ing, I might choose to do.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39639 - Posted: 23 Jan 2015 | 17:05:16 UTC - in response to Message 39636.

4) Find the file 'client_state.xml' in your BOINC Data folder. Under Windows, this is likely - if you accepted the default installation setting - to be C:\Programdata\BOINC: under Linux, it might be /var/lib/boinc

As the ProgramData folder is hidden, it's easier to make a shortcut to notepad, which immediately opens the client_state.xml for editing:
1. right click on an empty area of your desktop
2. select new -> shortcut
3. enter the follwing text to the input field:
C:\Windows\System32\notepad.exe c:\ProgramData\BOINC\client_state.xml
4. click next
5. enter a self-explanatory name for this shortcut: edit client_state.xml

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39640 - Posted: 23 Jan 2015 | 17:06:38 UTC - in response to Message 39638.

Instead, it seems most appropriate, to me, for them to immediately cancel the batch and re-issue a corrected one. Why wait? :)

Because I for one would resent the waste of 20 hours of perfectly salvageable crunching. Save what can be saved, and only reissue the remainder. This project tends to attract people with a pretty high level of motivation: let them demonstrate that their skill level matches their determination, before you condemn them all as incapable. But I agree - many tasks will fail, simply because not enough people will pick up news of the problem from this thread - unless the admins can post a second news item flagged to appear as a Notice?

Jacob Klein
Send message
Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39641 - Posted: 23 Jan 2015 | 17:07:08 UTC - in response to Message 39638.

You could just open File Explorer, and then in the address bar, type "%ProgramData%" (without the double-quotes).

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39642 - Posted: 23 Jan 2015 | 17:10:38 UTC - in response to Message 39638.

While your instructions are very well laid out, manual intervention is something that a lot of people probably won't be doing, in my estimation.

Thus, I would think the majority of the tasks out there, will go to completion, then fail, even if a handful of experts attempt client state manipulation.

This means that:
a) GPUGrid won't get the "task failed" notice until after the task ran
b) GPUGrid won't get the full batch of results that they are requesting
c) GPUGrid will be relying on user intervention to save the day.

Instead, it seems most appropriate, to me, for them to immediately cancel the batch and re-issue a corrected one. Why wait? :)

I'm not trying to pick an argument. Really. I'm just pointing out the option that, if I were admin'ing, I might choose to do.

I agree.
The instructions intended for those who want to save 20+ hours of crunching.
If the project aborts the batch, those pieces which are under processing won't be aborted, only those which are sitting in the queue.

Carlos Augusto Engel
Send message
Joined: 5 Jun 09
Posts: 38
Credit: 2,880,758,878
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39643 - Posted: 23 Jan 2015 | 17:19:26 UTC

I will follow your instructions.
I have just one 'VERYLONG'.

____________

ROBtheLIONHEART
Send message
Joined: 21 Nov 13
Posts: 34
Credit: 636,026,131
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 39644 - Posted: 23 Jan 2015 | 17:21:20 UTC - in response to Message 39638.
Last modified: 23 Jan 2015 | 17:28:49 UTC

I agree.Most wont even know. I last checked on mine about 7.5 hrs in all seamed well. Next time I had a chance to check on it it was competed. When I came to the site to see how long it did actually take I found it of course did not upload. So 17.5 hrs wasted. Bummer :( It uploaded at 15.41 UTC so the info was too late for me

Gerard
Send message
Joined: 26 Mar 14
Posts: 101
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 39645 - Posted: 23 Jan 2015 | 17:37:08 UTC
Last modified: 23 Jan 2015 | 17:53:11 UTC

Please do not panic. I'm going to discuss this issue with my superiors. I'll keep you updated.

Profile [AF>Amis des Lapins] Phil...
Send message
Joined: 16 Jul 13
Posts: 56
Credit: 1,626,354,890
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39646 - Posted: 23 Jan 2015 | 17:42:34 UTC

Thank you to Richard Haselgrove, Retvari Zoltan* and Jacon Klein for their explanation, pro-activity and involvment.

Have updated the "client_state" for the 5 "GERARD VERY LONG" I have.

Hope, if it works, that they won't cancel the whole badge !

First 2 results in about 3h30 and 5h15 ...

Best,

Phil1966

sarge104
Send message
Joined: 21 Nov 13
Posts: 5
Credit: 7,420,264
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 39647 - Posted: 23 Jan 2015 | 17:50:05 UTC - in response to Message 39636.

OK, so here are the KISS ('Keep it simple, stupid') instructions.

4) Find the file 'client_state.xml' in your BOINC Data folder. Under Windows, this is likely - if you accepted the default installation setting - to be C:\Programdata\BOINC: under Linux, it might be /var/lib/boinc

5) Open client_state.xml for editing, using a plain-text editor. Under Windows, NotePad or one of its replacements like NotePad++ is recommended - linux users are on their own here, but probably know their own toolsets.

6) Search the file for that GERARD_VERYLONG_CXCL12_confAna text we started with. There will be many search hits: we are looking for a <file>...</file> section like this:

<file> <name>2x23-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3250_0_9</name> <nbytes>0.000000</nbytes> <max_nbytes>128000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file>

7) Make sure you have exactly the right section: the last number before </name> should be _9, and there should be an <upload_url> line.

8) Change the first three numbers after <max_nbytes> from 128 to 256. Just those three numbers - don't accidentally delete any punctuation, change the number of zeroes, or make any other change.

9) Repeat steps (6), (7) and (8) for each separate VERYLONG task that you have on the system.

10) Save the file, restart BOINC, and relax. All done.


Thank you guys for bringing this info out, I'm hoping by the time I get home I will be able to make the edits in time. Like others have stated it is wasteful to use all this energy only to fail during upload.

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39648 - Posted: 23 Jan 2015 | 18:21:51 UTC - in response to Message 39636.


6) Search the file for that GERARD_VERYLONG_CXCL12_confAna text we started with. There will be many search hits: we are looking for a <file>...</file> section like this:

<file> <name>2x23-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3250_0_9</name> <nbytes>0.000000</nbytes> <max_nbytes>128000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file>

7) Make sure you have exactly the right section: the last number before </name> should be _9, and there should be an <upload_url> line.

8) Change the first three numbers after <max_nbytes> from 128 to 256. Just those three numbers - don't accidentally delete any punctuation, change the number of zeroes, or make any other change.

9) Repeat steps (6), (7) and (8) for each separate VERYLONG task that you have on the system.

10) Save the file, restart BOINC, and relax. All done.


Richard,
Why is it that max_nbytes only needs to be change for the _9 file, and not the others? - TIA

<file> <name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_1</name> <nbytes>0.000000</nbytes> <max_nbytes>50000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file> <file> <name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_2</name> <nbytes>0.000000</nbytes> <max_nbytes>50000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file> <file> <name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_3</name> <nbytes>0.000000</nbytes> <max_nbytes>50000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file> <file> <name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_4</name> <nbytes>0.000000</nbytes> <max_nbytes>256000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file> <file> <name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_5</name> <nbytes>0.000000</nbytes> <max_nbytes>10000000.000000</max_nbytes> <status>0</status> <gzip_when_done/> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file> <file> <name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_6</name> <nbytes>0.000000</nbytes> <max_nbytes>10000000.000000</max_nbytes> <status>0</status> <gzip_when_done/> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file> <file> <name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_7</name> <nbytes>0.000000</nbytes> <max_nbytes>10000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file> <file> <name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_8</name> <nbytes>0.000000</nbytes> <max_nbytes>256000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file> <file> <name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_9</name> <nbytes>0.000000</nbytes> <max_nbytes>256000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file> <file> <name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_10</name> <nbytes>0.000000</nbytes> <max_nbytes>10000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file> <file> <name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_11</name> <nbytes>0.000000</nbytes> <max_nbytes>5000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file>

____________
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39649 - Posted: 23 Jan 2015 | 18:29:41 UTC - in response to Message 39648.

Richard,
Why is it that max_nbytes only needs to be change for the _9 file, and not the others? - TIA

Because, in my experience, only one of the multiple upload files grows proportionately to the runtime of the task. In the case of the VERYLONG tasks, it's file _9 which is the one which grows - and in this case grows too much.

No harm will be done if you extend the limits on the other files as well, but in my experience doing multiple repetitive edits is when I get bored, tired - and sloppy. And I make mistakes.

Client_state.xml is a very important and sensitive file, and if you break the file 'shape' - its XML structure - in even the most trivial way, you can lose a lot of work - not just from the project you're trying to tweak. I'd always advocate making the smallest, simplest change possible - hence KISS.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39650 - Posted: 23 Jan 2015 | 18:33:07 UTC
Last modified: 23 Jan 2015 | 18:34:50 UTC

My GTX 980 host is uploading the first result.
The final file size is 186.980.704 bytes.
The total processing time is 15h 39m 51s.

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39651 - Posted: 23 Jan 2015 | 18:39:50 UTC - in response to Message 39636.
Last modified: 23 Jan 2015 | 18:47:47 UTC

OK, so here are the KISS ('Keep it simple, stupid') instructions.

1) Check if you are running a GPUGrid task with 'GERARD_VERYLONG_CXCL12_confAna' in the task name - or if you have one waiting to start. If you can't see one anywhere, relax and do nothing (except maybe open a beer).

2) If you have one of these tasks, read to the end of these instructions. If you feel confident about following them and carrying out the (very simple) edit required - carry on. If you don't feel confident, abort the task - running without editing it is a waste of time.

Edit: Hmm from this example above it looks like some WUs may have _1_9 instead of the _0_9 that all of mine had:

<file>
<name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_9</name>
<nbytes>0.000000</nbytes>
<max_nbytes>256000000.000000</max_nbytes>
<status>0</status>
<upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url>
</file>

3) OK, you've decided to edit. First, shut down BOINC, making sure that the science applications shut down as well.

4) Find the file 'client_state.xml' in your BOINC Data folder. Under Windows, this is likely - if you accepted the default installation setting - to be C:\Programdata\BOINC: under Linux, it might be /var/lib/boinc

5) Open client_state.xml for editing, using a plain-text editor. Under Windows, NotePad or one of its replacements like NotePad++ is recommended - linux users are on their own here, but probably know their own toolsets.

6) Search the file for that GERARD_VERYLONG_CXCL12_confAna text we started with. There will be many search hits: we are looking for a <file>...</file> section like this:

<file> <name>2x23-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3250_0_9</name> <nbytes>0.000000</nbytes> <max_nbytes>128000000.000000</max_nbytes> <status>0</status> <upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url> </file>

7) Make sure you have exactly the right section: the last number before </name> should be _9, and there should be an <upload_url> line.

8) Change the first three numbers after <max_nbytes> from 128 to 256. Just those three numbers - don't accidentally delete any punctuation, change the number of zeroes, or make any other change.

9) Repeat steps (6), (7) and (8) for each separate VERYLONG task that you have on the system.

10) Save the file, restart BOINC, and relax. All done.

Thanks Richard. I found 3 of the verylong WUs and executed your fix. One thing that may make this easier: I simply searched for _0_9 and in each case the first instance found was the correct one. Just make sure it's a verylong WU and not a Noelia that you're editing. All 3 of these are on 750Ti cards and completion looks to be around 50 hours.

Edit: Hmm from the example above it looks like some of the WUs may have _1_9 instead of _0_9 that all of mine had:

<file>
<name>2x10-GERARD_VERYLONG_CXCL12_confAna-0-1-RND3907_1_9</name>
<nbytes>0.000000</nbytes>
<max_nbytes>256000000.000000</max_nbytes>
<status>0</status>
<upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url>
</file>

Gerard
Send message
Joined: 26 Mar 14
Posts: 101
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 39652 - Posted: 23 Jan 2015 | 18:40:34 UTC

The BOINC administrator just raised the upload limit to 512 Mb, please let us know if you can upload the WU now.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39653 - Posted: 23 Jan 2015 | 18:49:14 UTC - in response to Message 39652.

The BOINC administrator just raised the upload limit to 512 Mb, please let us know if you can upload the WU now.

It should be fine for newly created WUs, but I'm not sure whether the change will propagate to automatically-generated replacements for tasks which fail - we'll need to keep an eye on those.

It certainly won't be passed to tasks which are already 'out in the field' - on volunteers' computers. They will have to be modified manually, or allowed to fail.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39654 - Posted: 23 Jan 2015 | 18:55:31 UTC

The upload of the first result is finished.
Here are the details: 3x17-GERARD_VERYLONG_CXCL12_confAna-0-1-RND9026_1

Profile [AF>Amis des Lapins] Phil...
Send message
Joined: 16 Jul 13
Posts: 56
Credit: 1,626,354,890
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39655 - Posted: 23 Jan 2015 | 18:55:31 UTC - in response to Message 39653.

The BOINC administrator just raised the upload limit to 512 Mb, please let us know if you can upload the WU now.

It should be fine for newly created WUs, but I'm not sure whether the change will propagate to automatically-generated replacements for tasks which fail - we'll need to keep an eye on those.

It certainly won't be passed to tasks which are already 'out in the field' - on volunteers' computers. They will have to be modified manually, or allowed to fail.


Even if you "update" the project ?

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39656 - Posted: 23 Jan 2015 | 18:55:32 UTC - in response to Message 39650.

My GTX 980 host is uploading the first result.
The final file size is 186.980.704 bytes.
The total processing time is 15h 39m 51s.

Credit 600,000.00

Congratulations on the home run - and thanks for the confirmation that the file edit is effective.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39657 - Posted: 23 Jan 2015 | 19:08:54 UTC - in response to Message 39655.

The BOINC administrator just raised the upload limit to 512 Mb, please let us know if you can upload the WU now.

It should be fine for newly created WUs, but I'm not sure whether the change will propagate to automatically-generated replacements for tasks which fail - we'll need to keep an eye on those.

It certainly won't be passed to tasks which are already 'out in the field' - on volunteers' computers. They will have to be modified manually, or allowed to fail.


Even if you "update" the project ?

I've received a 1894-NOELIA_BI3_unbind-1-10-RND9593_0 just now, and the file info has the old size limit:
<file_info> <name>1894-NOELIA_BI3_unbind-1-10-RND9593_0_8</name> <nbytes>0.000000</nbytes> <max_nbytes>256000000.000000</max_nbytes> <generated_locally/> <status>0</status> <upload_when_present/> <url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</url> </file_info> <file_info> <name>1894-NOELIA_BI3_unbind-1-10-RND9593_0_9</name> <nbytes>0.000000</nbytes> <max_nbytes>128000000.000000</max_nbytes> <generated_locally/> <status>0</status> <upload_when_present/> <url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</url> </file_info>

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39658 - Posted: 23 Jan 2015 | 19:12:45 UTC - in response to Message 39656.

Congratulations on the home run - and thanks for the confirmation that the file edit is effective.

Thank you!
We had similar upload size problems before, and the solution was the same back then.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39659 - Posted: 23 Jan 2015 | 19:21:12 UTC - in response to Message 39658.

Congratulations on the home run - and thanks for the confirmation that the file edit is effective.

Thank you!
We had similar upload size problems before, and the solution was the same back then.

And we recently had the same thing at CPDN, which was why I checked - it had been bumped back to the top of my list of "things project administrators forget to do" when they're excited by an interesting bit of research.

Which reminds me.....

@ Gerard,
If you find yourself having to re-generate all or part of this batch of 'verylong' tasks, could you please adjust <rsc_fpops_est> proportionately, so that our BOINC clients show a fair estimate of the task runtime from the beginning, and the task doesn't mess up DCF when it finishes?

Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Send message
Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Level
Ser
Scientific publications
watwatwatwat
Message 39660 - Posted: 23 Jan 2015 | 19:27:24 UTC - in response to Message 39659.
Last modified: 23 Jan 2015 | 19:28:07 UTC

I've raised the limit in the DB for VERYLONG WUs. I'm not sure whether such changes propagate to clients at some time.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39661 - Posted: 23 Jan 2015 | 19:52:05 UTC

My second very long workunit is uploading.
7x6-GERARD_VERYLONG_CXCL12_confAna-0-1-RND0829_0

skydivingnerd
Send message
Joined: 26 Feb 13
Posts: 7
Credit: 2,242,660,281
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39662 - Posted: 23 Jan 2015 | 23:18:20 UTC

I'm glad I checked this thread again. Two of my clients have one of the very long WUs. I implemented the fix as described by Richard. I'll keep an eye on it and check the status on completion.
http://www.gpugrid.net/result.php?resultid=13737049 (GTX 770)
http://www.gpugrid.net/result.php?resultid=13737185 (GTX 670)
____________

biodoc
Send message
Joined: 26 Aug 08
Posts: 183
Credit: 6,466,114,375
RAC: 1,393,151
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39663 - Posted: 23 Jan 2015 | 23:55:24 UTC

Thanks for the detailed guidance Richard and Retvari!

I just got home from work and checked this thread. My WU was 90%+ done so just in time.

Profile caffeineyellow5
Avatar
Send message
Joined: 30 Jul 14
Posts: 225
Credit: 2,658,976,345
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwat
Message 39664 - Posted: 24 Jan 2015 | 0:05:16 UTC - in response to Message 39662.

4x6-GERARD_VERYLONG_CXCL12_confAna-0-1-RND1754_0_0 working now. It started out and continued to count down about 18.5 hours till completion, then at 48.8% finished and about 13 hours is jumped to 13.5 hours left. hah! Anyway, with just around 10 hours showing left I am changing the xml according to Richard's instructions. Only have one since I only have the one machine I run longs on. Good thing it has the 3 780's in it. I'll keep an eye out to see if I get any more in the near future also. Right now I have 3 queued and 3 working and only 1 of these.
____________
1 Corinthians 9:16 "For though I preach the gospel, I have nothing to glory of: for necessity is laid upon me; yea, woe is unto me, if I preach not the gospel!"
Ephesians 6:18-20, please ;-)
http://tbc-pa.org

wiyosaya
Send message
Joined: 22 Nov 09
Posts: 114
Credit: 589,114,683
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39665 - Posted: 24 Jan 2015 | 0:14:53 UTC - in response to Message 39600.

We just launched 400 very long WU (they will take about 24h in a 780GTX) named VERYLONG_CXCL12_confAna whose results we need as soon as possible (we are in a hurry).

I've got two of them on my GTX 780Ti host.
According to my linear approximation made at 12.3% progress, the total computing time will be about 18 hours and 15 minutes.

They come with a credit+bonus of 400K.

That's nice.

Please, if you don't have a good graphic card, reject them.

That's an inappropriate way to arrange such batches.
You should set up a third queue for these purposes.
I couldn't receive one of these workunits on my GTX980 host. I'm sure I'm not alone with that.
EDIT: I had to abort 10 other workunits to receive one of these on my GTX980 host.
That's why your method is dangerous: it propagates failed workunits (by encouraging user intervention)

For the brave ones, take it as a challenge and see you on the performance tab ;)

Challenge accepted. :)

Absolutely agree that this is an inappropriate way to handle these large work units. Another "very long" queue like Retvari says or have the server automatically figure out what computers should get them based on the installed graphics cards.
____________

Betting Slip
Send message
Joined: 5 Jan 09
Posts: 670
Credit: 2,498,095,550
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39666 - Posted: 24 Jan 2015 | 0:17:14 UTC

Finally got one but on my GTX560ti which I aborted but nothing for my GTX970

Need to get a better system, no doubt about that.

Profile caffeineyellow5
Avatar
Send message
Joined: 30 Jul 14
Posts: 225
Credit: 2,658,976,345
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwat
Message 39667 - Posted: 24 Jan 2015 | 1:26:21 UTC - in response to Message 39665.
Last modified: 24 Jan 2015 | 1:29:56 UTC

Absolutely agree that this is an inappropriate way to handle these large work units. Another "very long" queue like Retvari says or have the server automatically figure out what computers should get them based on the installed graphics cards.

As far as that goes, the Notice that went out and started this thread is clear that they had a limited number of work units that needed immediate release and ASAP completion. The fact that they are very long is secondary to the fact that they are needed ASAP. Having the priority on the ASAP means that adding a different queue for them involves either voluntary addition to that queue by the end users, maybe in response to a notice that goes out calling for them, or forcing everyone onto that queue which then ends in the exact thing you have right now, which is having them go out to the first come/first serve whether they can be completed or not by those machines.

I don't think either of these is an appropriate thing for an on the spot addition of a longer task that needs to be completed ASAP. So that leaves the other option, which is having the servers determine if the machine can run it in the time needed before assigning it to that machine. I suppose that could be done, but I think if it could be currently done immediately and it was not, it was just a bad judgment call. Based on that, I would assume that their side of the system does not currently have that ability past what the user tells them you can do, via the queues you choose for your machines, i.e. Short, Long, Test, CPU, etc. I think assigning these tasks to the machines that are set to receive "normal" long work units and then sending out an official BOINC Notice to flash on the client IS the right way to have done this this time. And then, based on finances and manpower, work on adding more functionality to their back-end to determine what machines can do what tasks to fine tune what is already in place in the voluntary queues.

It seems very clear that they were not expecting these VERYLONG work units too far in advance to actually have done this any better, making the way it was done the best way it could have been done. And now for the future, if it is to be done on occasion, not much manpower and time needs to go into it to "correct" the process, but if the VERYLONG work units are to become a regular thing for the grid, then time should be invested to add a queue or help their servers better determine the potential of machines to finish them in the times needed.

All in all, people who have no better solutions, but only want to share frustrations are better off for everyone involved to state that there is an issue, what they think the issue is, and then know that someone saw the statement and will work to fix it if it needs fixing. We don't need to get emotionally involved unless we are singled out as overtly ignored. And then, there is always more projects or more official channels than the fellow user base and volunteer workers on a forum board. Not flaming, just always want to see solution makers making solutions and agitators making quiet. Life works better that way around. :-)

Jamaar@Siam
Send message
Joined: 7 May 13
Posts: 1
Credit: 157,304,655
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 39668 - Posted: 24 Jan 2015 | 3:51:49 UTC

I see that after the first VERYLONG units errored out, it has been posted that it was necessary to edit the xml before finishing the task.

When I read that, it had taken my GTX 770 31 hours 24 minutes to complete the task.
Not amused to see the log:

24/01/2015 08:46:57 | GPUGRID | Output file 6x13-GERARD_VERYLONG_CXCL12_confAna-0-1-RND0906_0_9 for task 6x13-GERARD_VERYLONG_CXCL12_confAna-0-1-RND0906_0 exceeds size limit.
24/01/2015 08:46:57 | GPUGRID | File size: 186797160.000000 bytes. Limit: 128000000.000000 bytes


31 Hours of wasted time, which I could have used for 1 abandonned and 1 suspended NOELIA, just because I answered to the notice in my BOINC-manager which asked for help to finish those VERYLONG wu's ASAP. :-(

Viktor Svantner
Send message
Joined: 13 Feb 11
Posts: 25
Credit: 7,499,927,523
RAC: 7,484
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39670 - Posted: 24 Jan 2015 | 5:35:30 UTC

Hello.

I got the same unhappy result.

http://www.gpugrid.net/result.php?resultid=13737081

It took my Titan black roughly 25 hours to complete the task.

Next time, please be more careful what you prepare for cranching. I am very keen to help, but this is a waste of time that could had been used for other tasks.

Viktor Svantner
Send message
Joined: 13 Feb 11
Posts: 25
Credit: 7,499,927,523
RAC: 7,484
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39671 - Posted: 24 Jan 2015 | 5:35:34 UTC

Hello.

I got the same unhappy result.

http://www.gpugrid.net/result.php?resultid=13737081

It took my Titan black roughly 25 hours to complete the task.

Next time, please be more careful what you prepare for cranching. I am very keen to help, but this is a waste of time that could had been used for other tasks.
____________

RaymondFO*
Send message
Joined: 22 Nov 12
Posts: 72
Credit: 14,040,706,346
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39672 - Posted: 24 Jan 2015 | 5:52:46 UTC - in response to Message 39639.
Last modified: 24 Jan 2015 | 5:54:20 UTC

4) Find the file 'client_state.xml' in your BOINC Data folder. Under Windows, this is likely - if you accepted the default installation setting - to be C:\Programdata\BOINC: under Linux, it might be /var/lib/boinc


For linux, the file is under "/var/lib/boinc-client". You should be able to edit the file in terminal using "sudo nano".

Profile 2DJFcFTcRK5gHhxpndmdMJorY...
Avatar
Send message
Joined: 15 Nov 12
Posts: 10
Credit: 792,812,843
RAC: 0
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 39677 - Posted: 24 Jan 2015 | 14:53:35 UTC

Hi,

I got an upload error for one of long running tasks. Any Boinc infrastructure limit reached?


Task:
10583226 3x24-GERARD_VERYLONG_CXCL12_confAna-0-1-RND1384_0

Error Msg:
12:58:29 (8716): called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
<file_name>3x24-GERARD_VERYLONG_CXCL12_confAna-0-1-RND1384_0_9</file_name>
<error_code>-131 (file size too big)</error_code>
</file_xfer_error>

</message>

Georg

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39678 - Posted: 24 Jan 2015 | 15:43:47 UTC - in response to Message 39677.

Hi,

I got an upload error for one of long running tasks. Any Boinc infrastructure limit reached?

No, not an actual BOINC intrinsic limit, just a configuration oversight in this particular model run. Read on further down this thread.

TJ
Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39679 - Posted: 24 Jan 2015 | 16:36:33 UTC
Last modified: 24 Jan 2015 | 16:38:36 UTC

Errors can happen, so no hard feeling to the research project.
Unfortunately I was away yesterday when Zoltan mentioned the issue and together with Richard, a solution was given.

Mine finished in error before I could intervene, thus about 24hours wasted on my two fastest cards.

Edit: Repair the WU's Gerard, issue them again and I will let them crunch again on my rigs.
____________
Greetings from TJ

Dr. A
Send message
Joined: 2 Oct 10
Posts: 1
Credit: 87,885,612
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39682 - Posted: 24 Jan 2015 | 17:33:22 UTC

mine also finished with result upload error:

upload failure: <file_xfer_error>
<file_name>3x1-GERARD_VERYLONG_CXCL12_confAna-0-1-RND5972_0_9</file_name>
<error_code>-131 (file size too big)</error_code>
</file_xfer_error>

sarge104
Send message
Joined: 21 Nov 13
Posts: 5
Credit: 7,420,264
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 39683 - Posted: 24 Jan 2015 | 18:38:41 UTC

For those that can't be bothered to read through the topic...Your fix for the "old" WU's.

http://www.gpugrid.net/forum_thread.php?id=3988&nowrap=true#39636

Crunch on! Thanks again to the awesome people that gave their time and energy for solving this issue in a timely manner! :D

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39684 - Posted: 24 Jan 2015 | 18:47:33 UTC - in response to Message 39683.

http://www.gpugrid.net/forum_thread.php?id=3988&nowrap=true#39636

Jim1348
Send message
Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39685 - Posted: 24 Jan 2015 | 19:02:41 UTC - in response to Message 39636.
Last modified: 24 Jan 2015 | 19:03:16 UTC

OK, so here are the KISS ('Keep it simple, stupid') instructions.
1) Check if you are running a GPUGrid task with 'GERARD_VERYLONG_CXCL12_confAna' in the task name - or if you have one waiting to start. If you can't see one anywhere, relax and do nothing (except maybe open a beer).

That worked for a while, but I picked up one this morning on my GTX 660 Ti, so apparently it is not foolproof.

It is now 10 hours into a 40-hour run. However, the DB fix that Toni mentioned was included in the files that were downloaded, so I have:
<max_nbytes>512000000.000000</max_nbytes>

We will see if it all works. Thanks for the tips though.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39686 - Posted: 24 Jan 2015 | 19:08:23 UTC - in response to Message 39685.

Provided the _1_9 file has the increased <max_nbytes> 512,000,000 you should be fine (_1 in this case, because it's a resent task).

Jim1348
Send message
Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39687 - Posted: 24 Jan 2015 | 19:20:33 UTC - in response to Message 39686.

Yes, it is the _1_9 file.
<name>7x27-GERARD_VERYLONG_CXCL12_confAna-0-1-RND7320_1_9</name>

I don't know if a GTX 660 Ti is really worth doing it on, but since it started and a lot of others have errored out, I figured it would be the fastest way to get a result back.

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39691 - Posted: 25 Jan 2015 | 1:05:56 UTC - in response to Message 39687.
Last modified: 25 Jan 2015 | 1:14:14 UTC

I don't know if a GTX 660 Ti is really worth doing it on,

I would think not a great proposition. I unfortunately have 4 of these running on my 750Ti cards. Didn't try to get them. The first one finished in 48.5 hours so 400k credits, no bonuses made it far less than simply running the normal WUs. That combined with the chance of erroring out, I hope we've seen the last of these. Anyway, I'd reserve these guys for only the very fastest GPUs and limit them to those.

Jim1348
Send message
Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39693 - Posted: 25 Jan 2015 | 2:23:28 UTC - in response to Message 39691.

Quite true, but we don't have much flexibility here. Once they start, we may not catch them for a few hours as we both know. Then we have to decide whether to keep going or not. With the lack of any guidance, it is anyone's guess. I am sure they will think of a better way to do it next time.

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39694 - Posted: 25 Jan 2015 | 3:04:35 UTC - in response to Message 39693.

Just got another one, but this time I could shunt it off to my one 670 card (thanks flashhawk!). We'll see how that goes.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39695 - Posted: 25 Jan 2015 | 8:11:45 UTC

I've received another one of these, and I confirm that the upload size has been doubled, so this problem is fixed by now.

<file_info> <name>8x3-GERARD_VERYLONG_CXCL12_confAna-0-1-RND4073_2_8</name> <nbytes>0.000000</nbytes> <max_nbytes>256000000.000000</max_nbytes> <generated_locally/> <status>0</status> <upload_when_present/> <url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</url> </file_info> <file_info> <name>8x3-GERARD_VERYLONG_CXCL12_confAna-0-1-RND4073_2_9</name> <nbytes>0.000000</nbytes> <max_nbytes>512000000.000000</max_nbytes> <generated_locally/> <status>0</status> <upload_when_present/> <url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</url> </file_info>

However it is not clear for me, why is the size of the _9 file bigger than the size of the _8 file, while the in the reality it's the opposite.

Strebsi
Send message
Joined: 10 Jan 13
Posts: 5
Credit: 52,061,897
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwat
Message 39703 - Posted: 25 Jan 2015 | 19:31:52 UTC

I got 2 of the very long wus and computed 60 hours total. After reaching 99 % each wu aborted with an computing error.
---> 0 (Zero) Points for 60 hours computing
Thank you...

Dayle Diamond
Send message
Joined: 5 Dec 12
Posts: 84
Credit: 1,628,088,415
RAC: 1,457,783
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39707 - Posted: 25 Jan 2015 | 21:49:54 UTC - in response to Message 39703.

I got 2 of the very long wus and computed 60 hours total. After reaching 99 % each wu aborted with an computing error.
---> 0 (Zero) Points for 60 hours computing
Thank you...


That is unfortunate. Please post the error message.

Jacob Klein
Send message
Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39711 - Posted: 25 Jan 2015 | 22:36:15 UTC - in response to Message 39707.

I got 2 of the very long wus and computed 60 hours total. After reaching 99 % each wu aborted with an computing error.
---> 0 (Zero) Points for 60 hours computing
Thank you...


That is unfortunate. Please post the error message.


He doesn't have to. You can click on his name, then click View Computers, then see the 2 failed tasks. They failed because of the "upload file size too big" error that was outlined in this thread.

Tough break, it happens. Try not to think of it in terms of "lost credits". Think of it instead in terms of "I tried to help humanity, but it didn't work out. I hope it does next time."

Regards,
Jacob Klein

Profile Mumak
Avatar
Send message
Joined: 7 Dec 12
Posts: 92
Credit: 225,897,225
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 39716 - Posted: 25 Jan 2015 | 23:13:49 UTC

I have just noticed that my 750 Ti got one of these.
It's at 92% now and total time seems to be 44 hours. Not bad for such a small and low power GPU...

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39717 - Posted: 25 Jan 2015 | 23:36:15 UTC - in response to Message 39716.

I have just noticed that my 750 Ti got one of these.
It's at 92% now and total time seems to be 44 hours. Not bad for such a small and low power GPU...

Check that it doesn't suffer from the "upload file size too big" problem discussed in this thread (it shouldn't - you got a resend issued well after remedial action was taken).

Otherwise all your hard work won't help with the urgent research project.

Profile Mumak
Avatar
Send message
Joined: 7 Dec 12
Posts: 92
Credit: 225,897,225
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 39719 - Posted: 26 Jan 2015 | 7:03:44 UTC - in response to Message 39717.

I have just noticed that my 750 Ti got one of these.
It's at 92% now and total time seems to be 44 hours. Not bad for such a small and low power GPU...

Check that it doesn't suffer from the "upload file size too big" problem discussed in this thread (it shouldn't - you got a resend issued well after remedial action was taken).

Otherwise all your hard work won't help with the urgent research project.


Yup, I checked. It was a resend, so the max_nbytes was properly adjusted.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39721 - Posted: 26 Jan 2015 | 11:06:00 UTC

I'd be fascinated to hear how well we did with the original 400-WU research challenge over the weekend, after that slightly shaky start.

Gerard
Send message
Joined: 26 Mar 14
Posts: 101
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 39722 - Posted: 26 Jan 2015 | 15:18:02 UTC

Before anything, we would like to apologize for the big mess of this last weekend. It obviously was a peculiar situation and as such, our settings proved not to be correctly prepared for it. I hope we have learned the lesson.

About the results, I just correctly fetched around 50% of the WU I was expecting (~200) and they are ready to be analysed.

If someone is interested in this particular experiment: we are trying to assess the conformational behavior of a chemokine named CXCL12, related to a vast number of diseases including different types of cancer, neurodegenerative diseases and autoinmune disorders. By analysing the conformational changes occurring in a scale of 200ns (the length of the simulations you just crunched) we are aiming to unravel new surfaces and druggable protein pockets that we can use to design drugs against. Molecular simulations are, perhaps at exception of some NMR techniques, the only way of atomically determining the protein conformational space at such small scales and thus provide a unique insight into the natural dynamics of proteins.

Thanks again for your big crunching contribution and such fantastic forum support.

Jozef J
Send message
Joined: 7 Jun 12
Posts: 112
Credit: 1,118,845,172
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 39723 - Posted: 26 Jan 2015 | 15:48:53 UTC
Last modified: 26 Jan 2015 | 15:50:08 UTC

pretty, early in the morning, first and successful :-)
13744739 10583385 25 Jan 2015 | 2:12:05 UTC 26 Jan 2015 | 5:10:58 UTC Completed and validated 75,488.86 28,692.76 500,000.00.-

http://www.gpugrid.net/result.php?resultid=13744739

Viktor Svantner
Send message
Joined: 13 Feb 11
Posts: 25
Credit: 7,499,927,523
RAC: 7,484
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39724 - Posted: 26 Jan 2015 | 16:04:56 UTC

Hello.

I got one too and is already fixed, so my gtx 970 can crunch all night long.

Thanks guys for quick fix-up.
____________

VT
Send message
Joined: 28 Nov 13
Posts: 17
Credit: 153,786,987
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 39740 - Posted: 27 Jan 2015 | 6:54:47 UTC - in response to Message 39724.

confirmed new very longs are fixed.

seems like these ones have low atom count - only 95% power use at 1.5Ghz 970X

mymbtheduke
Send message
Joined: 3 Sep 12
Posts: 40
Credit: 186,780,650
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 39767 - Posted: 27 Jan 2015 | 18:25:53 UTC - in response to Message 39740.

So I get home from a work trip and see a very long running on my GTX 750. The task is at 49 hours and 77%. Despite missing the bonus. We should all get the bonus if we are putting in the time. I want it to finish. Looks like 15 hours to go. I notice the 750 was stuck at 400Mhz so I rebooted twice and it went back up to 1250Mhz. Not sure why or how long it ran at 400 but it is fixed now.

My question is: do I need to edit the text files for this to upload or will it go on its own now. I don't want all of this time to go to waste.

Thanks for any help.

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39771 - Posted: 27 Jan 2015 | 19:19:31 UTC - in response to Message 39767.
Last modified: 27 Jan 2015 | 19:20:35 UTC

My question is: do I need to edit the text files for this to upload or will it go on its own now. I don't want all of this time to go to waste.

I'd for sure check it according to the instructions above. I fixed 3 of them and they all finished fine. Without the fix they would have failed. Two more were already fixed, so did 5 in all. Don't care to see any more of these mammoths: aborted the last 3 received so they could run on faster cards. Your 750 time sounds about right as my 750Ti cards completed 4 of them, finishing in 48.5-50.5 hours. My lone 670 ran faster, about 38.5 hours.

mymbtheduke
Send message
Joined: 3 Sep 12
Posts: 40
Credit: 186,780,650
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 39772 - Posted: 27 Jan 2015 | 19:58:54 UTC - in response to Message 39771.

I don't know about you but I am very impressed with the 750. I can overclock the proc by 50 and the memory by 1 Ghz with no problem. Best card for $80 ever.

mymbtheduke
Send message
Joined: 3 Sep 12
Posts: 40
Credit: 186,780,650
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 39773 - Posted: 27 Jan 2015 | 20:08:10 UTC - in response to Message 39772.

This is what I found in my config file:

<file>
<name>10x40-GERARD_VERYLONG_CXCL12_confAna-0-1-RND4988_1_9</name>
<nbytes>0.000000</nbytes>
<max_nbytes>512000000.000000</max_nbytes>
<status>0</status>
<upload_url>http://www.gpugrid.org/PS3GRID_cgi/file_upload_handler</upload_url>
</file>


Does this look right? It has 512 instead of 128.

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39779 - Posted: 27 Jan 2015 | 22:26:36 UTC - in response to Message 39773.

Does this look right? It has 512 instead of 128.

Yes, that one should run through to the end and report normally, everything else being equal. No action needed on your part.

[CSF] Thomas H.V. DUPONT
Send message
Joined: 20 Jul 14
Posts: 732
Credit: 100,630,366
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwat
Message 39789 - Posted: 28 Jan 2015 | 8:15:58 UTC - in response to Message 39722.

Before anything, we would like to apologize for the big mess of this last weekend. It obviously was a peculiar situation and as such, our settings proved not to be correctly prepared for it. I hope we have learned the lesson.

About the results, I just correctly fetched around 50% of the WU I was expecting (~200) and they are ready to be analysed.

If someone is interested in this particular experiment: we are trying to assess the conformational behavior of a chemokine named CXCL12, related to a vast number of diseases including different types of cancer, neurodegenerative diseases and autoinmune disorders. By analysing the conformational changes occurring in a scale of 200ns (the length of the simulations you just crunched) we are aiming to unravel new surfaces and druggable protein pockets that we can use to design drugs against. Molecular simulations are, perhaps at exception of some NMR techniques, the only way of atomically determining the protein conformational space at such small scales and thus provide a unique insight into the natural dynamics of proteins.

Thanks again for your big crunching contribution and such fantastic forum support.

Thanks Gerard for the heads-up and these details about this research :)
Really appreciated.
____________
[CSF] Thomas H.V. Dupont
Founder of the team CRUNCHERS SANS FRONTIERES 2.0
www.crunchersansfrontieres

Profile caffeineyellow5
Avatar
Send message
Joined: 30 Jul 14
Posts: 225
Credit: 2,658,976,345
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwat
Message 39792 - Posted: 28 Jan 2015 | 9:23:24 UTC - in response to Message 39789.

It looks like I just got a brand new one 3 hours ago and I am crunching it now with the highest prio. 10.901% at 3:00:00 on my device 2 of 0,1,2.
____________
1 Corinthians 9:16 "For though I preach the gospel, I have nothing to glory of: for necessity is laid upon me; yea, woe is unto me, if I preach not the gospel!"
Ephesians 6:18-20, please ;-)
http://tbc-pa.org

TomaszPawel
Send message
Joined: 18 Aug 08
Posts: 121
Credit: 59,836,411
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39797 - Posted: 28 Jan 2015 | 13:28:23 UTC
Last modified: 28 Jan 2015 | 13:28:35 UTC

GTX570 - 53 h 44m 07s ufff done :)

WU
____________
POLISH NATIONAL TEAM - Join! Crunch! Win!

mymbtheduke
Send message
Joined: 3 Sep 12
Posts: 40
Credit: 186,780,650
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 39798 - Posted: 28 Jan 2015 | 13:51:40 UTC - in response to Message 39797.

ERRRRRRR. 217,000 secs of compute for nothing. It says error while computing. That was a waste. Yes, I am frustrated. I know I shouldn't be but 60 hours of compute down the drain. Don't know why. Any ideas?

http://www.gpugrid.net/result.php?resultid=13744228

http://www.gpugrid.net/result.php?resultid=13744228[url][/url]

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39802 - Posted: 28 Jan 2015 | 17:35:58 UTC - in response to Message 39798.
Last modified: 28 Jan 2015 | 17:38:18 UTC

ERRRRRRR. 217,000 secs of compute for nothing. It says error while computing. That was a waste. Yes, I am frustrated. I know I shouldn't be but 60 hours of compute down the drain. Don't know why. Any ideas?
http://www.gpugrid.net/result.php?resultid=13744228

This task's log contains a lot of error messages:

# The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 36378000) ... SWAN : FATAL : Cuda driver error 715 in file 'swanlibnv2.cpp' in line 1965. # SWAN swan_assert 0 ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 37079000) ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 37133000) ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 37963000) ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 41979000) ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 42124000) ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 42709000) ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 42718000) ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 43071000) ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 44636000) ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 44846000) ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 45046000) ... # The simulation has become unstable. Terminating to avoid lock-up (1) # Attempting restart (step 48215000) ... # The simulation has become unstable. Terminating to avoid lock-up (1)

Probably it's overclocked, and it can't take that much of that.
Its temperature doesn't go over 73°C, so it's not overheating.
You should either decrease the GPU clock by 10-20-50MHz, or increase the GPU voltage by 25mV (increase its power target).
But I don't recommend the latter.

mymbtheduke
Send message
Joined: 3 Sep 12
Posts: 40
Credit: 186,780,650
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 39804 - Posted: 28 Jan 2015 | 18:10:12 UTC - in response to Message 39802.

Good to know. Thanks for looking into it. Guess the overclock is too much.

Jacob Klein
Send message
Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39805 - Posted: 28 Jan 2015 | 18:13:53 UTC - in response to Message 39804.

A good test for overclocking stability, is to see if Heaven 4.0 can run for 5 solid hours without any freezing and without any TDRs logged in C:\Windows\LiveKernelReports\WATCHDOG.

Keep downclocking in 13 Mhz intervals, until you are sure it is rock solid in that test case :)

Profile caffeineyellow5
Avatar
Send message
Joined: 30 Jul 14
Posts: 225
Credit: 2,658,976,345
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwat
Message 39828 - Posted: 29 Jan 2015 | 10:40:19 UTC - in response to Message 39792.

Re- issued task due to time-out past deadline of original release. 26.60 hours, finished with no errors.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39829 - Posted: 29 Jan 2015 | 10:59:42 UTC
Last modified: 29 Jan 2015 | 11:01:16 UTC

There's another longer than usual batch called GERARD_CXCL12_confAnaFX_ada2, it took 9 hours for my GTX780Ti to complete.
It's granted 251,400 credits (including the 50% bonus).

Profile Tiger
Send message
Joined: 30 Jan 15
Posts: 7
Credit: 402,017,837
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwat
Message 39930 - Posted: 31 Jan 2015 | 0:40:33 UTC

Knocked it out in 5 hours ... EVGA GeForce 980 .. no sweat

____________

Dayle Diamond
Send message
Joined: 5 Dec 12
Posts: 84
Credit: 1,628,088,415
RAC: 1,457,783
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39932 - Posted: 31 Jan 2015 | 9:29:51 UTC - in response to Message 39930.

Knocked it out in 5 hours ... EVGA GeForce 980 .. no sweat


This appears to be the only task you've ever done:

854-NOELIA_PNP-5-10-RND5715_0

jlhal
Send message
Joined: 1 Mar 10
Posts: 147
Credit: 1,077,535,540
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39938 - Posted: 31 Jan 2015 | 13:52:25 UTC
Last modified: 31 Jan 2015 | 14:15:52 UTC

Hi !
Got 1 VERYLONG e few days ago : http://www.gpugrid.net/result.php?resultid=13744487


Titan Black Ghz Edition
ADAPTIVE , 1176Mhz Core Clk, 6Ghz Mem Clk, 73°C, 43% Fan speed .

GPU 79,604.53 (roughly 22h 7mn)
CPU 22,063.62
Credit 500000

PS : Gerard gives less credits than others for his WUs ! This is not very much "paid" got the spent time !

Regards
____________
Lubuntu 16.04.1 LTS x64

Profile Tiger
Send message
Joined: 30 Jan 15
Posts: 7
Credit: 402,017,837
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwat
Message 39939 - Posted: 31 Jan 2015 | 14:30:20 UTC - in response to Message 39932.
Last modified: 31 Jan 2015 | 14:32:05 UTC

yes.. just started/joined yesterday late afternoon..
:)
____________

Dayle Diamond
Send message
Joined: 5 Dec 12
Posts: 84
Credit: 1,628,088,415
RAC: 1,457,783
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39946 - Posted: 1 Feb 2015 | 0:04:35 UTC - in response to Message 39939.

Welcome Tiger and great to have you as a fellow volunteer. This forum is about the CXCL12_confAna tasks from a week ago.

NOELIA_PNP is a different project and has a very different runtime.

Profile Tiger
Send message
Joined: 30 Jan 15
Posts: 7
Credit: 402,017,837
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwat
Message 39949 - Posted: 1 Feb 2015 | 4:08:53 UTC - in response to Message 39946.

Thanx Dale .. I'm off to the NOELIA thread .. happy crunching

Bedrich Hajek
Send message
Joined: 28 Mar 09
Posts: 467
Credit: 8,104,721,966
RAC: 12,865,884
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39951 - Posted: 1 Feb 2015 | 5:50:48 UTC

It seems to me that the WUs are getting longer at faster pace than the video cards are increasing in speed. I have one suggestion for this, create the ability to run these very long WUs on more than one video card at a time. This would seem to me to be a more prudent use resources than the multi-CPU apps.



Profile bcavnaugh
Send message
Joined: 8 Nov 13
Posts: 56
Credit: 1,002,640,163
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 39999 - Posted: 2 Feb 2015 | 2:39:21 UTC
Last modified: 2 Feb 2015 | 2:46:08 UTC

Is their a way to select to run GERARD_VERYLONG_CXCL12_confAna Tasks?
I would like to run 4 at once on 4 GTX 980 HC Graphics Cards.

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40011 - Posted: 2 Feb 2015 | 22:53:49 UTC - in response to Message 39999.

Is their a way to select to run GERARD_VERYLONG_CXCL12_confAna Tasks?

No.

Profile bcavnaugh
Send message
Joined: 8 Nov 13
Posts: 56
Credit: 1,002,640,163
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 40013 - Posted: 3 Feb 2015 | 1:43:38 UTC - in response to Message 40011.

Is their a way to select to run GERARD_VERYLONG_CXCL12_confAna Tasks?

No.


Thanks for the info.

Profile dskagcommunity
Avatar
Send message
Joined: 28 Apr 11
Posts: 456
Credit: 817,865,789
RAC: 0
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40024 - Posted: 3 Feb 2015 | 22:40:50 UTC

Oh no had two i saw now. Disk limit exceed after 120k und. 135k computing ^^ outsch seems i must adjust something for the future because i use only 40gb disks on some boinc machines O.o
____________
DSKAG Austria Research Team: http://www.research.dskag.at



Co
Send message
Joined: 31 Jan 15
Posts: 2
Credit: 10,877,525
RAC: 0
Level
Pro
Scientific publications
watwat
Message 40027 - Posted: 4 Feb 2015 | 9:25:04 UTC
Last modified: 4 Feb 2015 | 9:28:30 UTC

Everything runs smoothly.
Temp of my CPU (i7 4770k) = 55C-59C
Temp of my GPU (MSI GTX 770 twin frozr) = 61C-70C
Both without overclocking.

I increased the Fan speed of the GPU to 70% and the temp is stable at 62C.
Power of GPU = 60%
GPU use = 88%
All while 3 monitors are connected.
27" at 1920x1080 on DVI port
24" at 1920x1200 on VGA port
22" at 1680x1050 on HDMI port (using a HAMA HDMI-VGA interface)

The system has been running non-stop between jan 31, 14:55 and feb 3, 16:54 and completed 7 long run tasks.

Running 2 projects. GPUGRID and rosetta@home

Jozef J
Send message
Joined: 7 Jun 12
Posts: 112
Credit: 1,118,845,172
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 40064 - Posted: 6 Feb 2015 | 11:58:55 UTC - in response to Message 39723.
Last modified: 6 Feb 2015 | 12:02:07 UTC

pretty, early in the morning, first and successful :-)
13744739 10583385 25 Jan 2015 | 2:12:05 UTC 26 Jan 2015 | 5:10:58 UTC Completed and validated 75,488.86 28,692.76 500,000.00.-

http://www.gpugrid.net/result.php?resultid=13744739


Why was this task canceled . That's what mean ?! A couple of weeks has been recognized as correctly calculated and is now cleared.
and on the performance board is also deleted.

Jurgen
Send message
Joined: 7 Nov 14
Posts: 4
Credit: 6,518,889
RAC: 0
Level
Ser
Scientific publications
watwatwat
Message 40072 - Posted: 6 Feb 2015 | 16:26:08 UTC - in response to Message 39575.

What do you call good graphic cards? Mines gtx 650 Nvidia
____________

Dayle Diamond
Send message
Joined: 5 Dec 12
Posts: 84
Credit: 1,628,088,415
RAC: 1,457,783
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40073 - Posted: 6 Feb 2015 | 17:13:13 UTC

The number in the hundred's slot represents the generation, and the number in the ten's slot is the relative strength compared to other cards within that generation. Here's a handy chart of their relative usefulness for you, cira Febrary 2nd, 2015:

http://www.tomshardware.com/reviews/gaming-graphics-card-review,3107-7.html

It doesn't take heat or power consumption into account, but for GPUgrid, you probably should. Roughly, the newer generations are better.

Jozef J
Send message
Joined: 7 Jun 12
Posts: 112
Credit: 1,118,845,172
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 40090 - Posted: 7 Feb 2015 | 19:55:17 UTC - in response to Message 40064.

pretty, early in the morning, first and successful :-)
13744739 10583385 25 Jan 2015 | 2:12:05 UTC 26 Jan 2015 | 5:10:58 UTC Completed and validated 75,488.86 28,692.76 500,000.00.-

http://www.gpugrid.net/result.php?resultid=13744739


Why was this task canceled . That's what mean ?! A couple of weeks has been recognized as correctly calculated and is now cleared.
and on the performance board is also deleted.

I ask one more time to explain the disappearance of this task and remove it from the board performance.
You are able for the intelligent conversation?

Richard Haselgrove
Send message
Joined: 11 Jul 09
Posts: 1576
Credit: 5,518,086,851
RAC: 8,606,062
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40091 - Posted: 7 Feb 2015 | 20:12:37 UTC - in response to Message 40090.

pretty, early in the morning, first and successful :-)
13744739 10583385 25 Jan 2015 | 2:12:05 UTC 26 Jan 2015 | 5:10:58 UTC Completed and validated 75,488.86 28,692.76 500,000.00.-

http://www.gpugrid.net/result.php?resultid=13744739


Why was this task canceled . That's what mean ?! A couple of weeks has been recognized as correctly calculated and is now cleared.
and on the performance board is also deleted.

I ask one more time to explain the disappearance of this task and remove it from the board performance.
You are able for the intelligent conversation?

All tasks, at all BOINC projects, are purged from the online database after a pre-determined period of time - otherwise the database would grow to an unmanageable size, and performance would slow to a crawl.

The oldest valid task in my own current list is dated 28 January: I think we can take it that the 'purge interval' at this project is 10 days, although some projects use 24 hours or even less.

Your 26 January task is simply too old to be retained in the online transactional database: that does not mean that it has been 'cancelled' - the science will have been moved to another place, and your credits will be retained in your totals.

biodoc
Send message
Joined: 26 Aug 08
Posts: 183
Credit: 6,466,114,375
RAC: 1,393,151
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40092 - Posted: 7 Feb 2015 | 20:14:02 UTC
Last modified: 7 Feb 2015 | 20:17:23 UTC

Yeah, I completed 2 of these verylong WUs and the one that was completed in the shortest time is no longer on the performance board either.

I can't find them in my completed WUs list either. I guess those WUs are cleared after ~2 weeks?

EDIT: Thanks for the info on this Richard. I think we posted at the same time.

I guess these are also purged from the performance database too?

Jozef J
Send message
Joined: 7 Jun 12
Posts: 112
Credit: 1,118,845,172
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 40093 - Posted: 7 Feb 2015 | 20:52:43 UTC
Last modified: 7 Feb 2015 | 20:58:33 UTC

Do not do this by himself uncomprehendingly richard.....
the whole world sees that I was f*** out. from performance tab

Pharmaceutical companies are not as scientists. are mainly businessmen.
When I start here so longtime volunteer warned me before im start of supporting gpu grid. Now I see you were right.
I do not need rac performance. on it are different here as volunteers protectionist complain of BU.

Jacob Klein
Send message
Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40095 - Posted: 7 Feb 2015 | 21:47:31 UTC

I do not understand you, Jozef.
Richard's explanation was correct, so far as I know.

Michael Wintory
Send message
Joined: 2 Nov 14
Posts: 1
Credit: 103,700,063
RAC: 0
Level
Cys
Scientific publications
watwatwat
Message 40151 - Posted: 12 Feb 2015 | 1:41:15 UTC - in response to Message 39575.

I'm running Intel Core i7-4790 CPU @4.00GHz with NVIDIA GeForce GTX980. Would something like this be able to help you guy out?
____________

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 16,606
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40156 - Posted: 12 Feb 2015 | 22:36:19 UTC - in response to Message 40151.

I'm running Intel Core i7-4790 CPU @4.00GHz with NVIDIA GeForce GTX980. Would something like this be able to help you guy out?

Sure it would, but this batch is finished now.

Post to thread

Message boards : News : WARNING/CHALLENGE: VERY LONG WU (VERYLONG_CXCL12_confAna)

//