1) Message boards : Number crunching : have a lot of stuck tasks, abort some? (Message 53245)
Posted 1579 days ago by Profile Dingo
I have a few tasks that have not uploaded for a while and all have "Upload Pending Project Backoff" Do I just let them sit there and wait till they upload. I have tried stopping and starting BOINC but that did not fix it.

GPUGRID initial_1687-ELISA_GSN0V1-8-100-RND7000_0_0 1.054 10.65 K 00:00:20 - 15:10:42 0.00 Kbps Upload pending (Project backoff: 00:27:19) Rack-01
GPUGRID initial_1687-ELISA_GSN0V1-8-100-RND7000_0_1 0.003 3816.54 K 00:00:18 - 14:50:55 0.00 Kbps Upload pending (Project backoff: 00:27:19) Rack-01
GPUGRID initial_1687-ELISA_GSN0V1-8-100-RND7000_0_2 0.003 3816.54 K 00:00:11 - 12:41:05 0.00 Kbps Upload pending (Project backoff: 00:27:19) Rack-01
GPUGRID initial_1687-ELISA_GSN0V1-8-100-RND7000_0_9 0.000 68065.03 K 00:00:07 - 12:29:29 0.00 Kbps Upload pending (Project backoff: 00:27:19) Rack-01
GPUGRID initial_1687-ELISA_GSN0V1-8-100-RND7000_0_10 100.000 0.27 K 00:00:39 - 12:37:38 0.00 Kbps Upload pending (Project backoff: 00:27:19) Rack-01
GPUGRID initial_1440-ELISA_GSN0V1-9-100-RND9376_0_0 1.057 10.62 K 00:00:42 - 12:39:17 0.00 Kbps Upload pending (Project backoff: 00:27:19) Rack-01
GPUGRID initial_1440-ELISA_GSN0V1-9-100-RND9376_0_1 0.003 3816.54 K 00:00:22 - 12:24:37 0.00 Kbps Upload pending (Project backoff: 00:27:19) Rack-01
GPUGRID initial_1440-ELISA_GSN0V1-9-100-RND9376_0_2 0.003 3816.54 K 00:00:21 - 12:20:38 0.00 Kbps Upload pending (Project backoff: 00:27:19) Rack-01
GPUGRID initial_1440-ELISA_GSN0V1-9-100-RND9376_0_9 0.000 68067.50 K 00:00:04 - 12:08:51 0.00 Kbps Upload pending (Project backoff: 00:27:19) Rack-01
GPUGRID initial_1440-ELISA_GSN0V1-9-100-RND9376_0_10 100.000 0.27 K 00:00:03 - 06:35:08 0.00 Kbps Upload pending (Project backoff: 00:27:19) Rack-01
GPUGRID initial_1509-ELISA_GSN0V1-9-100-RND3769_0_0 1.022 10.99 K 00:00:14 - 08:49:10 0.00 Kbps Upload pending (Project backoff: 00:06:46) bundy-2
GPUGRID initial_1509-ELISA_GSN0V1-9-100-RND3769_0_1 0.003 3816.54 K 00:00:16 - 07:01:25 0.00 Kbps Upload pending (Project backoff: 00:06:46) bundy-2
GPUGRID initial_1509-ELISA_GSN0V1-9-100-RND3769_0_2 0.003 3816.54 K 00:00:11 - 07:53:31 0.00 Kbps Upload pending (Project backoff: 00:06:46) bundy-2
GPUGRID initial_1509-ELISA_GSN0V1-9-100-RND3769_0_9 0.000 68066.14 K 00:00:08 - 06:20:56 0.00 Kbps Upload pending (Project backoff: 00:06:46) bundy-2
GPUGRID initial_1509-ELISA_GSN0V1-9-100-RND3769_0_10 100.000 0.27 K 00:00:07 - 06:05:22 0.00 Kbps Upload pending (Project backoff: 00:06:46) bundy-2
GPUGRID initial_1622-ELISA_GSN4V1-14-100-RND6105_2_0 1.031 10.90 K 00:00:20 - 16:13:14 0.00 Kbps Upload pending (Project backoff: 00:35:15) bundy-3
GPUGRID initial_1622-ELISA_GSN4V1-14-100-RND6105_2_1 0.003 3817.50 K 00:00:20 - 14:14:37 0.00 Kbps Upload pending (Project backoff: 00:35:15) bundy-3
GPUGRID initial_1622-ELISA_GSN4V1-14-100-RND6105_2_2 0.003 3817.50 K 00:00:11 - 13:01:50 0.00 Kbps Upload pending (Project backoff: 00:35:15) bundy-3
GPUGRID initial_1622-ELISA_GSN4V1-14-100-RND6105_2_9 0.000 68080.19 K 00:00:10 - 12:35:48 0.00 Kbps Upload pending (Project backoff: 00:35:15) bundy-3
GPUGRID initial_1622-ELISA_GSN4V1-14-100-RND6105_2_10 100.000 0.27 K 00:00:07 - 11:41:37 0.00 Kbps Upload pending (Project backoff: 00:35:15) bundy-3
2) Message boards : Number crunching : New app update (acemd3) (Message 53074)
Posted 1586 days ago by Profile Dingo
I crunched a couple of the new tasks (New version of ACEMD v2.10 (cuda100))on my GTX1660Ti which is the new turing type and they processed and validated. Good that I can use this machine on GPUGrid now.

https://www.gpugrid.net/show_host_detail.php?hostid=517492
3) Message boards : Graphics cards (GPUs) : New Nvidia Driver error (Message 52369)
Posted 1702 days ago by Profile Dingo
OK all is fine now. Must have been a problem of the update happening while GPUGRID was running ??
4) Message boards : Graphics cards (GPUs) : New Nvidia Driver error (Message 52368)
Posted 1703 days ago by Profile Dingo
OK I will try another task and see what happens. This is th task that is running now:

https://www.gpugrid.net/workunit.php?wuid=16682627
5) Message boards : Graphics cards (GPUs) : New Nvidia Driver error (Message 52364)
Posted 1703 days ago by Profile Dingo
I did the driver update for Nvidia to 431.6 and there is an error in the driver code that stops me from running GPU Grid as all the work since then has this error. It is on my windows machine with my 1080Ti.

I can run Primegrid on the machine after the update so looks like a project code issue ???


This is the machine: https://www.gpugrid.net/results.php?hostid=453402
At the very end of processing:

Name e9s120_e3s89p1f137-PABLO_V4_UCB_p27_sj403_no_salt_IDP-0-2-RND6771_0
Workunit 16678301
Created 28 Jul 2019 | 19:10:51 UTC
Sent 28 Jul 2019 | 20:27:33 UTC
Received 29 Jul 2019 | 2:37:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -55 (0xffffffffffffffc9) Unknown error number
Computer ID 453402
Report deadline 2 Aug 2019 | 20:27:33 UTC
Run time 21,941.82
CPU time 1,907.74
Validate state Invalid
Credit 0.00
Application version Long runs (8-12 hours on fastest card) v9.22 (cuda80)
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -55 (0xffffffc9)</message>
<stderr_txt>
# GPU [GeForce GTX 1080 Ti] Platform [Windows] Rev [3212] VERSION [80]
# SWAN Device 0 :
# Name : GeForce GTX 1080 Ti
# ECC : Disabled
# Global mem : 11264MB
# Capability : 6.1
# PCI ID : 0000:0A:00.0
# Device clock : 1645MHz
# Memory clock : 5505MHz
# Memory width : 352bit
# Driver version : r431_31 : 43136
# GPU 0 : 71C
# GPU [GeForce GTX 1080 Ti] Platform [Windows] Rev [3212] VERSION [80]
# SWAN Device 0 :
# Name : GeForce GTX 1080 Ti
# ECC : Disabled
# Global mem : 11264MB
# Capability : 6.1
# PCI ID : 0000:0A:00.0
# Device clock : 1645MHz
# Memory clock : 5505MHz
# Memory width : 352bit
# Driver version : r431_31 : 43136
# GPU 0 : 68C
# GPU 0 : 69C
# GPU 0 : 70C
SWAN : FATAL : Cuda driver error 999 in file 'swanlibnv2.cpp' in line 1965.
# SWAN swan_assert 0
6) Message boards : Graphics cards (GPUs) : 1660 Ti (Message 51704)
Posted 1806 days ago by Profile Dingo
I guess this is the error because GTX 1660 Ti can't run this project. If it is when can an update be expected. It can be run on a number of projects like SETI, Primegrid, Amicable Numbers and Moo. As these new cards are becoming more popular can an update be seen soon:

Stderr output
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -59 (0xffffffc5)</message>
<stderr_txt>
# GPU [GeForce GTX 1660 Ti] Platform [Windows] Rev [3212] VERSION [80]
# SWAN Device 0 :
# Name : GeForce GTX 1660 Ti
# ECC : Disabled
# Global mem : 6144MB
# Capability : 7.5
# PCI ID : 0000:01:00.0
# Device clock : 1830MHz
# Memory clock : 6001MHz
# Memory width : 192bit
# Driver version : r419_50 : 42531
#SWAN: FATAL: cannot find image for module [.nonbonded.cu.] for device version 750

</stderr_txt>
]]>
7) Message boards : News : The experimental QC for W10 is now called QC_beta (Message 49023)
Posted 2227 days ago by Profile Dingo
Is there no work ? I have the Prefences set to Yes for the Beta. Is that the correct one ??
Quantum Chemistry (CPU): no
Quantum Chemistry (CPU, beta): yes
8) Message boards : Number crunching : NOELIA WU (Message 37022)
Posted 3581 days ago by Profile Dingo
I am getting the same error today when I started the first task on my GTX 750 Ti:
lystrpx8-NOELIA_SH2eq-0-1-RND2046_1
Workunit 8238384
Created 7 Jun 2014 | 14:50:16 UTC
Sent 7 Jun 2014 | 16:19:10 UTC
Received 7 Jun 2014 | 16:26:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -98 (0xffffffffffffff9e) Unknown error number
Computer ID 170120
Report deadline 12 Jun 2014 | 16:19:10 UTC
Run time 2.00
CPU time 0.14
Validate state Invalid
Credit 0.00
Application version Short runs (2-3 hours on fastest card) v8.41 (cuda60)


Stderr output
<core_client_version>7.3.19</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -98 (0xffffff9e)
</message>
<stderr_txt>
# GPU [GeForce GTX 750 Ti] Platform [Windows] Rev [3301M] VERSION [60]
# SWAN Device 0 :
# Name : GeForce GTX 750 Ti
# ECC : Disabled
# Global mem : 2048MB
# Capability : 5.0
# PCI ID : 0000:01:00.0
# Device clock : 1137MHz
# Memory clock : 2700MHz
# Memory width : 128bit
# Driver version : r337_00 : 33788
ERROR: file mdioload.cpp line 162: No CHARMM parameter file specified
12:27:22 (4164): called boinc_finish

</stderr_txt>
]]>



Looks like my buddy on this had a different error:

http://www.gpugrid.net/workunit.php?wuid=8238384

9) Message boards : News : New app on acemdbeta with Maxwell support (Message 35645)
Posted 3666 days ago by Profile Dingo
I too have a GTX 750 TI and before I saw this post tried to run GPUGRID and the tasks all failed. I set my preferences to receive Beta and also the acemdbeta task but there are no tasks available.

I just updated to Driver version 335.23

Is there going to more of these tasks???
Cheers
10) Message boards : Number crunching : WUs for AMD/ATI cards? (Message 26822)
Posted 4219 days ago by Profile Dingo
Other project collaborate with the ATI folks on improving the drivers and SDK so they will run their projects better. Has GPUGRID even approached ATI/AMD??


Next 10
//