1) Message boards : Graphics cards (GPUs) : GPU run failures (Message 13257)
Posted 5292 days ago by Reddogg
Hi,
the pc temperature is ok, the card is an AMP! Edition so it is factory-overclocked. But I think it is not the reason, because weeks ago under Win XP 32bit/Win 7 64bit GPUGRId runs very well, it is a lately problem.
That's the reason I put some problems in the thread here. Because it seems that are different failure's?
2) Message boards : Graphics cards (GPUs) : GPU run failures (Message 13254)
Posted 5292 days ago by Reddogg
Greetings,
in the last days the failure rates of GPUGRID using is increasing. Here are some failure reports:
<core_client_version>6.10.13</core_client_version>
<![CDATA[
<message>
Unzul�ssige Funktion. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# Using CUDA device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce 8800 GTS 512"
# Clock rate: 1.73 GHz
# Total amount of global memory: 536870912 bytes
# Number of multiprocessors: 16
# Number of cores: 128
MDIO ERROR: cannot open file "restart.coor"
# Using CUDA device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce 8800 GTS 512"
# Clock rate: 1.73 GHz
# Total amount of global memory: 536870912 bytes
# Number of multiprocessors: 16
# Number of cores: 128
MDIO ERROR: cannot open file "restart.coor"
Cuda error: Kernel [geomhash_kernel] failed in file 'gridcell.cu' in line 209 : unknown error.

</stderr_txt>
]]>

another one:
<core_client_version>6.10.13</core_client_version>
<![CDATA[
<message>
- exit code 98 (0x62)
</message>
<stderr_txt>
# Using CUDA device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce 8800 GTS 512"
# Clock rate: 1.73 GHz
# Total amount of global memory: 536870912 bytes
# Number of multiprocessors: 16
# Number of cores: 128
MDIO ERROR: cannot open file "restart.coor"
ERROR: c:\cygwin\home\speechserver\gpumd2\src\pme\CPME_cufft.cu, line 11: cufftExecR2C (gridcalc1)
called boinc_finish

</stderr_txt>
]]>

and another too:
<core_client_version>6.10.13</core_client_version>
<![CDATA[
<message>
Unzul�ssige Funktion. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# Using CUDA device 0
# There is 1 device supporting CUDA
# Device 0: "GeForce 8800 GTS 512"
# Clock rate: 1.73 GHz
# Total amount of global memory: 536870912 bytes
# Number of multiprocessors: 16
# Number of cores: 128
MDIO ERROR: cannot open file "restart.coor"
Cuda error: Kernel [pme_fill_charges_grid_kernel] failed in file 'fillcharges.cu' in line 55 : unknown error.

</stderr_txt>
]]>

I think the most of this failures I get when I started a HD-Video, Games (it is irrelevant if I started the game when boinc is running or not).
Can anyone help me please to minimize the failure rates.
It is very horrible if u running GPUGRID for 10 hours and it fails.

Thank you for all your hints.

Regards,
Reddogg


//