Message boards : Number crunching : High Failure Rate of SANTI Tasks
Author | Message |
---|---|
I am creating this thread because the other SANTI thread has been closed. | |
ID: 36863 | Rating: 0 | rate: / Reply Quote | |
I keep having many SANTIs failing on my 750Ti on Linux. Some of them succeed, but most of them fail. They run completely OK on my 650Ti on the same computer. Many of them fail on other systems as well, not just on mine. Eventually, they seem to locate a system of their liking and succeed! Two recent examples (my computer being 171276): | |
ID: 37032 | Rating: 0 | rate: / Reply Quote | |
I switched my 750Ti to Einstein for about a day and a half just to make sure the card is OK. I crunched 10 WUs successfully, so I am positive my card is OK. | |
ID: 37043 | Rating: 0 | rate: / Reply Quote | |
I hope you did realize that one of the failed WUs was completed with a 750ti under windows. There is some indication that GPU memory is a factor for some of GPUGRID WUs. For the last one you linked, it will be interesting to see if that 780ti can complete it since it has more memory available than the other cards. | |
ID: 37046 | Rating: 0 | rate: / Reply Quote | |
My setup is handling heat pretty adequately, to give you an idea, with ambient temperatures at ~30C my GPUs and CPU maxes out at ~70C. I've invested much time in building my crunching rig for heat and I think it works fine. It's also pretty quiet, you can easily sit next to it. It's not easy to tolerate the heat it emits though, that top exhaust fan is like a small oven! | |
ID: 37048 | Rating: 0 | rate: / Reply Quote | |
Other systems are completing those failed WUs so it is not the WUs. (Okay, there are occasionally bad batches which usually get sorted out very quickly.) Hopefully you figure out what is the problem with your system. | |
ID: 37054 | Rating: 0 | rate: / Reply Quote | |
I upgraded the kernel today to Ubuntu's 12.04 latest - 3.13. I then got two NOELIA_TRPS1S4, the one of which that was assigned to the 750Ti failed after ~1100 sec... It then got a SANTI_p53final and it's still crunching that ~10 hours later. Maybe it will complete it (knocking on wood!) | |
ID: 37055 | Rating: 0 | rate: / Reply Quote | |
I also had two WUs from SANTI that failed with the message The simulation has become unstable. Terminating to avoid lock-up (1) This card runs other WUs just fine, so I don't think it is a hardware or driver problem. http://www.gpugrid.net/result.php?resultid=12188313 http://www.gpugrid.net/result.php?resultid=12529134 | |
ID: 37060 | Rating: 0 | rate: / Reply Quote | |
I'm running 7 EVGA & PNY factory OCed 750 Ti cards in Win7-64 and have had only 1 error: | |
ID: 37073 | Rating: 0 | rate: / Reply Quote | |
Hey Beyond, thanks for your response! Yes, I have concluded it is not the SANTIs after all, but something with my system. The card has failed with all sorts of WUs, but I have made some interesting observations. Take a look at the 750TI-650TI Combo on Linux thread, where I am continuing this discussion, as it is not a matter of SANTIs any more and I don't want to keep this thread at the head of the Number crunching section. Your input is always welcome and appreciated! | |
ID: 37074 | Rating: 0 | rate: / Reply Quote | |
Message boards : Number crunching : High Failure Rate of SANTI Tasks