Message boards : Graphics cards (GPUs) : New version tasks failing on Windows hosts
Author | Message |
---|---|
I'm getting a bunch of new work for my Windows hosts but they are all failing with Error while computing. Here is a sample stderr log. | |
ID: 57318 | Rating: 0 | rate:
![]() ![]() ![]() | |
You're getting error code 0xc0000135, which officially means 'status_dll_not_found'. | |
ID: 57319 | Rating: 0 | rate:
![]() ![]() ![]() | |
My first successful WU from this particular batch: | |
ID: 57320 | Rating: 0 | rate:
![]() ![]() ![]() | |
My first successful WU from this particular batch: Watching at Hosts Ranking page, many other windows hosts are starting to succeed their tasks, and are quickly ascending in rank. Nice to see that Windows hosts are recovered to contribute to the Project along with Linux ones! Output file 09 is very large, over 472 megs. Just to remember that if some file happened to exceed a 512 MB size, it would become stalled when uploading to server. Ian&Steve C. mentioned it and was discussed at can't upload results. file size too big being blocked? thread. I experienced this same problem in one of my hosts, Message #57194. | |
ID: 57321 | Rating: 0 | rate:
![]() ![]() ![]() | |
The file size limit is set on the server, but implemented by the client on your own machine. It's possible to inspect the current value in client_state.xml, and even change it if you catch it soon enough. | |
ID: 57322 | Rating: 0 | rate:
![]() ![]() ![]() | |
I'm getting a bunch of new work for my Windows hosts but they are all failing with Error while computing. Here is a sample stderr log. Sometimes, resetting Gpugrid Project at BOINC Manager may help, given that all app related files will be reloaded. | |
ID: 57323 | Rating: 0 | rate:
![]() ![]() ![]() | |
I'm getting a bunch of new work for my Windows hosts but they are all failing with Error while computing. Here is a sample stderr log. Perhaps worth a try, but I think it's unlikely to help with this one. Another tried-and-tested solution to some rare problems is to perform a full system restart. | |
ID: 57324 | Rating: 0 | rate:
![]() ![]() ![]() | |
Another WU is done: | |
ID: 57325 | Rating: 0 | rate:
![]() ![]() ![]() | |
I have had three tasks complete successfully. They all ran on Quadro RTX 4000 GPU's. Is it possible that only RTX series cards will work? | |
ID: 57326 | Rating: 0 | rate:
![]() ![]() ![]() | |
I have completed two tasks so far, one on a RTX 2080 and another on a GTX 1080 Ti. | |
ID: 57327 | Rating: 0 | rate:
![]() ![]() ![]() | |
Can anyone tell whether these new tasks run on Ampere cards, too? | |
ID: 57328 | Rating: 0 | rate:
![]() ![]() ![]() | |
Can anyone tell whether these new tasks run on Ampere cards, too? Yes, they run fine on Ampere cards also. | |
ID: 57330 | Rating: 0 | rate:
![]() ![]() ![]() | |
Can anyone tell whether these new tasks run on Ampere cards, too? Thanks, Keith, for the valuable information. Just to make sure: you talk about Windows, too, or just Linux? | |
ID: 57332 | Rating: 0 | rate:
![]() ![]() ![]() | |
Can anyone tell whether these new tasks run on Ampere cards, too? In both Windows and Linux systems with updated drivers, at last. | |
ID: 57333 | Rating: 0 | rate:
![]() ![]() ![]() | |
I found one task that failed and the acemd3 application had crashed. | |
ID: 57334 | Rating: 0 | rate:
![]() ![]() ![]() | |
I had this happen with Folding@Home, several weeks ago. Maybe you are talking about the same project anyway :-) | |
ID: 57339 | Rating: 0 | rate:
![]() ![]() ![]() | |
I've now got a Windows v218, flavour cuda101, running and saved for posterity. | |
ID: 57345 | Rating: 0 | rate:
![]() ![]() ![]() | |
I think the problem is due to older Windows OS versions that don't have the newer Microsoft Visual C++ 2015-2019 Redistributable. The new ACEMD app seems to require this to work. | |
ID: 57353 | Rating: 0 | rate:
![]() ![]() ![]() | |
Well, as you can see from my screenshot yesterday (which was taken on a machine running Windows 7, BTW), acemd3 has loaded vcruntime140 from the BOINC slot directory - i.e., from an image circulated by the project via its various downloads. That file is actually v14.28.29325.2, dated Fri Sep 25 2020. | |
ID: 57356 | Rating: 0 | rate:
![]() ![]() ![]() | |
Well, welcome to DLL Hell (yes, that's a thing). | |
ID: 57357 | Rating: 0 | rate:
![]() ![]() ![]() | |
Good news. The machine which failed yesterday is now running an acemd3 cuda101 task. Here is the full set of C runtime library files, and where they're being run from: | |
ID: 57359 | Rating: 0 | rate:
![]() ![]() ![]() | |
I'll drop a line to Gianni, to pass on to the development / deployment team. any reaction from their side so far ? | |
ID: 57374 | Rating: 0 | rate:
![]() ![]() ![]() | |
Sadly, none at all. I might try Toni, after our conversation in the news thread yesterday. | |
ID: 57375 | Rating: 0 | rate:
![]() ![]() ![]() | |
This is already fixed in acemdbeta and will be fixed as soon as we deploy the new app. | |
ID: 57413 | Rating: 0 | rate:
![]() ![]() ![]() | |
Sadly, I'm not completely convinced by this. | |
ID: 57415 | Rating: 0 | rate:
![]() ![]() ![]() | |
I didn't manually intervene with anything and got 3 beta tasks that ran successfully on my Win10 machine. They all finished successfully within a few minutes. Does that help? | |
ID: 57416 | Rating: 0 | rate:
![]() ![]() ![]() | |
That's reassuring - thanks. But I'll keep an eye on my Windows 7 machines when the project next ramps up. | |
ID: 57417 | Rating: 0 | rate:
![]() ![]() ![]() | |
This is already fixed in acemdbeta and will be fixed as soon as we deploy the new app. No further explanation ? How can we fix it ? What is the problem ? Not only the C++ (I have all dll and C++ manually installed) Till june all was working. Now, not more (without any change in my config. It seems the problem comes from the WU | |
ID: 57542 | Rating: 0 | rate:
![]() ![]() ![]() | |
Message boards : Graphics cards (GPUs) : New version tasks failing on Windows hosts