Message boards : News : Linux app failing

Author Message
GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Joined: 14 Mar 07
Posts: 1909
Credit: 629,356
RAC: 0
Level
Gly
Message 48814 - Posted: 5 Feb 2018 | 9:20:29 UTC

We will need to upload a new Linux app soon to replace the current executable, which has expired.

gdf

Keith Myers
Joined: 13 Dec 17
Posts: 82
Credit: 9,586,188
RAC: 2,271
Level
Ser
Message 48818 - Posted: 5 Feb 2018 | 19:26:31 UTC

Thanks for the update.

GDF
Joined: 14 Mar 07
Posts: 1909
Credit: 629,356
RAC: 0
Level
Gly
Message 48819 - Posted: 5 Feb 2018 | 19:28:14 UTC - in response to Message 48818.

I should have deprecated the current one, so at least it does not get out.

Keith Myers
Joined: 13 Dec 17
Posts: 82
Credit: 9,586,188
RAC: 2,271
Level
Ser
Message 48822 - Posted: 5 Feb 2018 | 20:07:50 UTC - in response to Message 48819.
Last modified: 5 Feb 2018 | 20:34:00 UTC

I should have deprecated the current one, so at least it does not get out.

Yes, no point in wasting download bandwidth and gpu resources on tasks that error out immediately.

James Medeiros
Joined: 22 Apr 14
Posts: 4
Credit: 239,737,315
RAC: 498,644
Level
Leu
Message 48829 - Posted: 6 Feb 2018 | 1:48:12 UTC - in response to Message 48822.

Whew! Glad it's not my machine... I was up to hundreds of days of uptime and removed/reinstalled all of BOINC - then restarted, thinking something was really gummed up on my end ;P Glad there'll be a new app soon for Linux. Thanks!

Robert Gammon
Joined: 28 May 12
Posts: 63
Credit: 641,668,080
RAC: 18,901
Level
Lys
Message 48843 - Posted: 6 Feb 2018 | 16:34:14 UTC - in response to Message 48814.

My last successful Long Run was 30 Jan. I keep getting messages about Long Runs failing to execute, even when there are ZERO Long Runs present on the server.

Retvari Zoltan
Joined: 20 Jan 09
Posts: 1886
Credit: 11,897,750,244
RAC: 5,167,427
Level
Trp
Message 48880 - Posted: 8 Feb 2018 | 0:54:18 UTC - in response to Message 48819.
Last modified: 8 Feb 2018 | 0:55:21 UTC

I should have deprecated the current one, so at least it does not get out.
It should have been deprecated from the short runs as well. It took me by surprise that short tasks are present on my hosts, which have failed before on Linux hosts (but after this issue has been recognized).

Keith Myers
Joined: 13 Dec 17
Posts: 82
Credit: 9,586,188
RAC: 2,271
Level
Ser
Message 48881 - Posted: 8 Feb 2018 | 1:04:30 UTC - in response to Message 48880.

I was surprised to see short tasks today. I had never seen them before. They failed with the same exit code as the long tasks. Didn't realize they used the same application.

mmonnin
Joined: 2 Jul 16
Posts: 137
Credit: 177,213,592
RAC: 512,425
Level
Ile
Message 48913 - Posted: 12 Feb 2018 | 14:36:03 UTC - in response to Message 48814.

We will need to upload a new Linux app soon to replace the current executable, which has expired.

gdf


Any update to this?

mmonnin
Joined: 2 Jul 16
Posts: 137
Credit: 177,213,592
RAC: 512,425
Level
Ile
Message 48920 - Posted: 12 Feb 2018 | 19:57:46 UTC

Awesome, bunch of spammers here...

Keith Myers
Joined: 13 Dec 17
Posts: 82
Credit: 9,586,188
RAC: 2,271
Level
Ser
Message 48921 - Posted: 12 Feb 2018 | 20:40:40 UTC
Last modified: 12 Feb 2018 | 20:43:20 UTC

My understanding was that the developer for the Linux gpu apps was out or unavailable last week but expected to return this week.

I will have to keep bumping this thread if I do not see the new application executable by the end of the week.

GDF
Joined: 14 Mar 07
Posts: 1909
Credit: 629,356
RAC: 0
Level
Gly
Message 48926 - Posted: 13 Feb 2018 | 10:28:36 UTC - in response to Message 48921.

I have uploaded a new linux version of the old app. But we are also expecting to deploy a new app soon with faster performance.


the current new linux version is 919. Let me know if it works.

gdf

mmonnin
Joined: 2 Jul 16
Posts: 137
Credit: 177,213,592
RAC: 512,425
Level
Ile
Message 48928 - Posted: 13 Feb 2018 | 12:38:13 UTC - in response to Message 48926.

I have uploaded a new linux version of the old app. But we are also expecting to deploy a new app soon with faster performance.


the current new linux version is 919. Let me know if it works.

gdf


That's even better news!

Just waiting on some tasks.

Retvari Zoltan
Joined: 20 Jan 09
Posts: 1886
Credit: 11,897,750,244
RAC: 5,167,427
Level
Trp
Message 48929 - Posted: 13 Feb 2018 | 15:45:05 UTC - in response to Message 48926.
Last modified: 13 Feb 2018 | 15:45:49 UTC

I have uploaded a new linux version of the old app. But we are also expecting to deploy a new app soon with faster performance.


the current new linux version is 919. Let me know if it works.

gdf

Could you please update the Linux app in the Short queue as well?

Keith Myers
Joined: 13 Dec 17
Posts: 82
Credit: 9,586,188
RAC: 2,271
Level
Ser
Message 48931 - Posted: 13 Feb 2018 | 20:21:33 UTC - in response to Message 48929.

Absolutely, this should have been done automatically along with the Long Tasks.

jiipee
Joined: 4 Jun 15
Posts: 1
Credit: 776,715,766
RAC: 982,103
Level
Glu
Message 48934 - Posted: 14 Feb 2018 | 5:58:54 UTC

Is there something that needs to be done on the client side to get these updated linux apps?

Retvari Zoltan
Joined: 20 Jan 09
Posts: 1886
Credit: 11,897,750,244
RAC: 5,167,427
Level
Trp
Message 48935 - Posted: 14 Feb 2018 | 8:33:54 UTC - in response to Message 48934.

Is there something that needs to be done on the client side to get these updated linux apps?
No, it will be automatically downloaded when you receive a new task.

biodoc
Joined: 26 Aug 08
Posts: 114
Credit: 772,027,133
RAC: 551,520
Level
Glu
Message 48938 - Posted: 14 Feb 2018 | 14:41:04 UTC
Last modified: 14 Feb 2018 | 14:41:40 UTC

Has anyone received any GPU tasks on linux yet? I'm currently running Quantum Chemistry tasks and when I request new work I get the following:

Wed 14 Feb 2018 09:28:49 AM EST | GPUGRID | update requested by user
Wed 14 Feb 2018 09:28:52 AM EST | GPUGRID | Sending scheduler request: Requested by user.
Wed 14 Feb 2018 09:28:52 AM EST | GPUGRID | Requesting new tasks for CPU and NVIDIA GPU
Wed 14 Feb 2018 09:28:54 AM EST | GPUGRID | Scheduler request completed: got 0 new tasks
Wed 14 Feb 2018 09:28:54 AM EST | GPUGRID | No tasks sent
Wed 14 Feb 2018 09:28:54 AM EST | GPUGRID | This computer has reached a limit on tasks in progress


There's no mention of short or long GPU tasks being available or not.

Is there anything in my app_config file that is preventing me from getting GPU tasks?

<app_config>
  <app>
    <name>acemdlong</name>
    <gpu_versions>
      <gpu_usage>1</gpu_usage>
      <cpu_usage>1</cpu_usage>
    </gpu_versions>
  </app>
  <app>
    <name>acemdshort</name>
    <gpu_versions>
      <gpu_usage>1</gpu_usage>
      <cpu_usage>1</cpu_usage>
    </gpu_versions>
  </app>
  <app>
    <name>QC</name>
    <max_concurrent>2</max_concurrent>
  </app>
  <app_version>
    <app_name>QC</app_name>
    <plan_class>mt</plan_class>
    <avg_ncpus>4</avg_ncpus>
    <cmdline>--nthreads 4</cmdline>
  </app_version>
</app_config>


I know GPU tasks are rare but I'm just trying to confirm I've got everything set correctly.
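
For anyone wanting to sanity-check a file like the one above, here is a minimal Python sketch. It only confirms that the XML is well-formed and lists the per-app settings it contains; it says nothing about whether the server has work to send. The function name `summarize_app_config` and the embedded `APP_CONFIG` string are illustrative assumptions; in practice the text would be read from the project's app_config.xml.

```python
import xml.etree.ElementTree as ET

# Text copied from the post above (hypothetical constant name).
APP_CONFIG = """<app_config>
<app>
<name>acemdlong</name>
<gpu_versions>
<gpu_usage>1</gpu_usage>
<cpu_usage>1</cpu_usage>
</gpu_versions>
</app>
<app>
<name>acemdshort</name>
<gpu_versions>
<gpu_usage>1</gpu_usage>
<cpu_usage>1</cpu_usage>
</gpu_versions>
</app>
<app>
<name>QC</name>
<max_concurrent>2</max_concurrent>
</app>
<app_version>
<app_name>QC</app_name>
<plan_class>mt</plan_class>
<avg_ncpus>4</avg_ncpus>
<cmdline>--nthreads 4</cmdline>
</app_version>
</app_config>"""


def _to_float(text):
    """Convert element text to float, passing through a missing value."""
    return float(text) if text is not None else None


def summarize_app_config(xml_text):
    """Parse an app_config document and return the settings of each <app>.

    ET.fromstring raises xml.etree.ElementTree.ParseError if the text is
    not well-formed, which is the first thing worth ruling out.
    """
    root = ET.fromstring(xml_text)
    summary = {}
    for app in root.findall("app"):
        name = app.findtext("name")
        gpu = app.find("gpu_versions")
        summary[name] = {
            "gpu_usage": _to_float(gpu.findtext("gpu_usage")) if gpu is not None else None,
            "cpu_usage": _to_float(gpu.findtext("cpu_usage")) if gpu is not None else None,
            "max_concurrent": app.findtext("max_concurrent"),
        }
    return summary


if __name__ == "__main__":
    for app_name, settings in summarize_app_config(APP_CONFIG).items():
        print(app_name, settings)
```

If the file parses and both acemdlong and acemdshort show up with the expected usage values, the file itself is at least syntactically sound.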

STARBASEn
Joined: 17 Feb 09
Posts: 34
Credit: 453,042,714
RAC: 1,133,460
Level
Gln
Message 48940 - Posted: 14 Feb 2018 | 16:29:32 UTC - in response to Message 48938.

Has anyone received any GPU tasks on linux yet? I'm currently running Quantum Chemistry tasks and when I request new work I get the following:

Wed 14 Feb 2018 09:28:49 AM EST | GPUGRID | update requested by user
Wed 14 Feb 2018 09:28:52 AM EST | GPUGRID | Sending scheduler request: Requested by user.
Wed 14 Feb 2018 09:28:52 AM EST | GPUGRID | Requesting new tasks for CPU and NVIDIA GPU
Wed 14 Feb 2018 09:28:54 AM EST | GPUGRID | Scheduler request completed: got 0 new tasks
Wed 14 Feb 2018 09:28:54 AM EST | GPUGRID | No tasks sent
Wed 14 Feb 2018 09:28:54 AM EST | GPUGRID | This computer has reached a limit on tasks in progress


There's no mention of short or long GPU tasks being available or not.


I am getting the same message every time boinc requests gpu tasks for my 3 gpu capable machines. I haven't received any gpu work since the announcement of linux v9.19. I suspect that gpu work is so scarce right now it's hard to get any.
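
The event log lines quoted above follow a "timestamp | project | message" layout, so they are easy to scan programmatically. The following is a rough Python sketch, assuming that layout holds for every line; the function names `parse_event_log` and `scheduler_summary` are made up for illustration and are not part of BOINC.

```python
# Log text copied from the quoted post (hypothetical constant name).
LOG = """Wed 14 Feb 2018 09:28:49 AM EST | GPUGRID | update requested by user
Wed 14 Feb 2018 09:28:52 AM EST | GPUGRID | Sending scheduler request: Requested by user.
Wed 14 Feb 2018 09:28:52 AM EST | GPUGRID | Requesting new tasks for CPU and NVIDIA GPU
Wed 14 Feb 2018 09:28:54 AM EST | GPUGRID | Scheduler request completed: got 0 new tasks
Wed 14 Feb 2018 09:28:54 AM EST | GPUGRID | No tasks sent
Wed 14 Feb 2018 09:28:54 AM EST | GPUGRID | This computer has reached a limit on tasks in progress"""


def parse_event_log(text):
    """Split each line into (timestamp, project, message); skip other lines."""
    entries = []
    for line in text.splitlines():
        parts = [part.strip() for part in line.split(" | ", 2)]
        if len(parts) == 3:
            entries.append(tuple(parts))
    return entries


def scheduler_summary(entries, project="GPUGRID"):
    """Pick out the scheduler-related facts for one project."""
    messages = [msg for _, proj, msg in entries if proj == project]
    return {
        "requested": [m for m in messages if m.startswith("Requesting new tasks")],
        "got_zero_tasks": any("got 0 new tasks" in m for m in messages),
        "limit_reached": any(
            "reached a limit on tasks in progress" in m for m in messages
        ),
    }


if __name__ == "__main__":
    print(scheduler_summary(parse_event_log(LOG)))
```

Note that the log itself does not say which resource the "limit on tasks in progress" line applies to, so the sketch can only flag that the message appeared.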

biodoc
Joined: 26 Aug 08
Posts: 114
Credit: 772,027,133
RAC: 551,520
Level
Glu
Message 48941 - Posted: 14 Feb 2018 | 16:39:36 UTC - in response to Message 48940.

You are probably right.

I'm a little concerned about the line stating "this computer has reached a limit on tasks in progress". Does that mean both CPU and GPU tasks or just CPU Quantum Chemistry tasks?

Time will tell I guess.

biodoc
Joined: 26 Aug 08
Posts: 114
Credit: 772,027,133
RAC: 551,520
Level
Glu
Message 48943 - Posted: 14 Feb 2018 | 18:25:46 UTC

Just got a "Long" GPU WU.

mmonnin
Joined: 2 Jul 16
Posts: 137
Credit: 177,213,592
RAC: 512,425
Level
Ile
Message 48945 - Posted: 14 Feb 2018 | 20:58:22 UTC

App was updated today:
http://www.gpugrid.net/apps.php

Nothing yet for me.

STARBASEn
Joined: 17 Feb 09
Posts: 34
Credit: 453,042,714
RAC: 1,133,460
Level
Gln
Message 48946 - Posted: 14 Feb 2018 | 21:36:52 UTC

Got a Long WU about an hour and a half ago and it appears to be running fine with 32% completed using ACEMD.919-80.bin

STARBASEn
Joined: 17 Feb 09
Posts: 34
Credit: 453,042,714
RAC: 1,133,460
Level
Gln
Message 48947 - Posted: 14 Feb 2018 | 21:45:36 UTC - in response to Message 48941.

You are probably right.

I'm a little concerned about the line stating "this computer has reached a limit on tasks in progress". Does that mean both CPU and GPU tasks or just CPU Quantum Chemistry tasks?

Time will tell I guess.


I suspect that line refers to CPU work in this case. My GPUs were all asking for xxxxxx seconds of work because I had none, but no tasks were downloaded because none were available at the time of the request.

biodoc
Joined: 26 Aug 08
Posts: 114
Credit: 772,027,133
RAC: 551,520
Level
Glu
Message 48949 - Posted: 14 Feb 2018 | 21:59:02 UTC - in response to Message 48946.

Got a Long WU about an hour and a half ago and it appears to be running fine with 32% completed using ACEMD.919-80.bin


My WU is 72% complete and is also running the same app.

biodoc
Joined: 26 Aug 08
Posts: 114
Credit: 772,027,133
RAC: 551,520
Level
Glu
Message 48952 - Posted: 15 Feb 2018 | 0:19:07 UTC - in response to Message 48949.

Got a Long WU about an hour and a half ago and it appears to be running fine with 32% completed using ACEMD.919-80.bin


My WU is 72% complete and is also running the same app.


WU finished successfully.

http://www.gpugrid.net/result.php?resultid=17050694

STARBASEn
Joined: 17 Feb 09
Posts: 34
Credit: 453,042,714
RAC: 1,133,460
Level
Gln
Message 48953 - Posted: 15 Feb 2018 | 0:37:06 UTC

Mine just finished successfully as well and caught another that's in progress.

mmonnin
Joined: 2 Jul 16
Posts: 137
Credit: 177,213,592
RAC: 512,425
Level
Ile
Message 48957 - Posted: 15 Feb 2018 | 3:43:05 UTC

I finally received one.

I would expect it to perform in the exact same manner with the updated license.

NUCCpod_NAPTIMELABS_01
Joined: 18 Aug 17
Posts: 6
Credit: 18,577,273
RAC: 406,792
Level
Pro
Message 48960 - Posted: 15 Feb 2018 | 8:12:02 UTC

Got a short and a long today, both executed and completed without error.
Seems like 919 fixed it up.

mmonnin
Joined: 2 Jul 16
Posts: 137
Credit: 177,213,592
RAC: 512,425
Level
Ile
Message 48969 - Posted: 16 Feb 2018 | 3:38:46 UTC

Are these the same type of tasks? All the credit values are different. By what was said I was expecting the exact same app, same performance, same results. Just with a refreshed license/exe.

Keith Myers
Joined: 13 Dec 17
Posts: 82
Credit: 9,586,188
RAC: 2,271
Level
Ser
Message 48970 - Posted: 16 Feb 2018 | 4:31:55 UTC - in response to Message 48969.

The developer said the new application would be faster along with fixing the expired license issue.

Message 48926

I suspect the newer app already got the proposed speedup. The developer will have to comment if that is actually the case.

mmonnin
Joined: 2 Jul 16
Posts: 137
Credit: 177,213,592
RAC: 512,425
Level
Ile
Message 48971 - Posted: 16 Feb 2018 | 5:19:51 UTC - in response to Message 48970.
Last modified: 16 Feb 2018 | 5:22:48 UTC

The developer said the new application would be faster along with fixing the expired license issue.

Message 48926

I suspect the newer app already got the proposed speedup. The developer will have to comment if that is actually the case.


That is a separate app. The refresh is the same app. Check the server status and apps page for an app called "New version of ACEMD", which I'm guessing is the "new app coming soon" part. I've been around enough apps to know that 'soon' is not the next day or even next week.

I did complete one for the normal 252,750 credit. 6 others were something different from the 111k, 252k, 387k tasks we had been getting for a while.

And I was seeing much lower credit/hr, not better, anyway. Same GTX 970.
Run time (sec)  CPU time (sec)  Credit
23,402.30       23,335.39       96,300.00
22,216.21       22,108.95       111,300.00

15,161.22       15,109.74       12,750.00
8,710.36        8,682.41        17,250.00
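
The credit/hr comparison can be worked out directly from the four rows above. This is a small Python sketch, assuming the three columns are run time in seconds, CPU time in seconds, and granted credit (the usual layout of a BOINC task list, but not stated in the post); `TASKS` and `credit_per_hour` are illustrative names.

```python
# (run_time_s, cpu_time_s, credit) rows copied from the post above.
# The column meaning is an assumption, not something the post states.
TASKS = [
    (23402.30, 23335.39, 96300.00),
    (22216.21, 22108.95, 111300.00),
    (15161.22, 15109.74, 12750.00),
    (8710.36, 8682.41, 17250.00),
]


def credit_per_hour(run_time_s, credit):
    """Credit granted per hour of wall-clock run time."""
    return credit / run_time_s * 3600.0


if __name__ == "__main__":
    for run_time_s, _cpu_time_s, credit in TASKS:
        rate = credit_per_hour(run_time_s, credit)
        print(f"{run_time_s:>10.2f} s  {credit:>10.2f} credit  {rate:>9.0f} credit/hr")
```

Under that assumption the first pair comes out at roughly 15k and 18k credit/hr, and the second pair at roughly 3k and 7k credit/hr, which matches the "much lower" impression.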

Keith Myers
Joined: 13 Dec 17
Posts: 82
Credit: 9,586,188
RAC: 2,271
Level
Ser
Message 48972 - Posted: 16 Feb 2018 | 8:02:13 UTC

I've looked at that apps page before and the one labelled "New version of ACEMD" is dated January of 2017.

The 9.19 version we just got is obviously a much newer release, and the previous 9.14 version was originally released in April of 2017. So again, newer.

I wonder if the different allocation of credit has anything to do with molecule size, as was explained for how the credit for the QC tasks is awarded.

mmonnin
Joined: 2 Jul 16
Posts: 137
Credit: 177,213,592
RAC: 512,425
Level
Ile
Message 48973 - Posted: 16 Feb 2018 | 10:36:24 UTC - in response to Message 48972.

Exactly...'soon'

biodoc
Joined: 26 Aug 08
Posts: 114
Credit: 772,027,133
RAC: 551,520
Level
Glu
Message 48974 - Posted: 16 Feb 2018 | 13:25:34 UTC

No problems with the new linux app thus far after completing 18 long and 5 short tasks.
