Advanced search

Message boards : Number crunching : What's your CPU usage?

Author Message
wujj123456
Send message
Joined: 9 Jun 10
Posts: 16
Credit: 1,804,219,823
RAC: 3,640,288
Level
His
Scientific publications
watwatwatwat
Message 20353 - Posted: 6 Feb 2011 | 19:49:48 UTC

Hi everyone,

BOINC reports 0.26 CPU for a gpugrid task. However, my hardware monitor reports that the task consumes 1 cpu + 1 gpu.

Is there anyway to tell BOINC that a gpugrid task uses 1 CPU + 1 GPU so that BOINC won't run too many jobs? All jobs a significantly slowed down due to excessive resource contention and context switches.

Werkstatt
Send message
Joined: 23 May 09
Posts: 121
Credit: 321,525,386
RAC: 18,187
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20354 - Posted: 6 Feb 2011 | 21:59:45 UTC - in response to Message 20353.


Is there anyway to tell BOINC that a gpugrid task uses 1 CPU + 1 GPU so that BOINC won't run too many jobs? All jobs a significantly slowed down due to excessive resource contention and context switches.

If you are using an app_info.xml, yes.
Find the part
<avg_ncpus>0.260000</avg_ncpus>
<max_ncpus>0.26000</max_ncpus>

and change these two numbers to 1.0

wujj123456
Send message
Joined: 9 Jun 10
Posts: 16
Credit: 1,804,219,823
RAC: 3,640,288
Level
His
Scientific publications
watwatwatwat
Message 20356 - Posted: 7 Feb 2011 | 5:31:28 UTC - in response to Message 20354.

Thanks. Where can I get or generate a default app_info.xml? I searched forum, but didn't find any.

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20357 - Posted: 7 Feb 2011 | 7:06:38 UTC - in response to Message 20356.

Create a new file.

Werkstatt
Send message
Joined: 23 May 09
Posts: 121
Credit: 321,525,386
RAC: 18,187
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20361 - Posted: 7 Feb 2011 | 11:56:01 UTC - in response to Message 20357.
Last modified: 7 Feb 2011 | 12:07:58 UTC

Create a new file.

Your'e joking ...

Here is mine. You need to make the changes as described.

<app_info>
<app>
<name>acemd2</name>
</app>
<file_info>
<name>acemd2_6.13_windows_intelx86__cuda31.exe</name>
<executable/>
</file_info>
<file_info>
<name>cudart32_31_9.dll</name>
<executable/>
</file_info>
<file_info>
<name>cufft32_31_9.dll</name>
<executable/>
</file_info>
<file_info>
<name>tcl85.dll</name>
<executable/>
</file_info>
<app_version>
<app_name>acemd2</app_name>
<version_num>613</version_num>
<plan_class>cuda</plan_class>
<avg_ncpus>0.260000</avg_ncpus>
<max_ncpus>0.260000</max_ncpus>
<flops>2923397447.700002</flops>
<coproc>
<type>CUDA</type>
<count>1</count>
</coproc>
<file_ref>
<file_name>acemd2_6.13_windows_intelx86__cuda31.exe</file_name>
<main_program/>
</file_ref>
<file_ref>
<file_name>cudart32_31_9.dll</file_name>
</file_ref>
<file_ref>
<file_name>cufft32_31_9.dll</file_name>
</file_ref>
<file_ref>
<file_name>tcl85.dll</file_name>
</file_ref>
</app_version>
</app_info>

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20362 - Posted: 7 Feb 2011 | 13:38:42 UTC - in response to Message 20361.
Last modified: 7 Feb 2011 | 15:25:45 UTC

I was just saying that the anonymous platform file won't be there unless wujj123456 has already created, copied, or renamed it (unlikely, otherwise he would not be asking). By default app_info.xml does not exist, so a new file needs to be created.

It might be a good idea to maintain default app_info.xml files for GPUGrid.

For Vista/W7 the default location to place it is,
C:\ProgramData\BOINC\projects\www.gpugrid.net\

For Linux its the BOINC\projects\folder (location depends on installation).
wujj123456 is using 2.6.35-24-generic but has not attached for a month.

I'm not using Linux but I suspect acemd2_6.13_windows_intelx86__cuda31.exe may not be correct (Linux is x64 only, I think),

Werkstatt
Send message
Joined: 23 May 09
Posts: 121
Credit: 321,525,386
RAC: 18,187
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20363 - Posted: 7 Feb 2011 | 14:25:37 UTC

Oh, I did not check the operating system. My mistake.
To make that clear, my app_info is for windows, I'm running win7x64.

So could someone with Linux experience please post the necessary changes? THX.

Werkstatt
Send message
Joined: 23 May 09
Posts: 121
Credit: 321,525,386
RAC: 18,187
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20364 - Posted: 7 Feb 2011 | 14:36:57 UTC

I looked around a little bit.
Saenger posted this Message 19291 - Posted 6 Nov 2010 9:36:13 UTC

<app_info>
<app>
<name>GPUGrid</name>
<user_friendly_name>GPUGrid</user_friendly_name>
</app>
<file_info>
<name>acemd2_6.04_x86_64-pc-linux-gnu__cuda</name>
<executable/>
</file_info>
<file_info>
<name>libcudart.so.2</name>
<executable/>
</file_info>
<app_version>
<app_name>GPUGrid</app_name>
<version_num>604</version_num>
<flops>1.0e11</flops>
<avg_ncpus>0.05</avg_ncpus>
<max_ncpus>0.05</max_ncpus>
<coproc>
<type>CUDA</type>
<count>1.0</count>
</coproc>
<file_ref>
<file_name>acemd2_6.04_x86_64-pc-linux-gnu__cuda</file_name>
<main_program/>
</file_ref>
<file_ref>
<file_name>libcudart.so.2</file_name>
</file_ref>
</app_version>
</app_info>

maybe it's outdated, but could be a start for your experiments.

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20365 - Posted: 7 Feb 2011 | 15:34:16 UTC - in response to Message 20364.
Last modified: 7 Feb 2011 | 15:35:26 UTC

I think it's acemd2_6.13_x86_64-pc-linux-gnu__cuda31,

<app_info>
<app>
<name>GPUGrid</name>
<user_friendly_name>GPUGrid</user_friendly_name>
</app>
<file_info>
<name>acemd2_6.13_x86_64-pc-linux-gnu__cuda31</name>
<executable/>
</file_info>
<file_info>
<name>libcudart.so.2</name>
<executable/>
</file_info>
<app_version>
<app_name>GPUGrid</app_name>
<version_num>610</version_num>
<flops>1.0e11</flops>
<avg_ncpus>1.00</avg_ncpus>
<max_ncpus>1.00</max_ncpus>
<coproc>
<type>CUDA</type>
<count>1.0</count>
</coproc>
<file_ref>
<file_name>acemd2_6.13_x86_64-pc-linux-gnu__cuda31</file_name>
<main_program/>
</file_ref>
<file_ref>
<file_name>libcudart.so.2</file_name>
</file_ref>
</app_version>
</app_info>

Thanks Werkstatt,

wujj123456
Send message
Joined: 9 Jun 10
Posts: 16
Credit: 1,804,219,823
RAC: 3,640,288
Level
His
Scientific publications
watwatwatwat
Message 20366 - Posted: 7 Feb 2011 | 17:31:55 UTC - in response to Message 20365.

Thank you very much, Werkstatt and skgiven. I will give this a shot tonight and post the outcome here with my Linux app_info.xml.

PS: I detached several times due to some mysterious hang-ups when my gpu is heavily loaded for hours, along with some blue screens when gaming under Windows. I finally fixed the overclocking settings that I misunderstood and I am coming back crunching numbers. :)

wujj123456
Send message
Joined: 9 Jun 10
Posts: 16
Credit: 1,804,219,823
RAC: 3,640,288
Level
His
Scientific publications
watwatwatwat
Message 20374 - Posted: 8 Feb 2011 | 4:30:44 UTC

The xml didn't work right away. Looks like some version/file mismatch.
I changed the <name> field to acemd2 and the <version_num> to 613. Then I got the job, but had computation error in 1 minute. Btw, i only want to run acemd standard and beta. Is acemd2 the right app? libcudart.so.2 doesn't exist on server as well. It was libcudart.so.1.xx in gpugrid folder before I use app_info.xml, but it's libcudart.so.3 in my cuda library directory. Which one will gpugrid use? The one in www.gpugrid.net folder or the one in my cuda library path?

I will run a couple of default jobs to see what executable/library acemd uses. Should be easy to fix once I figured out the version and library.

Thank you again for your help. Good to know a solution exists. I will let you know if I make any progress.

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20375 - Posted: 8 Feb 2011 | 7:23:08 UTC - in response to Message 20374.

Makes sense for a GTX260. We were blind to your system (nothing in a month).

Instead of using, ACEMD2: GPU molecular dynamics v6.13 (cuda31), you will need to use ACEMD: GPU molecular dynamics v6.12 (cuda30) and appropriate drivers.

Possibly,
acemd2_6.12_x86_64-pc-linux-gnu__cuda30

wujj123456
Send message
Joined: 9 Jun 10
Posts: 16
Credit: 1,804,219,823
RAC: 3,640,288
Level
His
Scientific publications
watwatwatwat
Message 20392 - Posted: 10 Feb 2011 | 7:02:14 UTC - in response to Message 20375.

Looks like the workload changed since I detached last time. The units I got recently really uses only 4% of CPU, which is excellent! Now I can easily keep GPU busy, without hurting the CPU jobs.

Thank you for all your help. I will wait until next application to see if I need the app_info.xml.

Post to thread

Message boards : Number crunching : What's your CPU usage?

//