Advanced search

Message boards : Multicore CPUs : long running process

Author Message
Mitchell
Send message
Joined: 18 Aug 07
Posts: 15
Credit: 22,771,987
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 801 - Posted: 28 Jan 2008 | 22:52:54 UTC

I have a ps3 wu running at 45 hours now and its at 86% but shows time running increasing, and time left at over 5 hours and not decreasing. Is this wu dead and should be aborted, or do some wu\'s actually run this long? I\'ve even restarted boinc with the wu\'s still acting the same. I\'d hate to lose all the processing and abort the wu. Any ideas?

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 802 - Posted: 28 Jan 2008 | 23:23:53 UTC - in response to Message 801.

I have a ps3 wu running at 45 hours now and its at 86% but shows time running increasing, and time left at over 5 hours and not decreasing. Is this wu dead and should be aborted, or do some wu\'s actually run this long? I\'ve even restarted boinc with the wu\'s still acting the same. I\'d hate to lose all the processing and abort the wu. Any ideas?



write \"top\" in terminal and see the CPU usage of the workunit (it should be around 130%).
WUs should not last longer than 24-26 hours. If you have some other process slowing the processor it will last much longer.

gdf

Mitchell
Send message
Joined: 18 Aug 07
Posts: 15
Credit: 22,771,987
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 804 - Posted: 29 Jan 2008 | 2:34:07 UTC - in response to Message 802.

I have a ps3 wu running at 45 hours now and its at 86% but shows time running increasing, and time left at over 5 hours and not decreasing. Is this wu dead and should be aborted, or do some wu\'s actually run this long? I\'ve even restarted boinc with the wu\'s still acting the same. I\'d hate to lose all the processing and abort the wu. Any ideas?



write \"top\" in terminal and see the CPU usage of the workunit (it should be around 130%).
WUs should not last longer than 24-26 hours. If you have some other process slowing the processor it will last much longer.

gdf



Here is a typical top output. I\'ve never see the processes go beyond 100% to near 130% are you said, and actually I have 2 ps3 running processes that have exceeded the time aloted to complete and both are running at over 2 days, show 2-7 hours to complete. I guess I wont get any credit for them anyhow will I, so I should abort them. How does grid project treat wu turned in later? no credit awarded even though I\'ve run them for over 2 days ? Can they look at the wu\'s and tell what wrong with them ?

Mitchell
Send message
Joined: 18 Aug 07
Posts: 15
Credit: 22,771,987
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 805 - Posted: 29 Jan 2008 | 2:34:31 UTC - in response to Message 804.

I have a ps3 wu running at 45 hours now and its at 86% but shows time running increasing, and time left at over 5 hours and not decreasing. Is this wu dead and should be aborted, or do some wu\'s actually run this long? I\'ve even restarted boinc with the wu\'s still acting the same. I\'d hate to lose all the processing and abort the wu. Any ideas?



write \"top\" in terminal and see the CPU usage of the workunit (it should be around 130%).
WUs should not last longer than 24-26 hours. If you have some other process slowing the processor it will last much longer.

gdf



Here is a typical top output. I\'ve never see the processes go beyond 100% to near 130% are you said, and actually I have 2 ps3 running processes that have exceeded the time aloted to complete and both are running at over 2 days, show 2-7 hours to complete. I guess I wont get any credit for them anyhow will I, so I should abort them. How does grid project treat wu turned in later? no credit awarded even though I\'ve run them for over 2 days ? Can they look at the wu\'s and tell what wrong with them ?


Using username \"mitch\".
mitch@192.168.0.12\'s password:
Last login: Thu Jan 24 15:09:57 2008 from 192.168.0.15
[mitch@localhost ~]$ more dave
top - 19:54:15 up 7 days, 33 min, 2 users, load average: 1.15, 1.09, 1.03
Tasks: 107 total, 2 running, 105 sleepin
g, 0 stopped, 0 zombie
Cpu(s): 23.2% us, 31.1% sy, 3.1% ni,
42.1% id, 0.1% wa, 0.2% hi,
0.1% si, 0
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
17702 mitch 25 0 97336 22m 1724 R 90 10.7 2896:04 ga_5.04_powerpc
17105 mitch 15 0 2736 1032 816 R 4 0.5 0:00.03 top
17501 mitch 15 0 19248 11m 2988 S 4 5.7 251:30.15 Xvnc
3511 root 16 0 2008 640 544 S 2 0.3 6:30.92 syslogd
1 root 15 0 2628 740 656 S 0 0.3 0:07.92 init
2 root 12 -5 0 0 0 S 0 0.0 0:00.00 kthreadd
3 root RT -5 0 0 0 S 0 0.0 0:00.61 migration/0
4 root 34 19 0 0 0 S 0 0.0 0:10.53 ksoftirqd/0
5 root RT -5 0 0 0 S 0 0.0 0:00.00 watchdog/0
6 root RT -5 0 0 0 S 0 0.0 0:00.51 migration/1

8 root RT -5 0 0 0 S 0 0.0 0:00.00 watchdog/1
9 root 10 -5 0 0 0 S 0 0.0 0:41.63 events/0
10 root 10 -5 0 0 0 S 0 0.0 0:40.68 events/1
11 root 10 -5 0 0 0 S 0 0.0 0:00.02 khelper
82 root 10 -5 0 0 0 S 0 0.0 0:01.11 kblockd/0
83 root 10 -5 0 0 0 S 0 0.0 0:00.54 kblockd/1
85 root 20 -5 0 0 0 S 0 0.0 0:00.00 ata/0
86 root 10 -5 0 0 0 S 0 0.0 0:00.00 ata/1
87 root 20 -5 0 0 0 S 0 0.0 0:00.00 ata_aux
90 root 10 -5 0 0 0 S 0 0.0 0:00.06 khubd
92 root 20 -5 0 0 0 S 0 0.0 0:00.00 kseriod
113 root 10 -5 0 0 0 S 0 0.0 0:00.00 ps3avd
128 root 10 -5 0 0 0 S 0 0.0 0:06.70 kswapd0
129 root 20 -5 0 0 0 S 0 0.0 0:00.00 aio/0
130 root 10 -5 0 0 0 S 0 0.0 0:00.00 aio/1
138 root 10 -5 0 0 0 S 0 0.0 2:13.15 ps3fb
721 root 19 -5 0 0 0 S 0 0.0 0:00.00 khvcd
747 mitch 15 0 9928 3996 2712 S 0 1.9 13:26.59 boinc
795 root 19 -5 0 0 0 S 0 0.0 0:00.00 khvcsd
955 root 10 -5 0 0 0 S 0 0.0 0:46.10 kjournald
999 root 11 -5 0 0 0 S 0 0.0 0:00.00 kauditd
1024 root 14 -4 2904 508 428 S 0 0.2 0:02.69 udevd
1108 root 12 -5 0 0 0 S 0 0.0 0:00.00 scsi_eh_1
1109 root 10 -5 0 0 0 S 0 0.0 0:00.00 ps3rom
[mitch@localhost ~]$

Post to thread

Message boards : Multicore CPUs : long running process

//