From mjbaysek at cs.cmu.edu Wed Jul 1 11:02:29 2009
From: mjbaysek at cs.cmu.edu (Michael J. Baysek)
Date: Wed, 01 Jul 2009 11:02:29 -0400
Subject: [auton-users] LOR2 Down
Message-ID: <4A4B7A85.70500@cs.cmu.edu>

Hi Lab.

LOR2 has been very unreliable of late. This compute node will be down
until further notice while diagnostics and corrective actions are
performed.

I will let you all know when it is back up and usable.

-Mike

--
Michael J. Baysek, Systems Analyst
Carnegie Mellon University - Auton Lab
www.cmu.edu - www.autonlab.org
412-268-8939

From mjbaysek at cs.cmu.edu Mon Jul 6 13:40:08 2009
From: mjbaysek at cs.cmu.edu (Michael J. Baysek)
Date: Mon, 06 Jul 2009 13:40:08 -0400
Subject: [auton-users] LOQ1 restarted
Message-ID: <4A5236F8.4080005@cs.cmu.edu>

If you were running anything on LOQ1, you may want to check your jobs,
as it has gone down and been restarted.

--
Michael J. Baysek, Systems Analyst
Carnegie Mellon University - Auton Lab
www.cmu.edu - www.autonlab.org
412-268-8939

From mjbaysek at cs.cmu.edu Fri Jul 31 16:33:44 2009
From: mjbaysek at cs.cmu.edu (Michael J. Baysek)
Date: Fri, 31 Jul 2009 16:33:44 -0400
Subject: [auton-users] LOS1 Abandoned Processes
Message-ID: <4A735528.6010202@cs.cmu.edu>

Hi Lab,

Please check the status page for LOS1. If you have jobs showing up
there, and they are no longer valid, please kill them. A current list
follows:

Please note that the status page has changed to show the PID of
individual threads. Therefore the PID you see on the status page is only
going to be the actual PID of your process if your job is running
single-threaded. If your process is single threaded, then there is no
change. Some processes like MATLAB spawn many threads, so be aware of
this when you look at the status page.

The change to the status page was necessary in order to properly display
processes which do all of their work in threads.
The motivation for this is that some programmers in the lab have begun
multi-threading some algorithms, and the status page was silently not
displaying those processes.

That said, it's best to run "ps -U yourusername" after logging in to the
machine to see all of the actual processes you have running.

 16:15:19 up 335 days, 17:15, 73 users, load average: 5.72, 5.20, 5.37
Tasks: 753 total, 6 running, 722 sleeping, 25 stopped, 0 zombie
Cpu(s): 96.5% us, 2.6% sy, 0.0% ni, 0.0% id, 0.0% wa, 0.0% hi, 0.9% si
Mem:  32399204k total, 25325240k used,  7073964k free,    17540k buffers
Swap:  5767160k total,  2680272k used,  3086888k free, 15713408k cached

  PID USER      PR  NI  VIRT  RES  SHR S  %CPU %MEM     TIME+ COMMAND
19812 sray      25   0 4381m 4.2g 1732 R 100.8 13.6 115:36.00 learning_curve
19814 sray      25   0 4381m 4.2g 1732 R  97.3 13.6 105:50.49 learning_curve
19811 sray      25   0 4381m 4.2g 1732 R  90.3 13.6 117:18.50 learning_curve
19813 sray      25   0 4381m 4.2g 1732 R  86.9 13.6 106:41.58 learning_curve
 3006 psarkar   16   0 33376  504  500 T   0.0  0.0 622:37.57 rwdisk
 3300 lujiec    16   0  9672   20   16 T   0.0  0.0   0:00.08 top
 8363 daria     15   0  593m  872  868 T   0.0  0.0   0:02.40 python
 8364 daria     16   0 18444  316  312 T   0.0  0.0   0:52.36 less
11017 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5 749:21.07 MATLAB
11086 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.00 MATLAB
11087 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.00 MATLAB
11088 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.00 MATLAB
11089 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.00 MATLAB
11090 lxiong    15   0 3892m 2.3g  31m T   0.0  7.5   0:00.00 MATLAB
11091 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.00 MATLAB
11092 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.00 MATLAB
11093 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.00 MATLAB
11094 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.00 MATLAB
11095 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.14 MATLAB
11096 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.27 MATLAB
11097 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.00 MATLAB
11098 lxiong    15   0 3892m 2.3g  31m T   0.0  7.5   0:00.02 MATLAB
11099 lxiong    16   0 3892m 2.3g  31m T   0.0  7.5   0:00.00 MATLAB
22096 yizhang1  15   0 1629m 3236 2428 R   0.0  0.0   0:02.98 MATLAB
24272 psarkar   16   0  4092   16   12 T   0.0  0.0   0:00.00 less
27007 daria     16   0 57364  744  480 T   0.0  0.0   0:00.00 bash
27147 daria     16   0 57364  720  452 T   0.0  0.0   0:00.00 bash
27149 daria     25   0  202m 196m  964 T   0.0  0.6   0:31.74 bt_predict
28485 psarkar   16   0  5384  440  436 T   0.0  0.0   0:00.00 sh
28486 psarkar   15   0  3932  396  392 T   0.0  0.0  49:49.52 sort

--
Michael J. Baysek, Systems Analyst
Carnegie Mellon University - Auton Lab
www.cmu.edu - www.autonlab.org
412-268-8939
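
For anyone who wants to pick their own stopped entries out of a listing like the one above, here is a minimal sketch in Python. It assumes the 12-column top-style layout shown in the message (PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND); the function name stopped_by_user is hypothetical and is not part of the status page or of any lab tool. Because the status page now reports thread IDs, the PIDs it returns may not be the process PID, so it is still worth confirming with "ps -U yourusername" before killing anything.

```python
from collections import defaultdict


def stopped_by_user(top_rows):
    """Group stopped (state 'T') rows of top-style output by user.

    Assumes each data row has the 12 columns shown in the listing:
    PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
    """
    stopped = defaultdict(list)
    for row in top_rows:
        # Split into at most 12 fields so a command with spaces stays whole.
        fields = row.split(None, 11)
        if len(fields) < 12 or not fields[0].isdigit():
            continue  # skip the header, summary lines, and blank lines
        pid, user, state, command = int(fields[0]), fields[1], fields[7], fields[11]
        if state == "T":
            stopped[user].append((pid, command))
    return dict(stopped)


# A few rows copied from the listing in the message.
sample = [
    "PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND",
    "19812 sray 25 0 4381m 4.2g 1732 R 100.8 13.6 115:36.00 learning_curve",
    "3006 psarkar 16 0 33376 504 500 T 0.0 0.0 622:37.57 rwdisk",
    "8363 daria 15 0 593m 872 868 T 0.0 0.0 0:02.40 python",
    "8364 daria 16 0 18444 316 312 T 0.0 0.0 0:52.36 less",
]
result = stopped_by_user(sample)
```

With the sample rows, result has one entry for psarkar (rwdisk) and two for daria (python and less); the running learning_curve row is left out because its state is R.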