[auton-users] LOW1 restored

Donghan (Jarod) Wang donghanw at cs.cmu.edu
Thu May 24 12:23:25 EDT 2012


Hi everyone,

The LOW1 compute node had to be rebooted unexpectedly this morning. Any
processes or sessions on that machine were killed. Please check your jobs.

To improve system stability, I'm investigating and setting up the OOM
(Out Of Memory) killer as a way to handle memory overcommit. When the
kernel runs out of memory, the OOM killer starts killing processes,
beginning with the greediest memory hog.

If you expect significant CPU or memory consumption, you may want to
reserve a host. To make a reservation, please refer to
http://www.autonlab.org/auton_intranet/computing/reservations.html

Thanks,
Jarod

-- 
Donghan (Jarod) Wang
Research Programmer
Robotics Institute
Carnegie Mellon University
5000 Forbes Avenue
Pittsburgh, PA 15213
Email: donghanw at cs.cmu.edu
Tel: +1 412 268 1238