Monitoring for auton cluster?

Predrag Punosevac predragp at andrew.cmu.edu
Wed Oct 16 13:34:19 EDT 2019


We have multiple monitoring systems. monit.int.autonlab.org:8080 for
functional monitoring (up and down view). observium.int.autonlab.org
for complete remote telemetry. You obviously need to be on the
internal network via X2go to launch the browser which can reach those
panels.

In both cases username is auton and the password is Dr.Who.

Cheers,
Predrag

P.S. I am CC-ing users as other people might have the same question.


On Wed, Oct 16, 2019 at 8:59 AM Viraj Mehta <virajm at andrew.cmu.edu> wrote:
>
> Hi Predrag,
>
> Hope you are well. I was going to run some code on the cluster and it occurred to me that it would be helpful if there was a place to go to see what machines are least loaded / have free GPUs. Does anything like that exist in our system? If not, I’ll go machine-by-machine and run nvidia-smi; htop but I was curious. Thanks a bunch.
>
> Viraj



More information about the Autonlab-users mailing list