GPUs 1-9 offline

Predrag Punosevac predragp at andrew.cmu.edu
Thu Dec 1 23:14:49 EST 2022


Hi Conor,

I just noticed myself. It is not just GPUs 1-9 it is also Denver. The
common thing for all those 10 servers is that they draw electricity from
the same Metered 17.3 kW PDU. Sure enough IPMI is off as well which
confirms that there is no electric power in that server RACK. Somebody cut
the electricity to the RACK A1-2A or PDU had a catastrophic failure. I am
now calling the server room to have them physically inspect the rack.

Best,
Predrag

On Thu, Dec 1, 2022 at 6:37 PM Conor Igoe <cigoe at cs.cmu.edu> wrote:

> Predrag,
>
> Sorry to bother you, but I was wondering if you knew why GPUs 1-9 are
> offline since earlier today?
>
> Best,
> *Conor*
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.srv.cs.cmu.edu/pipermail/autonlab-users/attachments/20221201/0f60640f/attachment.html>


More information about the Autonlab-users mailing list