Cluster is back online, fully functional
Artur Dubrawski
awd at cs.cmu.edu
Sun May 21 20:04:33 EDT 2023
This is amazing. Whenever anyone shuts down a system of such complexity,
many things can go wrong when the components are being brought back up.
Thank you very much, Predrag and Piotr, and congratulations on a job well
done!
Cheers
Artur
On Sun, May 21, 2023 at 7:19 PM Predrag Punosevac <predragp at andrew.cmu.edu>
wrote:
> Dear Autonians,
>
> The cluster is back online and fully functional. The majority of the work was
> done by Piotr with a bit of my assistance (ssh and the phone). Currently
> only two computing nodes are not operational.
>
> lov3 needs a new power supply. It will get it tomorrow morning as Piotr
> has to eat and sleep after 8h in the server room.
>
> gpu1 most likely has a dead RAM module. It is not booting. Piotr will see
> if we have a spare one or if we have to get it on eBay (the machine is
> almost 10 years old). I am not concerned about it. We will get more life
> out of that machine.
>
> Best,
> Predrag
>