<div dir="ltr">This is amazing, whenever anyone shuts down a system of such complexity, many things can go wrong when the components are being brought back up. Thank you very much Predrag and Piotr and congratulations on the job well done!<div><br></div><div>Cheers</div><div>Artur<br><div><br></div></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Sun, May 21, 2023 at 7:19 PM Predrag Punosevac <<a href="mailto:predragp@andrew.cmu.edu">predragp@andrew.cmu.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">Dear Autonians,<div><br></div><div>The cluster is back on-line and fully functional. The majority of work was done by Piotr with a bit of my assistance (ssh and the phone). Currently only two computing nodes are not operational.</div><div><br></div><div>lov3 needs a new power supply. It will get it tomorrow morning as Piotr has to eat and sleep after 8h in the server room.</div><div><br></div><div>gpu1 most likely has a dead RAM module. It is not booting. Piotr will see if we have a spare one or if we have to get it on Ebay (the machine is almost 10 years old). I am not concerned about it. We will get more life out of that machine. </div><div><br></div><div>Best,</div><div>Predrag</div></div>
</blockquote></div>