

20221027 / prepare for reboot...

There's a ghost network glitch that makes compute nodes unavailable.

“connection reset by peer”, “launch failed requeued held”, “JobHeldAdmin” → all the same: the node has gone rogue.

E5 and Lake partitions are in “DRAIN” mode, awaiting a general reboot of compute nodes.

E5-GPU and Cascade are already OK, as are the login and visualization nodes.

DO NOT WRITE DIRECTLY TO PSMN STAFF, USE THE WEB FORMS: Formulaires du PSMN

Stay tuned to this newsfeed.

newsfeed/20221027.1666877797.txt.gz · Last modified: 2022/10/27 13:36 by ltaulell