S'abonner au fil des news (flux RSS)
Bonjour, les serveurs de compilation E5-2670comp1 et E5-2670comp2 vont être arrêtés temporairement mardi 14/05/2019 pour permettre leur déplacement. L'arrêt durera au plus 2 heures.
Edit 2019/05/14 11h30
E5-2670comp1 et E5-2670comp2 sont OK
Une interruption de l'alimentation électrique a eu lieu hier mercredi 8 mai 2019 vers 13h, cela a causé l'arrêt de très nombreux serveurs. La remise en condition opérationnelle normale va durer au moins toute la matinée.
Hervé Gilquin pour le staff.
Scratch E5 is back from the dead, as fresh as a new born (hence empty).
Reminder, new scratch hierarchy:
/scratch/ ├── E5/ (existing E5 scratch, available to E5 cluster) ├── nvme/ (local to some servers) ├── ssd/ (local to some servers) ├── project_name (local to some servers, with dedicated hardware) ... └── X5/ (existing X5 scratch, available to X5 cluster)
We are up, except for some nodes not mounting homes and scratch E5 FUBAR.
Maintenance operations will occur next week.
Hi all,
FYI, next planified main power outage for ENS Lyon (Monod site) is scheduled for Saturday, April 20th 2019.
Planned stops for PSMN are:
Scheduling of restart will depend on maintenance operations from both PSMN and ENS (ASAP, when DC goes back to “fully operationnal”).
At this occasion, there will be an OS upgrade on compute servers (Debian 9.5 → 9.8), hence possible updates on softwares.
Request For Comments:
We would like to upgrade nvidia driver and CUDA devkit from 8 to '9.0 + 9.2' on all compute servers (including visualization ones).
We propose a new scratchs hierarchy, enabling easy inclusions of upcoming hardware.
/scratch/ ├── E5/ (existing E5 scratch, available to E5 cluster) ├── nvme/ (local to some servers) ├── ssd/ (local to some servers) ├── project_name (local to some servers, with dedicated hardware) ... └── X5/ (existing X5 scratch, available to X5 cluster)
Main documentation will be updated to include these changes. You will need to change your scripts accordingly.
E5 scratch is in bad shape and need a fresh cleanup. We propose to erase it and restart from zero.
If any comments, send it to staff.psmn, please.