Subscribe to the news feed (RSS)
A new partition with NVIDIA L4 GPUs is available; see the documentation: Cascade-GPU
We will also perform hardware modifications on the scratch servers of the Cascade cluster next Tuesday (17-12-2024). Expect problems with scratch/Cral and scratch/Cascade that day.
We need to perform maintenance on the nodes of the E5-GPU partition. The nodes are in draining mode, and jobs will not start until the maintenance is done.
EDIT, 2024-12-05: Most nodes done, queue OK.
Following a night of severe network problems, our slurmctl is DOWN. We are working on it.
EDIT 11:00: slurmctl is back ONLINE.
Following this weekend's power outage, we are still upgrading our servers.
Follow this newsfeed to stay updated.
EDIT 14:20: We are restarting, but not all nodes are available yet.
EDIT 16:30: Full restart.
Kind reminder: a complete power outage is scheduled for Saturday, Oct 26th 2024 (tomorrow).
As of now, we are starting to shut down our HPC infrastructure (partitions, then gateways, then nodes and filers).