S'abonner au fil des news (flux RSS)

Fil des news

20231114 / Xnfs/abc

The /Xnfs/abc volume will be moved to a new server Thursday 16th of November, in the morning.

Any nf (NextFlow) running at that time might need a restart if crashed.

2023/11/14 13:53 · ltaulell

20231030 / CRAL mounts (and homes)

A disk on data8 (main CRAL fileserver) was… not well. After a good hammer blow, all export were restarted.

homes and exports may have been unavailable for a few moments.

2023/10/30 11:14 · ltaulell

20231026 / startup complete

  • new scratch/Cascade is (finally) available
  • Lake-flix and Cascade-flix are open to everybody, for short duration (no longer than 2 days is best) small parallel and sequential jobs, with requeue in case of high priority jobs (see documentation)
2023/10/26 15:02 · ltaulell

20231025 / Slow start

We are restarting slowly:

  • E5, E5-GPU, Epyc are “ALL GREEN
  • Lake cluster
    • with a new partition Lake-flix, with preemption mode, doc has been updated
    • c6420nodes are still in upgrade progress (this is taking waaaay too long)
  • Cascade cluster
    • new scratch/Cascade is unavailable (cable/card IB problem)
    • with a new partition Cascade-flix, with preemption mode, doc has been updated

Any problem(s): please open a ticket.

As a reminder: the use of our web forms is not optional, it is our way of creating and tracking our intervention tickets :

Formulaires du PSMN

Thank you.

2023/10/25 09:10 · ltaulell

20231024 / upgrades in progress

Apart for a few expected hiccups, this power outage went smoothly. We are doing our last upgrades on fileservers (which still need a few reboots).

Expected restart this evening, at best tomorrow, we hope.

EDIT:

  • gateways are up, access to files should be OK.
  • slurmctl is still down/drain, nodes are (still) performing upgrades
2023/10/24 10:23 · ltaulell
news/blog.txt · Dernière modification : 2020/08/25 15:58 de 127.0.0.1