...



Model
  • Booster: Atos BullSequana X2135 "Da Vinci" single-node GPU blade
  • DCGP: Atos BullSequana X2140 three-node CPU blade

Racks
  • Booster: 116
  • DCGP: 22

Nodes
  • Booster: 3456
  • DCGP: 1536

Processors
  • Booster: single socket, 32-core Intel Ice Lake CPU (1 x Intel Xeon Platinum 8358, 2.60 GHz, TDP 250 W)
  • DCGP: dual socket, 56-core Intel Sapphire Rapids CPUs (2 x Intel Xeon Platinum 8480p, 2.00 GHz, TDP 350 W)

Accelerators
  • Booster: 4 x NVIDIA Ampere GPUs/node, 64 GB HBM2e, NVLink 3.0 (200 GB/s)
  • DCGP: -

Cores
  • Booster: 32 cores/node
  • DCGP: 112 cores/node

RAM
  • Booster: 512 GB (8 x 64 GB) DDR4 3200 MHz
  • DCGP: 512 GB (16 x 32 GB) DDR5 4800 MHz

Peak Performance
  • Booster: about 309 Pflop/s
  • DCGP: 9 Pflop/s

Internal Network
  • DragonFly+ 200 Gb/s (NVIDIA Mellanox Infiniband HDR)
  • Booster: 2 x dual port HDR100 per node
  • DCGP: single port HDR100 per node

Storage (raw capacity)
  • Capacity Tier: 137.6 PB based on DDN ES7990X and hard disk drives (620 GB/s)
  • Fast Tier: 5.7 PB based on DDN ES400NVX2 and solid state drives (1.4 TB/s)

Peak performance details

Node Performance

Theoretical Peak Performance
  • CPU (nominal/peak freq.): 1680 Gflops
  • GPU: 75000 Gflops
  • Total: 76680 Gflops
  • Memory Bandwidth (nominal/peak freq.): 24.4 GB/s

...

Please note that accounting is in terms of consumed core hours, but it also depends strongly on the requested memory, local storage and number of GPUs; please refer to the dedicated section.
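
To check how many core hours your projects have consumed and how many remain, the saldo tool can be used from the login nodes. A minimal sketch, assuming saldo is available here as on other CINECA systems:

    saldo -b    # list the accounts (budgets) associated with your username, with allocated and consumed core hours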

...


$HOME (total dimension 0.46 PiB, quota 50 GB per user)
  • permanent
  • backed up (suspended)
  • user specific
$CINECA_SCRATCH (total dimension 40 PiB, no quota)
  • HDD storage
  • temporary
  • user specific
  • no backup
  • automatic cleaning procedure of data older than 40 days (the time interval can be reduced in case of a critical usage ratio of the area; in this case, users will be notified via HPC-News)
$PUBLIC (total dimension 0.46 PiB, quota 50 GB per user)
  • permanent
  • user specific
  • no backup
$WORK (total dimension 30 PB, quota 1 TB per project)
  • permanent
  • project specific
  • no backup
  • extensions can be considered if needed (mailto: superc@cineca.it)
$FAST (total dimension 3.5 PB, quota 1 TB per project)
  • permanent
  • project specific
  • no backup
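
To check the current occupancy of these areas against their quotas, the cindata tool can be used. A minimal sketch, assuming cindata is available here as on other CINECA systems (the output format may differ):

    cindata               # per-area summary of used space and quota for your user and projects
    echo "$WORK" "$FAST"  # print the actual paths behind the environment variables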

...

A temporary storage area local to the nodes is also available on login and compute nodes (on the latter it is created when the job starts and removed when it ends) and is accessible via the environment variable $TMPDIR. This area is:

  • on the local SSD disks on the login nodes (14 TB of capacity), mounted as /scratch_local (TMPDIR=/scratch_local). This is a shared area with no quota; please remove all files once they are no longer needed. A cleaning procedure will be enforced in case of improper use of the area.
  • on the local SSD disks on the serial node (lrd_all_serial, 14 TB of capacity), managed via the slurm job_container/tmpfs plugin. This plugin provides a job-specific, private temporary file system space, with private instances of /tmp and /dev/shm in the job's user space (TMPDIR=/tmp, visible via the command "df -h"), removed at the end of the serial job. You can request the resource via the sbatch directive or srun option "--gres=tmpfs:XX" (for instance: --gres=tmpfs:200G, as shown in the sketch after this list), with a maximum of 1 TB for serial jobs. If not explicitly requested, /tmp has a default size of 10 GB.
  • on the local SSD disks on the DCGP nodes (3 TB of capacity). As on the serial node, the local /tmp and /dev/shm areas are managed via the plugin, which at the start of the job mounts private instances of /tmp and /dev/shm in the job's user space (TMPDIR=/tmp, visible via the command "df -h /tmp") and unmounts them at the end of the job (all data will be lost). You can request the resource via the sbatch directive or srun option "--gres=tmpfs:XX", up to the full 3 TB available on DCGP nodes. As on the serial node, if not explicitly requested, /tmp has a default size of 10 GB. Please note: for DCGP jobs the requested amount of the gres/tmpfs resource contributes to the consumed budget, changing the number of accounted equivalent core hours; see the dedicated section on Accounting.
  • in RAM on the diskless Booster nodes (with a fixed size of 10 GB; no increase is allowed, and the gres/tmpfs resource is disabled).
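
As an illustration of how the gres/tmpfs request and $TMPDIR fit together, here is a minimal job script sketch. The partition, account, application and input file names are placeholders to be replaced with your own:

    #!/bin/bash
    #SBATCH --nodes=1
    #SBATCH --ntasks-per-node=1
    #SBATCH --time=00:30:00
    #SBATCH --partition=dcgp_usr_prod   # placeholder partition name, replace with the one you use
    #SBATCH --account=<your_account>    # placeholder project account
    #SBATCH --gres=tmpfs:200G           # request 200 GB of job-local /tmp (serial and DCGP nodes only)

    # $TMPDIR points to the private, job-local /tmp mounted by the job_container/tmpfs plugin
    cd "$TMPDIR"
    cp "$WORK"/input.dat .              # stage input from the project area onto fast local storage
    ./my_app input.dat > output.dat     # hypothetical application
    cp output.dat "$WORK"/              # copy results back before the job ends: $TMPDIR is removed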

For a general discussion of the TMPDIR area, please see the dedicated section of Data storage and FileSystems.

...