System¶
Overview¶
The operating system on Jubail is Linux. Make sure you know the basics. Useful links:
Generic diagram of the cluster architecture and outside connectivity.
Hardware¶
The new HPC cluster includes the integration of previous HPC cluster Dalma with Jubail HPC cluster.
The Jubail HPC consists of more than 28K cores.
The CPU model is AMD EPYC 7742 64-Core Processor @2.25GHz,supporting AVX2.
480GB per node.
3.75 GB memory per core by default.
- 128 CPU cores per node, implicits the following:
If your code is serial and doesn’t has multithreading capabilities, use one core/CPU per job.
If your code doesn’t support MPI, or you don’t know what MPI is, use maximum 128 cores per job.
For MPI jobs using more than one node, always use a number of cores divisible by 128, to utilize the full nodes.
The Dalma HPC consists of more than 12K CPU cores. But it is very unlikely that your code can scale up to use them all (contact us directly if you are confident). From the user perspective, here are the important specifications for most nodes:
The CPU model is Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz, supporting AVX2.
4 GB memory per core by default.
- 28 CPU cores per node, implicits the following:
If your code doesn’t support MPI, or you don’t know what MPI is, use maximum 28 cores per job.
For MPI jobs using more than one node, always use a number of cores divisible by 28, to utilize the full nodes.
Contact us if you need special configuration (extra large memory, GPU, etc…)
Typical Workflow¶
(One time only) Let us know your computational requirement.
(One time only) Apply an HPC account and pass our quiz.
If needed, transfer your input data to the HPC.
Log on to HPC login nodes.
Submit jobs on login nodes.
Your jobs will queue for execution.
Once done, examine the output.
Summary of Nodes¶
Node Type |
Num Nodes |
CPUs / Node |
Memory / node |
GPUs / node |
Remarks |
---|---|---|---|---|---|
Jubail Compute |
224 |
128 |
480GB |
None |
New HPC Compute nodes |
Jubail Gpu |
20 |
128 |
480GB |
1/2/3 (Nvidia A100) |
New HPC GPU nodes |
Dalma Compute |
428 |
28/40 |
102 GB / 480 GB |
None |
Small jobs < 28 CPUs will be sent to Dalma |
Dalma GPUs |
14 |
40 |
360 GB / 1 TB |
2/8 (Nvidia V100) |
Two nodes have 8 GPU cards each, rest of them have 2 cards each |
Bigmem |
5 |
32/64/72/128 |
1 TB / 2TB |
None |
Used when memory requirement per node is greater than 500GB |
Visual |
4 |
32 |
105 GB |
2 (Nvidia Quadro P4000) |
Used for GUI |
Access¶
Once your account is ready, you can access Jubail HPC with Linux or Mac in NYU AD/NY network. Simply ssh in your local terminal:
ssh <NetID>@jubail.abudhabi.nyu.edu
If you use Windows or outside NYU AD/NY network, follow the instructions here: Access Jubail.
Whenever you login, you land up on one of the four login nodes, which is shown on left most section of
your terminal.It may look something like [wz22@login2 ~]$
suggesting that you are on the second login node.
Access allowed by pam_access
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Welcome to Jubail!
For documentation & examples: https://crc-docs.abudhabi.nyu.edu
For support: nyuad.it.help@nyu.edu
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Last login: Wed Feb 15 15:27:08 2023 from 10.224.42.159
Disk quotas for wz22 (uid 3387153):
DISK SPACE # FILES (1000's)
filesystem size quota number quota
-------------------------- --------------------------
/home 10099MB 20GB ( 49%) 77 150 ( 52%)
/scratch 53GB 5000GB ( 1%) 74 500 ( 15%)
/archive 24GB 5120GB ( 0%) 1 125 ( 1%)
[wz22@login2 ~]$
Important
Please refrain from running jobs on the login nodes. This can lead to your account getting suspended.