Storage¶
We have 3 storage systems for you: $HOME (/home/<NetID>), $SCRATCH (/scratch/<NetID>) and $ARCHIVE (/archive/<NetID>).
Note
Kindly note that $WORK is now discontinued. $WORK and $ARCHIVE are mereged together to $ARCHIVE
to incoporate both their functionalities into one. Kindly refer the doc here
for the major changes
In short, you should
- Put all your data to - $SCRATCHand run your jobs from there.
- Only a small persistent fraction to - $HOME(e.g., source code, executables).
- For long-term storage, archive them to - $ARCHIVE.
- $ARCHIVEis not visible on compute nodes but mountable on your local workstation, best suited to quick post-processing, analysis and visualization, without moving your data.
Important
Backing up is a user’s own responsibility. E.g., if a user deleted something accidentally, we can not recover, unfortunately.
Caution
Running jobs from /home is a serious violation of HPC policy. Any users who intentionally violate this policy will get their account suspended.
$HOME SSDs are not designed for this purpose, it will kill the SSDs quickly.
Summary¶
| $HOME | $SCRATCH | $ARCHIVE | |
|---|---|---|---|
| Use for storing | source code / executables | data | anything | 
| Accessible From | login / compute | login / compute | login | 
| Use to Run Jobs | No | Yes | No | 
| Retention Time (Days) | No Limit | 90 | No Limit | 
| Mountable | No | No | Yes | 
| Default Quota | 50GB, 500K Files | 5TB, 500K Files | No Limit | 
Know Your Quota¶
- Run - myquotacommand in the terminal on the HPC to check your current usage and quota where it is only accessible from the login nodes. Example output:
                DISK SPACE                # FILES (1000's)
filesystem       size      quota            number      quota
            --------------------------   --------------------------
     /home       92KB       50GB (  0%)         0       500 (  0%)
  /scratch        4KB     5000GB (  0%)         0       500 (  0%)
  /archive        4KB     5120GB (  0%)         0       125 (  0%)
For data transfers please refer to the Data Trasnfers section here
Best Practices¶
| Dos | Remarks | 
|---|---|
| Periodically clean your  | Files which have not been accessed for 90 days in  | 
| Once a project is completed move the data over to  | Moving data to  | 
| Use tar files to archive directories with large file count | Lesser the number of files, faster is the archiving and dearchiving process |