The Abacus system is equipped with a fast storage device that is mounted via NFS on all machines under the path /storage
. This storage serves as the primary location for storing and managing all data, projects, containers, and code. Within /storage
, each unit has a dedicated directory named after the unit (e.g., /storage/DSH
). These directories are assigned to the respective Linux group of the unit, allowing all members of that unit to read from and write to their designated folder. This disk is regularly backed up.
Users must avoid to put working data and model in their /home
directory on the login
. Use this for operational data, not for heavy files. For those you should use the storage dedicated to your unit and the NMVe storage on each machine.
When creating a folder within the unit’s directory, the folder will automatically be owned by the user who created it and assigned to their primary group. However, by default, also members out of the unit may still be able to read the contents. To restrict access so that only the folder owner can read and modify the contents, it is recommended to change the folder’s permissions using the command:
chmod 770 "path/to/folder"
<aside>
💡 Hint:
You can change the default behavior by setting umask 007
in your ~/.bashrc
file. While chmod <numbers>
allows you to assign permissions, umask <numbers>
removes them. A reasonable umask
could be umask 022
, which gives all permissions to the user but restricts write access for the group and others.
For further info on Linux permissions, have a look here.
</aside>
<aside>
💡 For members of the DSH unit, the /storage/DSH
directory is further divided into two subdirectories: projects
and personal
.
personal
directory is intended for individual experiments and exploratory projects, such as testing new architectures.projects
directory is reserved for official lab projects (e.g., D34h
).When setting up a new project, please follow these steps:
projects
directory, using the project’s name.chgrp "group_name" "path_to_project"
</aside>
Additionally, every machine in the Abacus cluster is equipped with a local NVMe fast disk mounted at /mnt/md0/data
. This storage is intended for temporary data and is optimized for high-speed I/O operations, such as reading data for training models or writing generated data. Please note that the contents of these disks are temporary and will be automatically deleted one week after their creation. Users should ensure that any important data is moved to permanent storage before this period elapses.
You can perform this operations either on the login-node or by starting an interactive job with appropriate mounts (preferred).
Create a Project Folder: Start by creating a folder under your unit’s directory on the main storage (/storage/unit_name
) for your specific project. This folder will be used to store data, container images, and codes relevant to the project.
mkdir /storage/unit_name/project_name
Prepare for Job Execution:
Set Up Working Directory:
/data
for your project (e.g., /data/project_name
)./mnt/md0/data
).Work with the Data: Perform your computations, data processing, or model training using the data stored in the /data/project_name
directory within your container. save your outputs in the same /data/project_name
directory.
Save and Backup Results:
/storage/unit_name/project_name
) for long-term storage and further analysis.