System administration for HPC
Location: Online
Organisers
Trainers
- Alan O'Cais (University of Barcelona)
- Helena Vela (HPCNow! - Do IT Now Group)
BioNT - BIO Network for Training - is an international consortium of academic entities and small and medium-sized enterprises (SMEs). BioNT is dedicated to providing a comprehensive training program and fostering a community for digital skills relevant to the biotechnology industry and biomedical sector. With a curriculum tailored for both beginners and advanced professionals, BioNT aims to equip individuals with the necessary expertise in handling, processing, and visualising biological data, as well as utilising computational biology tools. Leveraging the consortium's strong background in digital literacy training and extensive network of collaborations, BioNT is poised to professionalise life sciences data management, processing, and analysis skills.
Here, we are offering a 3-day workshop, composed of 3 full-day sessions, with the primary goal of introducing participants to the daily tasks of HPC system administrators through realistic scenarios using industry-standard tools and technologies. This practical training is intended for junior system administrators, technical staff, or Linux users transitioning into HPC environments. The program blends foundational system administration concepts with hands-on HPC-specific practices.
Day 1 focuses on system administration: user and group management, permissions, filesystems, package and service management, and firewalls.
Day 2 introduces HPC cluster-specific operations, with an in-depth look at the Slurm workload manager, and modern container technologies like Docker and Singularity.
Day 3 covers automation (Ansible), monitoring (Prometheus, Grafana), and software stack management (EasyBuild, EESSI, and Spack).
This workshop offers a comprehensive, practical introduction to HPC system administration, empowering junior and aspiring administrators to confidently support and grow HPC infrastructures.
Join this workshop if you are:
- A junior or aspiring system administrator entering HPC cluster management
- A technical support staff or Linux user scaling up to support multi-user computing environments
- In academia, research, or industry and need to manage scientific applications or HPC software stacks
- Curious about modern HPC tools such as Ansible, Prometheus, EasyBuild, and containers
Learning oucomes:
By the end of this workshop, you will be able to:
- Apply core Linux system administration skills in an HPC context
- Manage shared filesystems, users, services, and packages
- Set up and administer Slurm workload manager from a system perspective
- Deploy and support scientific applications in containers
- Automate configuration using Ansible
- Monitor and troubleshoot cluster health with Prometheus and Grafana
- Build and manage HPC software environments with EasyBuild and EESSI
Requirements:
- The lessons require you to have access to a terminal application with ssh capabilities. If it is unclear what this requirement means, please click here for guidance on how to make this available for your operating system
- There is no need for programming or informatics skills but a prior knowledge of file systems and the Unix shell is required. If you wish to participate but do not meet these prerequisites, we recommend watching the video recording from our previous workshop, available in the BioNT Lhumos space here.
- PC/Laptop with an up-to-date browser. Chrome, Safari and Firefox browsers are all supported (some older browsers, including Internet Explorer version 9 and below, may not be)
Recommendations for your setup and interacting during the workshop:
- To follow the workshop more efficiently, we recommend having a two-screen setup: for example, one to display the instructor’s shared screen and the collaborative pad, and another one for your own coding.
- To actively communicate during the workshop, please familiarise yourself with Markdown formatting by reviewing the HedgeDoc features document
Interaction between participants, trainers and helpers
The workshop will be delivered in a Zoom webinar format, with participants’ visibility disabled to preserve their privacy. You, as a participant, will be able to see and learn from the trainers but a direct interaction (e.g. chat or voice) will not be possible during the sessions. Instead, a collaborative document, previously setup by the trainers, will be shared with you before the session. You will be expected to engage and interact anonymously with other participants as well as with the workshop helpers and trainers directly in this document.
Trainer Hubs
All BioNT workshops are offered at no cost, but there are a limited number of seats available. To make workshops more accessible for members of the same company we highly recommend organising what we refer to as "Training Hubs." In this arrangement, one person is formally registered for the workshop, but the knowledge sharing can be expanded to numerous colleagues within their company or SME through live-streaming the session.
Topics
Day |
Topic |
Tutorial |
Day 1 |
Linux Administration and |
|
Day 2 |
More advanced slurm (Admin oriented) Container technologies |
|
Day 3 |
Orchestration, Metrics and Monitoring and Scientific Application Building |
|
How to register
The workshop is free of charge. To participate, please follow these steps:
To participate, please follow these steps:
- Click on the window “Participate” at the top of this page
- You will be redirected to the members.cecam.org page. If you already have an account on our platform, please proceed to step 5
- On the top-right corner click "Register" and complete the provided form. As indicated, completing this form does not register you to the workshop. Within 72 hours you will receive an email confirming your account has been activated. Due to this processing time, we advise you to register a few days before the registration deadline
- After receiving the account activation confirmation, visit the workshop page again and follow instructions starting from step 1
- You should now have an active account. After login in with your login details, you should be redirected to the workshop registration page
- In order to start your registration please follow the instructions of the linked pre-workshop survey until you will get your unique identifier
- To finalise your registration please use the unique identifier in the CECAM platform in the corresponding section and press “Send mail”
- Your application is now submitted for evaluation. If selected, you will be contacted later to confirm your attendance and provide instructions for installing the required software and participating in the online workshop.
References
Silvia Di Giorgio (ZB MED – Information Centre for Life Sciences) - Organiser
Teresa Müller (University of Freiburg) - Organiser
Helena Vela (HPCNow! - Do IT Now Group) - Organiser