9/10
5 Days
High-Performance Computing (HPC) systems are at the core of advanced research, data-intensive industries, and modern enterprise workloads. This course equips IT professionals and system administrators with the knowledge and hands-on skills to configure, manage, and optimize HPC clusters. Participants will gain practical experience in Linux-based administration, workload scheduling, security, and performance monitoring, preparing them to manage HPC infrastructure in both research and industry contexts.
Instructor-led interactive sessions
Hands-on labs with HPC cluster environments
Real-world case studies and benchmarking tools
Group exercises and scenario-based projects
Understand HPC architecture, components, and operating environments.
Install, configure, and manage Linux-based HPC systems.
Deploy and manage workload schedulers such as Slurm.
Configure essential services (NFS, SSH, IPMI, DNS) for cluster operations.
Monitor, benchmark, and optimize HPC system performance.
Apply security measures to protect HPC infrastructure.
Document configurations and maintain long-term system health.
Day 1: HPC Foundations
Session 1: Introduction to High-Performance Computing and cluster architecture
Session 2: Linux refresher for system administrators
Session 3: HPC hardware components – compute, storage, interconnects
Session 4: Access and management – SSH, IPMI basics
Day 2: Core Services for HPC Systems
Session 1: Network services – DNS, DHCP, NFS setup
Session 2: Shared storage and data access in clusters
Session 3: Remote management and monitoring tools
Session 4: Hands-on lab – setting up NFS and SSH for HPC nodes
Day 3: Workload Scheduling and Application Management
Session 1: Job scheduling concepts and introduction to Slurm
Session 2: Installing and configuring workload managers
Session 3: Compiling and running applications on multi-node clusters
Session 4: Lab – submitting jobs and managing queues
Day 4: Performance Optimization and Security
Session 1: Benchmarking HPC systems and applications
Session 2: Performance tuning for compute and storage nodes
Session 3: HPC security basics – firewalls, intrusion detection, access control
Session 4: Exercise – securing and testing a cluster environment
Day 5: Documentation, Advanced Topics & Project
Session 1: Documentation and reporting best practices for HPC admins
Session 2: Containers and HPC (Docker, Singularity overview)
Session 3: Emerging trends – cloud HPC, hybrid infrastructure
Session 4: Capstone project – designing and presenting a mini HPC cluster management plan
We are open to customizing this program to align with your specific learning objectives. If your team has particular goals or areas they wish to focus on, we would be happy to tailor the course outline to meet those needs and ensure the program supports the achievement of your desired outcomes.
This course is built for tech professionals who are ready to step into the world of containers, orchestration, and cloud-native application development.
This course is for developers who want to stop treating Kubernetes like a black box and start building, deploying, and managing real applications inside it.
This course is built for system administrators, DevOps engineers, and cloud professionals who want to take control of Kubernetes from the inside out.
This hands-on course is designed for IT professionals who are ready to take control of their data protection strategies using Veeam Backup & Replication v12
This course is tailored for IT professionals who want to gain hands-on expertise in managing VMware vSphere environments—the industry-standard platform for data center virtualization.
This course is for professionals who want to master the art of designing scalable, secure, and cost-efficient cloud solutions on Amazon Web Services (AWS).
This course is ideal for system administrators and IT professionals who want to build a strong foundation in virtualization using VMware vSphere 6.7
This course is built for virtualization and cloud professionals who want to take their infrastructure skills into the Kubernetes era.
This course is built for IT professionals, architects, and cloud engineers who want to master the foundational skills of designing scalable, secure, and resilient infrastructure on AWS.
This course is crafted for system administrators and IT professionals who manage High Performance Computing (HPC) environments.
This hands-on training course is tailored for professionals preparing for the HashiCorp Certified: Terraform Associate exam.
This course is designed to prepare participants for the AWS Certified Cloud Practitioner certification exam, while building a solid foundation in core AWS cloud services, architecture, and security.
Lets Discuss