Batch system: Submitting jobs with SLURM
The queuing system is based on SLURM, an open-source, fault-tolerant, and highly scalable cluster management and job scheduling system for Linux clusters.
Slurm has three key functions:
- It allocates exclusive and/or non-exclusive access to resources (compute nodes) to users for some duration of time.
- It provides a framework for starting, executing and monitoring jobs on the set of allocated nodes.
- It arbitrates contention for resources by managing a queue of pending work.
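As a rough illustration of how these functions look from the user's side, here is a minimal sketch of a batch script. The job name, resource values, and the file name used further down are hypothetical placeholders, not recommended settings for this cluster. The `#SBATCH` lines request the allocation; the commands after them are what runs on the allocated node.

```shell
#!/bin/bash
# Minimal batch script sketch (all values are illustrative assumptions).
# sbatch reads the "#SBATCH" lines as options; bash treats them as comments.

# Hypothetical job name shown in the queue
#SBATCH --job-name=hello
# Request one compute node and a single task on it
#SBATCH --nodes=1
#SBATCH --ntasks=1
# Wall-time limit in HH:MM:SS
#SBATCH --time=00:05:00

# Slurm sets SLURM_JOB_ID inside a job; use a placeholder when run by hand.
job_id="${SLURM_JOB_ID:-not-in-a-job}"

# Report where the job is running (uname -n prints the node's hostname).
message="Job ${job_id} running on $(uname -n)"
echo "${message}"
```

Assuming the script is saved as `hello.sh` (a hypothetical name), it would typically be submitted with `sbatch hello.sh`. While it waits in the queue or runs, `squeue -u $USER` lists it, and `scancel` followed by the job ID removes it.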
For more information about SLURM, consult the guides and tutorials available on the SLURM website: the Quickstart user guide and the SLURM tutorials. There are also some useful links at portalusuarios.cesga.es.
Table of contents:
- Memory and time
- Basic commands
- Job array
- Multiple tasks on a job
- Tasks on specific cores
- Sample Job Scripts
- Job states
- Output files
- Email notifications