Slurm Workload Manager: Difference between revisions

Content deleted Content added
Undid revision 439171756 by 24.12.49.27 (talk)
Undid revision 439289589 by Raysonho (talk)
Line 1:
'''Simple Linux Utility for Resource Management''' (or simply '''SLURM''') is an [[opensource]] [[job scheduler]] used by many of the world's [[supercomputer]]s and computer clusters. It provides three key functions. First it allocates exclusive and/or non-exclusive access to resources (computer nodes) to users for some duration of time so they can perform work. Second, it provides a framework for starting, executing, and monitoring work (typically a parallel job such as [[Message Passing Interface|MPI]]) on a set of allocated nodes. Finally, it arbitrates contention for resources by managing a queue of pending jobs.
 
SLURM is the batch system on many of the [[TOP500]] supercomputers, including the second fastest one in the world, China's [[Tianhe-I1]]. SLURM is designed to handle thousands of nodes in a single cluster and can sustain throughput of 120,000 jobs per hour.
 
==History ==