Slurm Workload Manager: Difference between revisions

Content deleted Content added
m Typo fixing, possibly other edits, typo(s) fixed: user’s → user's (3)
Line 1:
{{Multiple issues|
{{primarysourcesprimary sources|date=July 2010}}
{{Notability|date=July 2010}}
}}
Line 36:
* arbitrating contention for resources by managing a queue of pending jobs.
 
Slurm is the workload manager on about 60% of the [[TOP500]] supercomputers. {{Citation Neededneeded|date=May 2018}}
 
Slurm uses a [[curve fitting|best fit algorithm]] based on [[Hilbert curve scheduling]] or [[fat tree]] network topology in order to optimize locality of task assignments on parallel computers.<ref name=Eitan>{{Cite conference|doi=10.1007/978-3-642-04633-9_8|title=Effects of Topology-Aware Allocation Policies on Scheduling Performance|conference=Job Scheduling Strategies for Parallel Processing|series=Lecture Notes in Computer Science|year=2009|last1=Pascual|first1=Jose Antonio|last2=Navaridas|first2=Javier|last3=Miguel-Alonso|first3=Jose|isbn=978-3-642-04632-2|volume=5798|pages=138–144}}</ref>
Line 117:
* {{cite conference|doi=10.1007/10968987_3|url=http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.10.6834|title=SLURM: Simple Linux Utility for Resource Management|conference=Job Scheduling Strategies for Parallel Processing|series=Lecture Notes in Computer Science|year=2003|last1=Yoo|first1=Andy B.|last2=Jette|first2=Morris A.|last3=Grondona|first3=Mark|isbn=978-3-540-20405-3|volume=2862|page=44}}
{{Div col end}}
 
 
==SLURM Commands==
Line 123 ⟶ 122:
The following is a list of useful commands available for SLURM. Some of these were built by CCR to allow easier reporting for users.
 
For usage information for these commands, use --helpuse—help (example: sinfo --helpsinfo—help)
 
Use the linux command 'man' for more information about most of these commands (example: man sinfo)
Line 140 ⟶ 139:
|['''Cancel a pending or running job'''||scancel '''jobid'''
|-
|['''Check the state of a user’suser's jobs'''||squeue --usersqueue—user='''username'''
|-
|['''Allocate compute nodes for interactive use'''||salloc'''
Line 154 ⟶ 153:
|['''Get the efficiency of a running job'''||sueff '''user-name'''
|-
|['''Get SLURM accounting information for a user’suser's jobs from start date to now'''||suacct start-date '''user-name'''
|-
|['''Get SLURM accounting and node information for a job'''||slist '''jobid'''
|-
|['''Get resource usage and accounting information for a user’suser's jobs from start date to now'''||slogs '''start-date user-list'''
|-
|['''Get estimated starting times for queued jobs'''||stimes '''[various squeue options]'''