K-Scale Cluster

772 bytes added, 23:55, 24 May 2024
Andromeda Cluster
Don't run anything computationally expensive on the main node, or you will crash it for everyone. When you need to run experiments, reserve a GPU instead (see below).
 
==== SLURM Commands ====
 
Show all currently running jobs:
 
<syntaxhighlight lang="bash">
squeue
</syntaxhighlight>
 
Show your own running jobs:
 
<syntaxhighlight lang="bash">
squeue --me
</syntaxhighlight>
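
To narrow the list further, the standard <code>squeue</code> filters can be wrapped in a small shell function. This is only a sketch: <code>running_jobs</code> is a hypothetical helper name, not something installed on the cluster, and it uses <code>--user="$USER"</code> rather than <code>--me</code> on the assumption that some Slurm versions may predate the <code>--me</code> shorthand.

<syntaxhighlight lang="bash">
# running_jobs (hypothetical helper): show only your jobs that are in the
# RUNNING state, hiding pending ones. Extra arguments are passed through to
# squeue unchanged, for example: running_jobs --long
running_jobs() {
  squeue --user="$USER" --states=RUNNING "$@"
}
</syntaxhighlight>

Calling <code>running_jobs --long</code> adds the extra columns of the long format, such as each job's time limit.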
 
Show the available partitions on the cluster:
 
<syntaxhighlight lang="bash">
sinfo
</syntaxhighlight>
 
You'll see something like this:
 
<syntaxhighlight lang="bash">
$ sinfo
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
compute* up infinite 8 idle compute-permanent-node-[68,285,493,580,625-626,749,801]
</syntaxhighlight>
 
This means:
 
* There is one partition, called <code>compute</code>; the trailing <code>*</code> marks it as the default partition
* The partition has 8 nodes, all currently in the <code>idle</code> state
* The nodes have names like <code>compute-permanent-node-68</code>; the bracketed list is shorthand for all 8 names
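
The same reading of the columns can be scripted, for instance to check how many nodes are free before reserving one. The sketch below is illustrative only: <code>count_idle_nodes</code> is a hypothetical helper, and it assumes the default six-column <code>sinfo</code> layout shown above (a custom <code>--format</code> would shift the fields). It runs against the sample text so it can be tried without cluster access.

<syntaxhighlight lang="bash">
# Sample text copied from the sinfo output above; on the cluster,
# pipe the real `sinfo` output into the function instead.
sample='PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
compute* up infinite 8 idle compute-permanent-node-[68,285,493,580,625-626,749,801]'

# count_idle_nodes (hypothetical helper): skip the header row, then sum the
# NODES column (field 4) over every row whose STATE column (field 5) is "idle".
count_idle_nodes() {
  awk 'NR > 1 && $5 == "idle" { total += $4 } END { print total + 0 }'
}

idle_nodes=$(printf '%s\n' "$sample" | count_idle_nodes)
echo "idle nodes: $idle_nodes"

# On the cluster itself (not run here):
#   sinfo | count_idle_nodes
#   scontrol show hostnames 'compute-permanent-node-[625-626]'  # expands the bracket list
</syntaxhighlight>

For the sample above this prints <code>idle nodes: 8</code>. The <code>scontrol show hostnames</code> command in the final comment is the standard Slurm way to expand a bracketed node list into one hostname per line.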

==== Reserving a GPU ====