Changes

K-Scale Cluster

772 bytes added, 23:55, 24 May 2024

→‎Andromeda Cluster

Don't do anything computationally expensive on the main node or you will crash it for everyone. Instead, when you need to run some experiments, reserve a GPU (see below).

==== SLURM Commands ====

Show all currently running jobs:

squeue

</syntaxhighlight>

Show your own running jobs:

squeue --me

</syntaxhighlight>

Show the available partitions on the cluster:

sinfo

</syntaxhighlight>

You'll see something like this:

$ sinfo

PARTITION AVAIL TIMELIMIT NODES STATE NODELIST

compute* up infinite 8 idle compute-permanent-node-[68,285,493,580,625-626,749,801]

</syntaxhighlight>

This means:

* There is one compute node type, called <code>compute</code>

* There are 8 nodes of that type, all currently in <code>idle</code> state

* The node names are things like <code>compute-permanent-node-68</code>

==== Reserving a GPU ====

Ben

blockimmune, Bureaucrats, Administrators

488

edits

Humanoid Robots Wiki β

Changes

K-Scale Cluster

Humanoid Robots Wiki ^β