Open main menu

Humanoid Robots Wiki β

Changes

K-Scale Cluster

11 bytes added, 25 April
no edit summary
* You may be sharing your part of the cluster with other users. If so, it is a good idea to avoid using all the GPUs. If you're training models in PyTorch, you can do this using the <code>CUDA_VISIBLE_DEVICES</code> command.
* You should avoid storing data files and model checkpoints to your root directory. Instead, use the `<code>/ephemeral` </code> directory. Your home directory should come with a symlink to a subdirectory which you have write access to.
431
edits