Slurm Installation on Debian

From Supercomputación y Cálculo Científico UIS
Revision as of 15:50, 4 May 2015 by Ltorres (talk | contribs)


Slurm Installation

This section describes the administration tasks for the Slurm Workload Manager on the frontend node (server) and on the compute nodes (clients).

Slurm Installation on Debian

  1. Make sure the clocks, users and groups (UIDs and GIDs) are synchronized across the cluster.

  2. Install Slurm from the repositories - apt-get install slurm-llnl or apt-get install slurm-wlm, whichever package name your Debian release provides
  3. Open the file sftp://ip-node/usr/share/doc/slurmctld/slurm-wlm-configurator.html in a browser, where ip-node is the IP address of the slurmctld machine, and fill out the form. Copy the generated slurm.conf file to /etc/slurm.conf on each machine
  4. There are two ways to generate munge.key - run /usr/sbin/create-munge-key, or use dd: either dd if=/dev/random bs=1 count=1024 >/etc/munge/munge.key (recommended for the paranoid) or dd if=/dev/urandom bs=1 count=1024 >/etc/munge/munge.key (recommended for the impatient)
  5. Copy munge.key to the nodes - scp /etc/munge/munge.key root@nodes:/etc/munge/munge.key
  6. Edit the file /etc/passwd on each machine so that the munge entry reads munge:x:501:501::/var/run/munge:/sbin/nologin (the same UID and GID on every machine)
  7. Test munge on the local machine - munge -n | unmunge
  8. Test munge against a node - munge -n | ssh host1 unmunge
  9. Start Slurm - /etc/init.d/slurmctld start on the server machine and /etc/init.d/slurmd start on each node
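
Steps 4, 5 and 8 can be sketched as a small shell helper. This is only an illustration under stated assumptions, not part of the original procedure: the function names make_munge_key and print_node_plan are hypothetical, and so are any node names passed to them. The first function writes the key with the dd/urandom variant from step 4. The second only prints the scp and test commands for each node, so they can be reviewed before being run as root.

```shell
#!/bin/sh
# Sketch of steps 4, 5 and 8. The function names and the node names used
# with them are hypothetical; they do not come from the Slurm or MUNGE packages.

# Step 4 (urandom variant): write 1024 random bytes to the given key file
# and leave it readable by its owner only.
make_munge_key() {
    key_file=$1
    # umask 077 in a subshell so a newly created file is never group/world readable.
    ( umask 077 && dd if=/dev/urandom of="$key_file" bs=1 count=1024 2>/dev/null ) || return 1
    chmod 400 "$key_file"
}

# Steps 5 and 8: print, for each node, the command that copies the key
# and the command that tests a credential against that node.
# Nothing is executed here; the output is meant to be reviewed first.
print_node_plan() {
    key_file=$1
    shift
    for node in "$@"; do
        printf 'scp %s root@%s:%s\n' "$key_file" "$node" "$key_file"
        printf 'munge -n | ssh %s unmunge\n' "$node"
    done
}
```

For example, calling make_munge_key /etc/munge/munge.key on the server and then print_node_plan /etc/munge/munge.key host1 host2 prints two commands per node (host2 is a made-up second node). Printing instead of executing is a deliberate choice here, since copying a key as root to the wrong host is hard to undo.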