Would like to move to https://github.com/rug-cit-hpc/pg-playbooks but has large files...
Egon Rijpkema aa545626eb Added nhc and /apps 3 years ago
documentation Forgot role.. 4 years ago
group_vars/all Added profiling stuff. 3 years ago
promtools Use default slurm exporter (they accepted our PR) 3 years ago
roles Added nhc and /apps 3 years ago
.gitignore don't commit pyc files 4 years ago
ansible.cfg Using a prometheus server on knyft instead of prox 4 years ago
apps.yml Added nhc and /apps 3 years ago
common.yml These tasks are for all peregrine hosts. 4 years ago
etc_hosts.yml Updated '/etc/hosts' from live peregrine. 3 years ago
gpu.yml Switched to singular. 4 years ago
haswell_sym.yml Fix typo 3 years ago
hosts Added cgroup.conf and gres.conf. 3 years ago
hosts-dev recipy to build slurm docker 5 years ago
interactive.yml Refactored the touch alert to own role. 4 years ago
ipmi_exporter.yml Added ipmi monitoring 4 years ago
kill_memory_hogs.yml Made a script that kills user programs 3 years ago
ldap_client.yml ldap and lustre-client roles. 3 years ago
login.yml Refactored the touch alert to own role. 4 years ago
lustre_client.yml ldap and lustre-client roles. 3 years ago
lustre_exporter.yml Also on metadata 3 years ago
metadata.yml This playbook is still needed for the metadara role. 4 years ago
node_exporter.yml no longher using proxy 4 years ago
nvidia_smi_exporter.yml Added nvidia_smi_exporter for prometheus 4 years ago
prom_sql.yml Added role for prometheus sql exporter. 3 years ago
prometheus.yml Using a prometheus server on knyft instead of prox 4 years ago
readme.md Explaned a little bit more. 4 years ago
sandybridge_sym.yml Renamed file to make it similar to other files 3 years ago
site.yml Added skylake_sym.yml, renamed sandybridge file 3 years ago
skylake_sym.yml Make symlinks in /software for Skylake nodes 3 years ago
slurm.yml We're using singular group names now. 4 years ago
slurm_client.yml Added simple slurm client role 3 years ago
slurm_exporter.yml NO proxy client needed anymore. 4 years ago


Ansible playbooks for Peregrine

This repository contains an inventory and Ansible playbooks for the Peregrine cluster.

Install Slurm.

To install the Slurm server:

ansible-playbook --vault-password-file=.vault_pass.txt slurm.yml

Skip building of docker images.

Building the Docker images takes a lot of time and is only necessary when the Dockerfile has changed. You can skip the build with the following command.

ansible-playbook --vault-password-file=.vault_pass.txt slurm.yml --skip-tags build

Furthermore, you can prevent the services from starting immediately by passing the --skip-tags start-service flag.
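
The two tags can also be skipped in one run, since --skip-tags accepts a comma-separated list. The sketch below is a hypothetical helper (slurm_cmd is not part of this repository) that only prints the command line, so it can be checked before anything is executed. It assumes the vault password file is .vault_pass.txt, as in the commands above.

```shell
# Hypothetical helper: print (do not run) the slurm.yml command line,
# optionally skipping a comma-separated list of tags.
slurm_cmd() {
    cmd="ansible-playbook --vault-password-file=.vault_pass.txt slurm.yml"
    # Append --skip-tags only when a tag list was given.
    if [ -n "${1:-}" ]; then
        cmd="$cmd --skip-tags $1"
    fi
    echo "$cmd"
}

# Skip both the Docker build and the service start.
slurm_cmd "build,start-service"
# prints: ansible-playbook --vault-password-file=.vault_pass.txt slurm.yml --skip-tags build,start-service
```

Copy the printed line into the shell to run it.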

Setting the state of a single node.

To bring a single node's configuration up to date, for example after it has been rolled out via xCAT, run the following command.
It configures all state for that node (the Prometheus node exporter and, if it is a GPU node, GPU monitoring, among others).

ansible-playbook --limit pg-node023 site.yml
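
The --limit option also accepts a comma-separated list of hosts, so several nodes can be updated in one run. The sketch below is a hypothetical helper (site_cmd is not part of this repository, and pg-node024 is only an illustrative host name) that joins node names into such a list and prints the resulting command without running it. It assumes at least one node name is given.

```shell
# Hypothetical helper: print (do not run) the site.yml command line
# limited to the given nodes. Expects at least one node name.
site_cmd() {
    # Join all arguments with commas, then strip the trailing comma.
    limit=$(printf '%s,' "$@")
    limit=${limit%,}
    echo "ansible-playbook --limit $limit site.yml"
}

site_cmd pg-node023 pg-node024
# prints: ansible-playbook --limit pg-node023,pg-node024 site.yml
```

Adding --list-hosts to the printed command shows which hosts the pattern matches without changing anything on them.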