Server load
Benedikt Boecking
boecking at andrew.cmu.edu
Thu Nov 14 11:35:12 EST 2019
Hi all,
We are currently running into problems on several servers due to some users spawning too many threads. On several compute nodes we have several thousand threads running in parallel.
To be a good lab member please:
1. Use top/htop and nvidia-smi to monitor your resource usage and keep it reasonable (memory, cpu, and gpu usage).
2. Check that you aren’t using more threads than intended due to automatic multiprocessing. This happens for example with numpy and spicy linalg functions on the servers.You can control most of this behavior by setting the following variables before running scripts or notebooks, setting them to however many threads you want:
export MKL_NUM_THREADS=1
export NUMEXPR_NUM_THREADS=1
export OMP_NUM_THREADS=1
3. If you have (interactive) session open that you don’t need anymore please close them to free memory (in particular Matlab and jupyter notebooks)
Thanks,
Ben
More information about the Autonlab-users
mailing list