gpu20 and gpu21 provisioned

Jeff Schneider jeff4 at andrew.cmu.edu
Mon May 18 12:10:56 EDT 2020


Thanks for the hard work bringing these online!

On 5/15/2020 11:06 PM, Predrag Punosevac wrote:
> Dear Autonians,
>
> After about 12h of work you can finally log into the newest addition to
> our cluster: GPU20 and GPU21. These are some of the finest GPU nodes on
> the CMU campus. They came with a price tag of 37K each after 8K per unit
> education discount. I have no idea whom Dr. Jeff Schneider shook down
> for money but he surely knows how to do it. Each machine has 4 Tesla
> V100 GPU cards which have 32GB of GPU memory. That is sufficient to
> train 3D neural networks and to answer all the questions Dr. Dubrawski
> might ask you :-) On a serious note, we had really hard time getting
> these monsters onto the campus during the Cov19 and provisioning them
> under current circumstances. We didn't have deep rack space for these
> nor electricity so I am temporary borrowing space and electricity from
> somebody else. You will notice that network is only 1Gigabit as I could
> not get 10Gigabit network working with 30m Cat 5e which was needed to
> plug machines into our switch. That is how far they are actually
> physically located from our cluster.
>
> I will have to take sleep before adding scratch directories and
> installing MATLAB.
>
> Best,
> Predrag
>
> P.S. Dr. Schneider has more surprises but I will need to make a new trip
> to CMU in my hazmat suit before those babes are brought online.
>
>
>
>
>
>
>
> predragp at gpu20$ nvidia-smi
> Fri May 15 22:25:55 2020
> +-----------------------------------------------------------------------------+
> | NVIDIA-SMI 440.64.00    Driver Version: 440.64.00    CUDA Version:
> 10.2     |
> |-------------------------------+----------------------+----------------------+
> | GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile
> Uncorr. ECC |
> | Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util
> Compute M. |
> |===============================+======================+======================|
> |   0  Tesla V100-SXM2...  Off  | 00000000:61:00.0 Off |
>    0 |
> | N/A   42C    P0    53W / 300W |      0MiB / 32510MiB |      0%
> Default |
> +-------------------------------+----------------------+----------------------+
> |   1  Tesla V100-SXM2...  Off  | 00000000:62:00.0 Off |
>    0 |
> | N/A   41C    P0    53W / 300W |      0MiB / 32510MiB |      0%
> Default |
> +-------------------------------+----------------------+----------------------+
> |   2  Tesla V100-SXM2...  Off  | 00000000:89:00.0 Off |
>    0 |
> | N/A   40C    P0    56W / 300W |      0MiB / 32510MiB |      0%
> Default |
> +-------------------------------+----------------------+----------------------+
> |   3  Tesla V100-SXM2...  Off  | 00000000:8A:00.0 Off |
>    0 |
> | N/A   41C    P0    55W / 300W |      0MiB / 32510MiB |      0%
> Default |
> +-------------------------------+----------------------+----------------------+
>
>       
> +-----------------------------------------------------------------------------+
> | Processes:                                                       GPU
> Memory |
> |  GPU       PID   Type   Process name                             Usage
>      |
> |=============================================================================|
> |  No running processes found
>      |
> +-----------------------------------------------------------------------------+
>
>
>
> egp at gpu21$ pwd
> /zfsauton2/home/predragp
> predragp at gpu21$ nvidia-smi
> Fri May 15 22:52:16 2020
> +-----------------------------------------------------------------------------+
> | NVIDIA-SMI 440.64.00    Driver Version: 440.64.00    CUDA Version:
> 10.2     |
> |-------------------------------+----------------------+----------------------+
> | GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile
> Uncorr. ECC |
> | Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util
> Compute M. |
> |===============================+======================+======================|
> |   0  Tesla V100-SXM2...  Off  | 00000000:61:00.0 Off |
>    0 |
> | N/A   35C    P0    54W / 300W |      0MiB / 32510MiB |      0%
> Default |
> +-------------------------------+----------------------+----------------------+
> |   1  Tesla V100-SXM2...  Off  | 00000000:62:00.0 Off |
>    0 |
> | N/A   34C    P0    54W / 300W |      0MiB / 32510MiB |      0%
> Default |
> +-------------------------------+----------------------+----------------------+
> |   2  Tesla V100-SXM2...  Off  | 00000000:89:00.0 Off |
>    0 |
> | N/A   34C    P0    54W / 300W |      0MiB / 32510MiB |      0%
> Default |
> +-------------------------------+----------------------+----------------------+
> |   3  Tesla V100-SXM2...  Off  | 00000000:8A:00.0 Off |
>    0 |
> | N/A   34C    P0    57W / 300W |      0MiB / 32510MiB |      0%
> Default |
> +-------------------------------+----------------------+----------------------+
>
>       
> +-----------------------------------------------------------------------------+
> | Processes:                                                       GPU
> Memory |
> |  GPU       PID   Type   Process name                             Usage
>      |
> |=============================================================================|
> |  No running processes found
>      |
> +-----------------------------------------------------------------------------+
> predragp at gpu21$ ounted.


More information about the Autonlab-users mailing list