NVIDIA Docker Runtime: Difference between revisions

From WilliamsNet Wiki
Jump to navigation Jump to search
(Created page with "(originally from https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#docker) == CentOS 7/8 == <pre>sudo yum-config-manager --add-repo https:/...")
 
 
Line 27: Line 27:


Test the GPU performance using a simple NVIDIA GPU Cloud container with the CUDA nbody sample program
Test the GPU performance using a simple NVIDIA GPU Cloud container with the CUDA nbody sample program
  docker run -it --rm nvcr.io/nvidia/k8s/cuda-sample:nbody nbody -benchmark -fp64
  docker run -it --rm --gpus all nvcr.io/nvidia/k8s/cuda-sample:nbody nbody -benchmark -fp64


= Reloading Repository Certificates =
= Reloading Repository Certificates =

Latest revision as of 04:01, 19 December 2020

(originally from https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#docker)

CentOS 7/8[edit]

sudo yum-config-manager --add-repo https://nvidia.github.io/nvidia-docker/centos7/x86_64/nvidia-docker.repo
sudo yum install -y nvidia-docker2

Debian 10[edit]

Setup the stable repository and the GPG key:

distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \
   && curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add - \
   && curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list

Note: To get access to experimental features such as CUDA on WSL or the new MIG capability on A100, you may want to add the experimental branch to the repository listing:

curl -s -L https://nvidia.github.io/nvidia-container-runtime/experimental/$distribution/nvidia-container-runtime.list | sudo tee /etc/apt/sources.list.d/nvidia-container-runtime.list

Install the nvidia-docker2 package (and dependencies) after updating the package listing:

sudo apt-get update
sudo apt-get install -y nvidia-docker2

Finishing the install[edit]

Restart the Docker daemon to complete the installation after setting the default runtime:

sudo systemctl restart docker

Test nvidia-smi with the latest official CUDA image

sudo docker run --rm --gpus all nvidia/cuda:11.0-base nvidia-smi

Test the GPU performance using a simple NVIDIA GPU Cloud container with the CUDA nbody sample program

docker run -it --rm --gpus all nvcr.io/nvidia/k8s/cuda-sample:nbody nbody -benchmark -fp64

Reloading Repository Certificates[edit]

Rather frequently, it seems, the NVIDIA folks invalidate their signing certificates for their repositories. When that happens, you just need to delete the certificates and let the 'yum' command reload them on the next update:

$(sed -n 's/releasever=//p' /etc/yum.conf)
${DIST:-$(. /etc/os-release; echo $VERSION_ID)}
sudo rpm -e gpg-pubkey-f796ecb0
sudo gpg --homedir /var/lib/yum/repos/$(uname -m)/$DIST/nvidia-container-runtime/gpgdir --delete-key f796ecb0
sudo gpg --homedir /var/lib/yum/repos/$(uname -m)/$DIST/libnvidia-container/gpgdir --delete-key f796ecb0
sudo gpg --homedir /var/lib/yum/repos/$(uname -m)/$DIST/nvidia-docker/gpgdir --delete-key f796ecb0
sudo yum -y makecache