CUDA: Difference between revisions

Latest revision as of 16:15, 23 April 2024

Installation

I suggest using conda to install cuda for version control your project.

Note that nvidia-smi lists the maximum CUDA version supported by the GPU driver, not the installed version of CUDA.
You can have a different version of CUDA installed in each conda environment, independently of the version supported by the GPU driver.

Conda

See nvidia/cuda-toolkit and nvidia/cuda-libraries-dev

For example:

# Install the runtime only
conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit
# Install the runtime and the development tools
conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit cuda-libraries-dev cuda-nvcc

Ubuntu

CUDA Toolkit

Details

See CUDA Ubuntu Installation

# Set UBUNTU_VERSION to 2004 or 2204
UBUNTU_VERSION=$(lsb_release -sr | sed -e 's/\.//g')

# Install nvidia driver
sudo apt install nvidia-driver-545

# Add NVIDIA package repositories
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/cuda-ubuntu${UBUNTU_VERSION}.pin
sudo mv cuda-ubuntu${UBUNTU_VERSION}.pin /etc/apt/preferences.d/cuda-repository-pin-600
sudo apt-key adv --fetch-keys https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/3bf863cc.pub
sudo add-apt-repository "deb https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/ /"

# Install cuda.
sudo apt install cuda
# Reboot and check that the drivers are working with nvidia-smi
sudo reboot

# Install cudnn if needed
sudo apt install libcudnn8 libcudnn8-dev

Notes

For machine learning, use Anaconda or Docker's CUDA since different versions of TensorFlow and PyTorch require different CUDA versions.

You may need to add LD_LIBRARY_PATH=/usr/local/cuda/lib64 to your environment variables.
You can also do this in PyCharm.

GCC Versions

nvcc sometimes only supports older gcc/g++ versions.
To make it use those by default, create the following symlinks:

sudo ln -s /usr/bin/gcc-6 /usr/local/cuda/bin/gcc
sudo ln -s /usr/bin/g++-6 /usr/local/cuda/bin/g++

Alternatively, you can use -ccbin and point to your gcc:

-ccbin /usr/local/cuda/bin/gcc

References

An Even Easier Introduction To Cuda

@@ Line 1: / Line 1: @@
 ==Installation==
-===Linux===
+I suggest using conda to install cuda for version control your project.
-[https://www.pugetsystems.com/labs/hpc/How-To-Install-CUDA-10-1-on-Ubuntu-19-04-1405/#Step3)InstallCUDA\ Reference]
-* Install the latest nvidia drivers from the standard repo, e.g. <code>nvidia-drivers-450</code><br>
+Note that <code>nvidia-smi</code> lists the maximum CUDA version supported by the GPU driver, not the installed version of CUDA.<br>
-* Install [https://developer.nvidia.com/cuda-toolkit Cuda Toolkit] separately without the drivers.<br>
+You can have a different version of CUDA installed in each conda environment, independently of the version supported by the GPU driver.
-** Use one of the deb install options.
-* You may also want to install the following:
+===Conda===
-** [https://developer.nvidia.com/rdp/cudnn-download cuDnn]<br>
+See [https://anaconda.org/nvidia/cuda-toolkit nvidia/cuda-toolkit] and [https://anaconda.org/nvidia/cuda-libraries-dev nvidia/cuda-libraries-dev]
-** TensorRT
+For example:
+<syntaxhighlight lang="bash">
+# Install the runtime only
+conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit
+# Install the runtime and the development tools
+conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit cuda-libraries-dev cuda-nvcc
+</syntaxhighlight>
+===Ubuntu===
+[https://developer.nvidia.com/cuda-toolkit CUDA Toolkit]
+{{hidden | Details |
+See [https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html#ubuntu-installation CUDA Ubuntu Installation]
+<syntaxhighlight lang="bash">
+# Set UBUNTU_VERSION to 2004 or 2204
+UBUNTU_VERSION=$(lsb_release -sr | sed -e 's/\.//g')
+# Install nvidia driver
+sudo apt install nvidia-driver-545
-;Adapted from tensorflow
-<pre>
 # Add NVIDIA package repositories
-wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/cuda-repo-ubuntu1804_10.1.243-1_amd64.deb
+wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/cuda-ubuntu${UBUNTU_VERSION}.pin
-sudo apt-key adv --fetch-keys https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/7fa2af80.pub
+sudo mv cuda-ubuntu${UBUNTU_VERSION}.pin /etc/apt/preferences.d/cuda-repository-pin-600
-sudo dpkg -i cuda-repo-ubuntu1804_10.1.243-1_amd64.deb
+sudo apt-key adv --fetch-keys https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/3bf863cc.pub
-sudo apt update
+sudo add-apt-repository "deb https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/ /"
-wget http://developer.download.nvidia.com/compute/machine-learning/repos/ubuntu1804/x86_64/nvidia-machine-learning-repo-ubuntu1804_1.0.0-1_amd64.deb
-sudo apt install ./nvidia-machine-learning-repo-ubuntu1804_1.0.0-1_amd64.deb
-sudo apt update
-# Install NVIDIA driver
+# Install cuda.
-sudo apt install -y nvidia-driver-450
+sudo apt install cuda
-# Reboot. Check that GPUs are visible using the command: nvidia-smi
+# Reboot and check that the drivers are working with nvidia-smi
+sudo reboot
-# Install development and runtime libraries (~4GB)
+# Install cudnn if needed
-sudo apt-get install --no-install-recommends \
+sudo apt install libcudnn8 libcudnn8-dev
-    cuda-10-1 \
+</syntaxhighlight>
-    libcudnn7=7.6.4.38-1+cuda10.1  \
-    libcudnn7-dev=7.6.4.38-1+cuda10.1
-# Install TensorRT. Requires that libcudnn7 is installed above.
+;Notes
-sudo apt-get install -y --no-install-recommends libnvinfer6=6.0.1-1+cuda10.1 \
+* For machine learning, use Anaconda or Docker's CUDA since different versions of TensorFlow and PyTorch require different CUDA versions.
-    libnvinfer-dev=6.0.1-1+cuda10.1 \
-    libnvinfer-plugin6=6.0.1-1+cuda10.1
-</pre>
-For tensorflow and pytorch, you may need to add <code>LD_LIBRARY_PATH=/usr/local/cuda/lib64</code> to your environment variables.<br>
+You may need to add <code>LD_LIBRARY_PATH&#x3D;/usr/local/cuda/lib64</code> to your environment variables.<br>
 You can also do this in PyCharm.<br>
 [[File:Pycharm LD LIBRARY PATH config.png| 200x200px]]
 [[File:Pycharm LD LIBRARY PATH console config.png| 200x200px]]
-==GCC Versions==
+}}
+===GCC Versions===
 <code>nvcc</code> sometimes only supports older gcc/g++ versions.
 To make it use those by default, create the following symlinks:
@@ Line 49: / Line 60: @@
 * <code>sudo ln -s /usr/bin/gcc-6 /usr/local/cuda/bin/gcc</code>
 * <code>sudo ln -s /usr/bin/g++-6 /usr/local/cuda/bin/g++</code>
+Alternatively, you can use <code>-ccbin</code> and point to your gcc:
+<pre>
+-ccbin /usr/local/cuda/bin/gcc
+</pre>
 ==References==
 * [https://devblogs.nvidia.com/even-easier-introduction-cuda/ An Even Easier Introduction To Cuda]
+[[Category:Programming languages]]
+[[Category:GPU Programming languages]]