CUDA: Difference between revisions

From David's Wiki
 
(34 intermediate revisions by the same user not shown)
Line 1: Line 1:


==Installation==
==Installation==
===Linux===
I suggest using conda to install cuda for version control your project.
[https://www.pugetsystems.com/labs/hpc/How-To-Install-CUDA-10-1-on-Ubuntu-19-04-1405/#Step3)InstallCUDA\ Reference]


* Install the latest nvidia drivers from the standard repo, e.g. <code>nvidia-drivers-450</code><br>
Note that <code>nvidia-smi</code> lists the maximum CUDA version supported by the GPU driver, not the installed version of CUDA.<br>
* Install [https://developer.nvidia.com/cuda-toolkit Cuda Toolkit] separately without the drivers.<br>
You can have a different version of CUDA installed in each conda environment, independently of the version supported by the GPU driver.
** Use one of the deb install options.
 
* You may also want to install the following:
===Conda===
** [https://developer.nvidia.com/rdp/cudnn-download cuDnn]<br>
See [https://anaconda.org/nvidia/cuda-toolkit nvidia/cuda-toolkit] and [https://anaconda.org/nvidia/cuda-libraries-dev nvidia/cuda-libraries-dev]
** TensorRT
 
For example:
<syntaxhighlight lang="bash">
# Install the runtime only
conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit
# Install the runtime and the development tools
conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit cuda-libraries-dev cuda-nvcc
</syntaxhighlight>
 
===Ubuntu===
[https://developer.nvidia.com/cuda-toolkit CUDA Toolkit]
 
{{hidden | Details |
See [https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html#ubuntu-installation CUDA Ubuntu Installation]
<syntaxhighlight lang="bash">
# Set UBUNTU_VERSION to 2004 or 2204
UBUNTU_VERSION=$(lsb_release -sr | sed -e 's/\.//g')
 
# Install nvidia driver
sudo apt install nvidia-driver-545


;Adapted from tensorflow
<pre>
# Add NVIDIA package repositories
# Add NVIDIA package repositories
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/cuda-repo-ubuntu1804_10.1.243-1_amd64.deb
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/cuda-ubuntu${UBUNTU_VERSION}.pin
sudo apt-key adv --fetch-keys https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/7fa2af80.pub
sudo mv cuda-ubuntu${UBUNTU_VERSION}.pin /etc/apt/preferences.d/cuda-repository-pin-600
sudo dpkg -i cuda-repo-ubuntu1804_10.1.243-1_amd64.deb
sudo apt-key adv --fetch-keys https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/3bf863cc.pub
sudo apt update
sudo add-apt-repository "deb https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/ /"
wget http://developer.download.nvidia.com/compute/machine-learning/repos/ubuntu1804/x86_64/nvidia-machine-learning-repo-ubuntu1804_1.0.0-1_amd64.deb
sudo apt install ./nvidia-machine-learning-repo-ubuntu1804_1.0.0-1_amd64.deb
sudo apt update


# Install NVIDIA driver
# Install cuda.
sudo apt install -y nvidia-driver-450
sudo apt install cuda
# Reboot. Check that GPUs are visible using the command: nvidia-smi
# Reboot and check that the drivers are working with nvidia-smi
sudo reboot


# Install development and runtime libraries (~4GB)
# Install cudnn if needed
sudo apt-get install --no-install-recommends \
sudo apt install libcudnn8 libcudnn8-dev
    cuda-10-1 \
</syntaxhighlight>
    libcudnn7=7.6.4.38-1+cuda10.1  \
    libcudnn7-dev=7.6.4.38-1+cuda10.1


# Install TensorRT. Requires that libcudnn7 is installed above.
;Notes
sudo apt-get install -y --no-install-recommends libnvinfer6=6.0.1-1+cuda10.1 \
* For machine learning, use Anaconda or Docker's CUDA since different versions of TensorFlow and PyTorch require different CUDA versions.
    libnvinfer-dev=6.0.1-1+cuda10.1 \
    libnvinfer-plugin6=6.0.1-1+cuda10.1
</pre>


For tensorflow and pytorch, you may need to add <code>LD_LIBRARY_PATH=/usr/local/cuda/lib64</code> to your environment variables.<br>
You may need to add <code>LD_LIBRARY_PATH&#x3D;/usr/local/cuda/lib64</code> to your environment variables.<br>
You can also do this in PyCharm.<br>
You can also do this in PyCharm.<br>
[[File:Pycharm LD LIBRARY PATH config.png| 200x200px]]
[[File:Pycharm LD LIBRARY PATH config.png| 200x200px]]
[[File:Pycharm LD LIBRARY PATH console config.png| 200x200px]]
[[File:Pycharm LD LIBRARY PATH console config.png| 200x200px]]


==GCC Versions==
}}
 
===GCC Versions===
<code>nvcc</code> sometimes only supports older gcc/g++ versions.   
<code>nvcc</code> sometimes only supports older gcc/g++ versions.   
To make it use those by default, create the following symlinks:
To make it use those by default, create the following symlinks:
Line 49: Line 60:
* <code>sudo ln -s /usr/bin/gcc-6 /usr/local/cuda/bin/gcc</code>
* <code>sudo ln -s /usr/bin/gcc-6 /usr/local/cuda/bin/gcc</code>
* <code>sudo ln -s /usr/bin/g++-6 /usr/local/cuda/bin/g++</code>
* <code>sudo ln -s /usr/bin/g++-6 /usr/local/cuda/bin/g++</code>
Alternatively, you can use <code>-ccbin</code> and point to your gcc:
<pre>
-ccbin /usr/local/cuda/bin/gcc
</pre>


==References==
==References==
* [https://devblogs.nvidia.com/even-easier-introduction-cuda/ An Even Easier Introduction To Cuda]
* [https://devblogs.nvidia.com/even-easier-introduction-cuda/ An Even Easier Introduction To Cuda]
[[Category:Programming languages]]
[[Category:GPU Programming languages]]

Latest revision as of 16:15, 23 April 2024

Installation

I suggest using conda to install cuda for version control your project.

Note that nvidia-smi lists the maximum CUDA version supported by the GPU driver, not the installed version of CUDA.
You can have a different version of CUDA installed in each conda environment, independently of the version supported by the GPU driver.

Conda

See nvidia/cuda-toolkit and nvidia/cuda-libraries-dev

For example:

# Install the runtime only
conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit
# Install the runtime and the development tools
conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit cuda-libraries-dev cuda-nvcc

Ubuntu

CUDA Toolkit

Details

See CUDA Ubuntu Installation

# Set UBUNTU_VERSION to 2004 or 2204
UBUNTU_VERSION=$(lsb_release -sr | sed -e 's/\.//g')

# Install nvidia driver
sudo apt install nvidia-driver-545

# Add NVIDIA package repositories
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/cuda-ubuntu${UBUNTU_VERSION}.pin
sudo mv cuda-ubuntu${UBUNTU_VERSION}.pin /etc/apt/preferences.d/cuda-repository-pin-600
sudo apt-key adv --fetch-keys https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/3bf863cc.pub
sudo add-apt-repository "deb https://developer.download.nvidia.com/compute/cuda/repos/ubuntu${UBUNTU_VERSION}/x86_64/ /"

# Install cuda.
sudo apt install cuda
# Reboot and check that the drivers are working with nvidia-smi
sudo reboot

# Install cudnn if needed
sudo apt install libcudnn8 libcudnn8-dev
Notes
  • For machine learning, use Anaconda or Docker's CUDA since different versions of TensorFlow and PyTorch require different CUDA versions.

You may need to add LD_LIBRARY_PATH=/usr/local/cuda/lib64 to your environment variables.
You can also do this in PyCharm.

GCC Versions

nvcc sometimes only supports older gcc/g++ versions.
To make it use those by default, create the following symlinks:

  • sudo ln -s /usr/bin/gcc-6 /usr/local/cuda/bin/gcc
  • sudo ln -s /usr/bin/g++-6 /usr/local/cuda/bin/g++

Alternatively, you can use -ccbin and point to your gcc:

-ccbin /usr/local/cuda/bin/gcc

References