CbC/CbC_llvm: docs/CompileCudaWithLLVM.rst comparison

comparison docs/CompileCudaWithLLVM.rst @ 148:63bd29f05246

merged

author	Shinji KONO <kono@ie.u-ryukyu.ac.jp>
date	Wed, 14 Aug 2019 19:46:37 +0900
parents	c2174574ed3a
children

comparison

equal deleted inserted replaced

-:3fc4d5c3e21e
+:63bd29f05246
 ===================
 Prerequisites
 -------------
-CUDA is supported in llvm 3.9, but it's still in active development, so we
+CUDA is supported since llvm 3.9. Current release of clang (7.0.0) supports CUDA
-recommend you `compile clang/LLVM from HEAD
+7.0 through 9.2. If you need support for CUDA 10, you will need to use clang
-<http://llvm.org/docs/GettingStarted.html>`_.
+built from r342924 or newer.
-Before you build CUDA code, you'll need to have installed the appropriate
+Before you build CUDA code, you'll need to have installed the appropriate driver
-driver for your nvidia GPU and the CUDA SDK.  See `NVIDIA's CUDA installation
+for your nvidia GPU and the CUDA SDK.  See `NVIDIA's CUDA installation guide
-guide <https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html>`_
+<https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html>`_ for
-for details.  Note that clang `does not support
+details.  Note that clang `does not support
-<https://llvm.org/bugs/show_bug.cgi?id=26966>`_ the CUDA toolkit as installed
+<https://llvm.org/bugs/show_bug.cgi?id=26966>`_ the CUDA toolkit as installed by
-by many Linux package managers; you probably need to install nvidia's package.
+many Linux package managers; you probably need to install CUDA in a single
+directory from NVIDIA's package.
-You will need CUDA 7.0, 7.5, or 8.0 to compile with clang.
+CUDA compilation is supported on Linux. Compilation on MacOS and Windows may or
-CUDA compilation is supported on Linux, on MacOS as of 2016-11-18, and on
+may not work and currently have no maintainers. Compilation with CUDA-9.x is
-Windows as of 2017-01-05.
+`currently broken on Windows <https://bugs.llvm.org/show_bug.cgi?id=38811>`_.
 Invoking clang
 --------------
 Invoking clang for CUDA compilation works similarly to compiling regular C++.
 Typically, ``/usr/local/cuda``.
 Pass e.g. ``-L/usr/local/cuda/lib64`` if compiling in 64-bit mode; otherwise,
 pass e.g. ``-L/usr/local/cuda/lib``.  (In CUDA, the device code and host code
 always have the same pointer widths, so if you're compiling 64-bit code for
-the host, you're also compiling 64-bit code for the device.)
+the host, you're also compiling 64-bit code for the device.) Note that as of
+v10.0 CUDA SDK `no longer supports compilation of 32-bit
+applications <https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#deprecated-features>`_.
 * ``<GPU arch>`` -- the `compute capability
 <https://developer.nvidia.com/cuda-gpus>`_ of your GPU. For example, if you
 want to run your program on a GPU with compute capability of 3.5, specify
 ``--cuda-gpu-arch=sm_35``.
 You can pass ``--cuda-gpu-arch`` multiple times to compile for multiple archs.
 The `-L` and `-l` flags only need to be passed when linking.  When compiling,
 you may also need to pass ``--cuda-path=/path/to/cuda`` if you didn't install
-the CUDA SDK into ``/usr/local/cuda``, ``/usr/local/cuda-7.0``, or
+the CUDA SDK into ``/usr/local/cuda`` or ``/usr/local/cuda-X.Y``.
-``/usr/local/cuda-7.5``.
 Flags that control numerical code
 ---------------------------------
 If you're using GPUs, you probably care about making numerical code run fast.
 ``<math.h>`` and ``<cmath>``
 ----------------------------
 In clang, ``math.h`` and ``cmath`` are available and `pass
-<https://github.com/llvm-mirror/test-suite/blob/master/External/CUDA/math_h.cu>`_
+<https://github.com/llvm/llvm-test-suite/blob/master/External/CUDA/math_h.cu>`_
 `tests
-<https://github.com/llvm-mirror/test-suite/blob/master/External/CUDA/cmath.cu>`_
+<https://github.com/llvm/llvm-test-suite/blob/master/External/CUDA/cmath.cu>`_
 adapted from libc++'s test suite.
 In nvcc ``math.h`` and ``cmath`` are mostly available.  Versions of ``::foof``
 in namespace std (e.g. ``std::sinf``) are not available, and where the standard
 calls for overloads that take integral arguments, these are usually not
 | `gpucc: An Open-Source GPGPU Compiler <http://dl.acm.org/citation.cfm?id=2854041>`_
 | Jingyue Wu, Artem Belevich, Eli Bendersky, Mark Heffernan, Chris Leary, Jacques Pienaar, Bjarke Roune, Rob Springer, Xuetian Weng, Robert Hundt
 | *Proceedings of the 2016 International Symposium on Code Generation and Optimization (CGO 2016)*
 |
-| `Slides from the CGO talk <http://wujingyue.com/docs/gpucc-talk.pdf>`_
+| `Slides from the CGO talk <http://wujingyue.github.io/docs/gpucc-talk.pdf>`_
 |
-| `Tutorial given at CGO <http://wujingyue.com/docs/gpucc-tutorial.pdf>`_
+| `Tutorial given at CGO <http://wujingyue.github.io/docs/gpucc-tutorial.pdf>`_
 Obtaining Help
 ==============
 To obtain help on LLVM in general and its CUDA support, see `the LLVM

Mercurial > hg > CbC > CbC_llvm

comparison docs/CompileCudaWithLLVM.rst @ 148:63bd29f05246