Jimver / cuda-toolkit
GitHub Action to install CUDA
☆175Updated this week
Alternatives and similar repositories for cuda-toolkit
Users who are interested in cuda-toolkit are comparing it to the libraries listed below.
- ☆58Updated 9 months ago
- An example combining scikit-build and pybind11☆129Updated this week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆388Updated last week
- A next generation Python CMake adaptor and Python API for plugins☆321Updated this week
- CUDA Kernel Benchmarking Library☆650Updated last week
- The CUDA target for Numba☆128Updated this week
- NVIDIA Math Libraries for the Python Ecosystem☆318Updated 2 months ago
- manylinux docker images with CUDA Toolkit☆13Updated 2 weeks ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆536Updated last week
- A nanobind example project☆106Updated last month
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆47Updated last week
- ☆538Updated last week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆331Updated this week
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- CUDA kernel author's tools☆111Updated 3 years ago
- A Visual Studio Code extension for building and debugging CUDA applications.☆81Updated 10 months ago
- cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples on how to use it☆568Updated last week
- ☆230Updated 3 weeks ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆268Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆120Updated this week
- Thin, unified, C++-flavored wrappers for the CUDA APIs☆840Updated last week
- Pybind11 tool for making docstrings from C++ comments☆40Updated last year
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 4 months ago
- Reusable software components for ROCm developers☆84Updated this week
- ROCm BLAS marshalling library☆142Updated this week
- Python SYCL bindings and SYCL-based Python Array API library☆112Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code☆585Updated this week
- ☆35Updated this week
- AMD’s C++ library for accelerating tensor primitives☆41Updated this week
- RAPIDS Memory Manager☆586Updated this week