Jimver / cuda-toolkitLinks
GitHub Action to install CUDA
☆193Updated 3 weeks ago
Alternatives and similar repositories for cuda-toolkit
Users that are interested in cuda-toolkit are comparing it to the libraries listed below
Sorting:
- ☆59Updated 2 months ago
- The CUDA target for Numba☆222Updated this week
- An example combining scikit-build and pybind11☆142Updated last week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆478Updated this week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆52Updated this week
- A nanobind example project☆114Updated 2 weeks ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆363Updated this week
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆69Updated 7 months ago
- A next generation Python CMake adaptor and Python API for plugins☆423Updated last week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆566Updated 2 months ago
- A Visual Studio Code extension for building and debugging CUDA applications.☆94Updated this week
- CUDA Kernel Benchmarking Library☆773Updated this week
- The Foundation for All Legate Libraries☆233Updated this week
- Generate stubs for python modules☆332Updated 4 months ago
- CUDA kernel author's tools☆114Updated 3 years ago
- RAPIDS Memory Manager☆663Updated last week
- Generate simple index ranges in C++ and CUDA C++☆39Updated 2 years ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆149Updated 3 years ago
- ☆273Updated this week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆315Updated last week
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade…☆605Updated last year
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!☆112Updated 4 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆147Updated this week
- Python SYCL bindings and SYCL-based Python Array API library☆119Updated this week
- Pybind11 tool for making docstrings from C++ comments☆44Updated this week
- LLM training in simple, raw C/CUDA☆108Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆124Updated last week
- ☆598Updated this week
- GPUOcelot: A dynamic compilation framework for PTX☆217Updated 10 months ago
- Data Parallel Extension for NumPy☆118Updated last week