Jimver / cuda-toolkit
GitHub Action to install CUDA
☆197 Updated 3 weeks ago
Alternatives and similar repositories for cuda-toolkit
Users who are interested in cuda-toolkit are comparing it to the libraries listed below.
- ☆59 Updated 3 months ago
- The CUDA target for Numba ☆242 Updated this week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆55 Updated last week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… ☆502 Updated last week
- A nanobind example project ☆115 Updated last month
- No-GIL Python environment featuring NVIDIA Deep Learning libraries. ☆69 Updated 9 months ago
- CUDA kernel author's tools ☆115 Updated 3 years ago
- A High-Throughput Parallel Lossless Compressor for Scientific Data ☆75 Updated 2 years ago
- An example combining scikit-build and pybind11 ☆141 Updated last week
- ☆279 Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆371 Updated this week
- SYCL implementation of Fused MLPs for Intel GPUs ☆50 Updated last month
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade… ☆609 Updated last year
- LLM training in simple, raw C/CUDA ☆110 Updated last year
- GPUOcelot: A dynamic compilation framework for PTX ☆219 Updated 11 months ago
- Python SYCL bindings and SYCL-based Python Array API library ☆121 Updated this week
- Generate simple index ranges in C++ and CUDA C++ ☆39 Updated 2 years ago
- ☆72 Updated this week
- CUDA Kernel Benchmarking Library ☆798 Updated 2 weeks ago
- ☆612 Updated 2 weeks ago
- A Visual Studio Code extension for building and debugging CUDA applications. ☆97 Updated 2 weeks ago
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast! ☆115 Updated 5 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆66 Updated last week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆325 Updated 2 weeks ago
- A next generation Python CMake adaptor and Python API for plugins ☆430 Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆567 Updated 4 months ago
- AMD’s C++ library for accelerating tensor primitives ☆48 Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆124 Updated this week
- manylinux docker images with CUDA Toolkit ☆17 Updated last month
- NVIDIA Math Libraries for the Python Ecosystem ☆542 Updated 2 months ago