Jimver / cuda-toolkitLinks
GitHub Action to install CUDA
☆188Updated last week
Alternatives and similar repositories for cuda-toolkit
Users that are interested in cuda-toolkit are comparing it to the libraries listed below
Sorting:
- ☆59Updated 2 weeks ago
- The CUDA target for Numba☆193Updated this week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆454Updated last week
- An example combining scikit-build and pybind11☆136Updated last week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆50Updated last week
- A nanobind example project☆112Updated 5 months ago
- CUDA Kernel Benchmarking Library☆733Updated this week
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade…☆595Updated last year
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆356Updated this week
- A next generation Python CMake adaptor and Python API for plugins☆391Updated this week
- The Foundation for All Legate Libraries☆228Updated this week
- Python SYCL bindings and SYCL-based Python Array API library☆117Updated this week
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆64Updated 5 months ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated 2 years ago
- HIPIFY: Convert CUDA to Portable C++ Code☆625Updated this week
- ☆588Updated last week
- ☆271Updated this week
- NVIDIA Math Libraries for the Python Ecosystem☆507Updated last month
- GPUOcelot: A dynamic compilation framework for PTX☆209Updated 7 months ago
- A Visual Studio Code extension for building and debugging CUDA applications.☆90Updated last week
- Data Parallel Extension for NumPy☆114Updated this week
- LLM training in simple, raw C/CUDA☆105Updated last year
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!☆108Updated 2 months ago
- RAPIDS Memory Manager☆623Updated last week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆556Updated 3 weeks ago
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme☆87Updated 6 months ago
- C++ library for reading and writing of numpy's .npy files☆417Updated last year
- ☆77Updated this week
- CUDA kernel author's tools☆113Updated 3 years ago
- A High-Throughput Parallel Lossless Compressor for Scientific Data☆71Updated 2 years ago