Jimver / cuda-toolkit
GitHub Action to install CUDA
☆154 · Updated last week
Related projects
Alternatives and complementary repositories for cuda-toolkit
- ☆56 · Updated 2 months ago
- A nanobind example project ☆91 · Updated 2 weeks ago
- An example combining scikit-build and pybind11 ☆112 · Updated this week
- A next generation Python CMake adaptor and Python API for plugins ☆245 · Updated this week
- Generate simple index ranges in C++ and CUDA C++ ☆39 · Updated last year
- NVIDIA Math Libraries for the Python Ecosystem ☆207 · Updated this week
- CUDA Kernel Benchmarking Library ☆519 · Updated this week
- Generate stubs for python modules ☆242 · Updated 5 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆271 · Updated this week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… ☆308 · Updated this week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆27 · Updated this week
- Training material for Nsight developer tools ☆129 · Updated 3 months ago
- Pybind11 tool for making docstrings from C++ comments ☆39 · Updated 7 months ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆518 · Updated 6 months ago
- An implementation of BLAS using the SYCL open standard. ☆259 · Updated 3 weeks ago
- GPUOcelot: A dynamic compilation framework for PTX ☆147 · Updated 2 months ago
- ☆36 · Updated this week
- ☆204 · Updated 3 weeks ago
- CUDA kernel author's tools ☆109 · Updated 2 years ago
- The Foundation for All Legate Libraries ☆193 · Updated last week
- ☆486 · Updated this week
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i… ☆176 · Updated this week
- Sample projects demonstrating use of scikit-build ☆75 · Updated this week
- ☆20 · Updated 5 years ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆100 · Updated this week
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade… ☆561 · Updated 2 months ago
- Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SYCL - all it takes to sum a lot of numbers fast! ☆73 · Updated 6 months ago
- Template for starting a CUDA/C++ project using CMake, with GitHub Actions for CI ☆29 · Updated last year
- An extension library of WMMA API (Tensor Core API) ☆84 · Updated 4 months ago
- C++ library for reading and writing of numpy's .npy files ☆373 · Updated last month