Jimver / cuda-toolkit
GitHub Action to install CUDA
☆186 Updated last week
Alternatives and similar repositories for cuda-toolkit
Users who are interested in cuda-toolkit compare it to the repositories listed below
- ☆59 Updated last year
- The CUDA target for Numba ☆184 Updated last week
- An example combining scikit-build and pybind11 ☆136 Updated this week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… ☆449 Updated 3 weeks ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆50 Updated last week
- No-GIL Python environment featuring NVIDIA Deep Learning libraries. ☆63 Updated 5 months ago
- NVIDIA Math Libraries for the Python Ecosystem ☆350 Updated last week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆296 Updated last week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆354 Updated this week
- A Visual Studio Code extension for building and debugging CUDA applications. ☆89 Updated this week
- CUDA kernel author's tools ☆113 Updated 3 years ago
- Python SYCL bindings and SYCL-based Python Array API library ☆117 Updated this week
- A nanobind example project ☆112 Updated 5 months ago
- GPUOcelot: A dynamic compilation framework for PTX ☆206 Updated 7 months ago
- CUDA Kernel Benchmarking Library ☆721 Updated last week
- LLM training in simple, raw C/CUDA ☆104 Updated last year
- Data Parallel Extension for NumPy ☆111 Updated this week
- The Foundation for All Legate Libraries ☆227 Updated this week
- A next generation Python CMake adaptor and Python API for plugins ☆381 Updated last week
- ☆578 Updated last week
- Generate simple index ranges in C++ and CUDA C++ ☆39 Updated 2 years ago
- ☆269 Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code ☆619 Updated this week
- ☆38 Updated this week
- Kernel Tuner ☆360 Updated this week
- ☆51 Updated 3 months ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆554 Updated 3 weeks ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆55 Updated 5 months ago
- RAPIDS Memory Manager ☆616 Updated last week
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast! ☆106 Updated last month