Jimver / cuda-toolkitLinks
GitHub Action to install CUDA
☆176Updated last week
Alternatives and similar repositories for cuda-toolkit
Users that are interested in cuda-toolkit are comparing it to the libraries listed below
Sorting:
- ☆58Updated 9 months ago
- The CUDA target for Numba☆138Updated last week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆47Updated this week
- An example combining scikit-build and pybind11☆129Updated last week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆396Updated 3 weeks ago
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆61Updated 2 months ago
- A nanobind example project☆107Updated 2 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆339Updated this week
- A next generation Python CMake adaptor and Python API for plugins☆337Updated last week
- CUDA Kernel Benchmarking Library☆669Updated last week
- NVIDIA Math Libraries for the Python Ecosystem☆330Updated 2 weeks ago
- LLM training in simple, raw C/CUDA☆99Updated last year
- Training material for Nsight developer tools☆159Updated 10 months ago
- CUDA kernel author's tools☆111Updated 3 years ago
- ☆35Updated this week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆276Updated 3 weeks ago
- KvikIO - High Performance File IO☆213Updated last week
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme☆73Updated 3 months ago
- This repository contains examples CUDA usage in Cython code.☆24Updated 3 years ago
- ☆261Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆119Updated this week
- High-Performance SGEMM on CUDA devices☆95Updated 5 months ago
- ☆543Updated this week
- Generate simple index ranges in C++ and CUDA C++☆39Updated 2 years ago
- SYCL implementation of Fused MLPs for Intel GPUs☆47Updated 3 weeks ago
- Example to build PyTorch CUDA extension using CMake (with pybind11 and scikit-build)☆11Updated 5 years ago
- A Visual Studio Code extension for building and debugging CUDA applications.☆82Updated 10 months ago
- Gpu benchmark☆63Updated 4 months ago
- TritonParse is a tool designed to help developers analyze and debug Triton kernels by visualizing the compilation process and source code…☆93Updated last week
- Generate stubs for python modules☆294Updated last month