CUDA kernel author's tools
☆116 · Apr 24, 2022 · Updated 3 years ago
Alternatives and similar repositories for cuda-kat
Users that are interested in cuda-kat are comparing it to the libraries listed below.
- Thin, unified, C++-flavored wrappers for the CUDA APIs ☆877 · Feb 16, 2026 · Updated last month
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆569 · Sep 15, 2025 · Updated 6 months ago
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line ☆24 · Updated this week
- Generate simple index ranges in C++ and CUDA C++ ☆39 · Jun 14, 2023 · Updated 2 years ago
- CUDA Kernel Benchmarking Library ☆838 · Updated this week
- ☆14 · Apr 26, 2022 · Updated 3 years ago
- Simple starter CMake project that uses NVBench. ☆16 · May 6, 2025 · Updated 10 months ago
- moderngpu algorithms for C++ shaders ☆16 · Mar 3, 2021 · Updated 5 years ago
- A C++ allocator based on cudaMallocManaged ☆23 · Nov 19, 2018 · Updated 7 years ago
- Volume Manipulation Library ☆17 · Jul 13, 2023 · Updated 2 years ago
- Patterns and behaviors for GPU computing ☆1,766 · Jan 17, 2026 · Updated 2 months ago
- ☆627 · Updated this week
- Launching collective tasks in bulk ☆37 · Oct 4, 2019 · Updated 6 years ago
- ☆19 · Aug 22, 2019 · Updated 6 years ago
- Range-based for loops to iterate over a range of numbers or values ☆34 · Nov 23, 2016 · Updated 9 years ago
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl ☆2,306 · Feb 7, 2024 · Updated 2 years ago
- Development/testing repo for SWIG+Fortran ☆11 · Mar 25, 2018 · Updated 8 years ago
- CUDA Data Parallel Primitives Library ☆438 · Nov 9, 2018 · Updated 7 years ago
- A fast and highly scalable GPU dynamic memory allocator ☆112 · Mar 11, 2015 · Updated 11 years ago
- Public proposals, extensions, information and materials from the SYCL working group ☆15 · Jan 26, 2024 · Updated 2 years ago
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade… ☆616 · Sep 11, 2024 · Updated last year
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl ☆1,825 · Oct 9, 2023 · Updated 2 years ago
- A 128 bit unsigned integer class for CUDA ☆46 · Jan 3, 2025 · Updated last year
- Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++. ☆347 · Apr 14, 2022 · Updated 3 years ago
- High-level C++ for Accelerator Clusters ☆155 · Updated this week
- Kernel Tuner ☆389 · Updated this week
- Helper C++ classes to quickly preintegrate IMU measurements between SLAM keyframes ☆16 · Feb 23, 2026 · Updated last month
- mallocMC: Memory Allocator for Many Core Architectures ☆58 · Mar 20, 2026 · Updated last week
- A Library for fast Hash Tables on GPUs ☆132 · Oct 14, 2025 · Updated 5 months ago
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH). ☆19 · Feb 24, 2018 · Updated 8 years ago
- High-order Remap Miniapp ☆22 · Mar 3, 2026 · Updated 3 weeks ago
- A warp-oriented dynamic hash table for GPUs ☆76 · Jan 19, 2024 · Updated 2 years ago
- GPUDirect Async suite ☆17 · Dec 5, 2018 · Updated 7 years ago
- SYCL materials for ENCCS workshop ☆25 · Apr 25, 2023 · Updated 2 years ago
- CUDA Core Compute Libraries ☆2,240 · Updated this week
- The Power of LaTeX, the Style of Markdown. ☆12 · Sep 5, 2024 · Updated last year
- GPU implementation of classical molecular dynamics proxy application. ☆31 · Jan 30, 2017 · Updated 9 years ago
- A reference implementation of std::simd, providing data parallel types in the C++ standard ☆14 · Mar 9, 2020 · Updated 6 years ago
- ☆16 · Nov 27, 2016 · Updated 9 years ago