AnswerDotAI / gpu.cppLinks
A lightweight library for portable low-level GPU computation using WebGPU.
☆3,941Updated 3 months ago
Alternatives and similar repositories for gpu.cpp
Users that are interested in gpu.cpp are comparing it to the libraries listed below
Sorting:
- ☆1,282Updated last year
- Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, an…☆1,649Updated this week
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,721Updated last week
- ☆1,074Updated 8 months ago
- Implementation for MatMul-free LM.☆3,053Updated 2 months ago
- CUDA Core Compute Libraries☆2,162Updated this week
- Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception ha…☆1,900Updated last month
- Distributed LLM and StableDiffusion inference for mobile, desktop and server.☆2,901Updated last year
- Performance-portable, length-agnostic SIMD with runtime dispatch☆5,301Updated last week
- Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++☆5,327Updated this week
- LLM training in simple, raw C/CUDA☆28,763Updated 7 months ago
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆3,956Updated this week
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆2,445Updated last week
- Tile primitives for speedy kernels☆3,120Updated this week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,591Updated this week
- JSON for Classic C++☆769Updated 3 months ago
- Native WebGPU implementation. Mirror of https://dawn.googlesource.com/dawn. File bugs here: https://crbug.com/dawn/new☆878Updated last week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,624Updated 4 months ago
- nanobind: tiny and efficient C++/Python bindings☆3,326Updated this week
- A minimal GPU design in Verilog to learn how GPUs work from the ground up☆11,126Updated last year
- An efficient C++20 GPU numerical computing library with Python-like syntax☆1,402Updated this week
- Intermediate Graphics Library (IGL) is a cross-platform library that commands the GPU. It provides a single low-level cross-platform inte…☆3,179Updated this week
- A modern model graph visualizer and debugger☆1,379Updated this week
- ☆1,282Updated 2 years ago
- NanoGPT (124M) in 2 minutes☆4,515Updated last week
- Fast, flexible LLM inference☆6,449Updated last week
- An Extensible Deep Learning Library☆2,317Updated last week
- Implementation of C++ standard libraries in C☆1,211Updated 6 months ago
- UNet diffusion model in pure CUDA☆661Updated last year
- CUDA Python: Performance meets Productivity☆3,156Updated this week