Const-me / CgmlLinks
GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.
☆57Updated last year
Alternatives and similar repositories for Cgml
Users that are interested in Cgml are comparing it to the libraries listed below
Sorting:
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or in the GPU via CUDA.☆74Updated 3 years ago
- Richard is gaining power☆199Updated 6 months ago
- A JPEG Image Compression Service using Part Homomorphic Encryption.☆31Updated 10 months ago
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆21Updated 4 months ago
- A CLI to manage install and configure llama inference implemenation in multiple languages☆65Updated 2 years ago
- Code sample showing how to run and benchmark models on Qualcomm's Window PCs☆104Updated last year
- Algebraic enhancements for GEMM & AI accelerators☆286Updated 10 months ago
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.☆22Updated last month
- Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation☆267Updated 2 years ago
- throwaway GPT inference☆141Updated last year
- Wang Yi's GPT solution☆142Updated 2 years ago
- Heirarchical Navigable Small Worlds☆101Updated 5 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆53Updated 9 months ago
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- Mistral7B playing DOOM☆138Updated last year
- A GPU Accelerated Binary Vector Store☆47Updated 10 months ago
- An implementation of bucketMul LLM inference☆223Updated last year
- ☆191Updated last year
- ☆199Updated 8 months ago
- A playground to make it easy to try crazy things☆33Updated last month
- Experiments with BitNet inference on CPU☆55Updated last year
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆50Updated 2 years ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆254Updated 2 years ago
- ☆166Updated last year
- ☆62Updated last year
- A graphics engine that executes entirely on the CPU☆225Updated last year
- A live multiplayer trivia game where users can bid for the subject of the next question☆29Updated 9 months ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆209Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆67Updated 2 years ago