Const-me / CgmlLinks
GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.
☆58Updated last year
Alternatives and similar repositories for Cgml
Users that are interested in Cgml are comparing it to the libraries listed below
Sorting:
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Richard is gaining power☆194Updated 3 months ago
- A JPEG Image Compression Service using Part Homomorphic Encryption.☆31Updated 7 months ago
- A CLI to manage install and configure llama inference implemenation in multiple languages☆67Updated last year
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or in the GPU via CUDA.☆75Updated 2 years ago
- Wang Yi's GPT solution☆142Updated last year
- A playground to make it easy to try crazy things☆33Updated 3 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆48Updated 6 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆124Updated 5 months ago
- Code sample showing how to run and benchmark models on Qualcomm's Window PCs☆101Updated last year
- Mistral7B playing DOOM☆137Updated last year
- throwaway GPT inference☆140Updated last year
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆18Updated last month
- ☆196Updated 5 months ago
- ☆189Updated last year
- Algebraic enhancements for GEMM & AI accelerators☆280Updated 7 months ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆104Updated last year
- A GPU Accelerated Binary Vector Store☆47Updated 7 months ago
- Visual Basic IDE for Native Linux Dotnet Development☆109Updated last month
- An implementation of bucketMul LLM inference☆223Updated last year
- Docker-based inference engine for AMD GPUs☆230Updated last year
- Editor with LLM generation tree exploration☆77Updated 7 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆198Updated 7 months ago
- ☆61Updated last year
- A tiny version of GPT fully implemented in Python with zero dependencies☆74Updated 10 months ago
- GGUF implementation in C as a library and a tools CLI program☆291Updated last month
- A graphics engine that executes entirely on the CPU☆223Updated last year
- Heirarchical Navigable Small Worlds☆101Updated 2 months ago
- LLaVA server (llama.cpp).☆182Updated last year
- ☆163Updated last year