Const-me / Cgml
GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.
☆53Updated last year
Alternatives and similar repositories for Cgml:
Users who are interested in Cgml are comparing it to the libraries listed below.
- A library for incremental loading of large PyTorch checkpoints☆56Updated last year
- A CLI to manage, install, and configure llama inference implementations in multiple languages☆65Updated last year
- Experiments with BitNet inference on CPU☆53Updated 10 months ago
- Agent Based Model on GPU using CUDA 12.2.1 and OpenGL 4.5 (CUDA OpenGL interop) on Windows/Linux☆70Updated 4 months ago
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or on the GPU via CUDA.☆75Updated 2 years ago
- A live multiplayer trivia game where users can bid for the subject of the next question☆27Updated 3 months ago
- The procedure and the code to run shap-e sample code locally.☆116Updated last year
- Lightweight Llama 3 8B Inference Engine in CUDA C☆45Updated this week
- Visual inference exploration & experimentation playground☆87Updated 2 months ago
- A tiny version of GPT fully implemented in Python with zero dependencies☆62Updated 2 months ago
- Editor with LLM generation tree exploration☆60Updated this week
- Algebraic enhancements for GEMM & AI accelerators☆263Updated 2 weeks ago
- Mistral7B playing DOOM☆127Updated 7 months ago
- Code sample showing how to run and benchmark models on Qualcomm's Windows PCs☆95Updated 4 months ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆203Updated 5 months ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆104Updated last year
- Richard is gaining power☆185Updated 2 months ago
- Port of Suno AI's Bark in C/C++ for fast inference☆55Updated 10 months ago
- Simple LLM inference server☆20Updated 8 months ago
- Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation☆257Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆53Updated 11 months ago
- 100k real (+100k random) galaxies from a sector. Visualized with Raylib.☆87Updated 4 months ago
- An implementation of bucketMul LLM inference☆215Updated 7 months ago
- Wang Yi's GPT solution☆142Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆66Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated 9 months ago
- A fork of llama3.c used to do some R&D on inferencing☆18Updated last month