gruai / koifish
A C++ framework for efficient training and fine-tuning of LLMs
☆27Updated last week
Alternatives and similar repositories for koifish
Users that are interested in koifish are comparing it to the libraries listed below
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆21Updated 4 months ago
- A little(lil) Language Model (LM). A tiny reproduction of LLaMA 3's model architecture.☆54Updated 8 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆47Updated 2 months ago
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆154Updated 6 months ago
- ☆109Updated 6 months ago
- ☆87Updated last month
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolcha…☆76Updated this week
- Course Project for COMP4471 on RWKV☆17Updated last year
- Cleanai (https://github.com/willmil11/cleanai), except I'm making it in C now. Fast and clean from the start this time :)☆17Updated 3 weeks ago
- Load and run Llama from safetensors files in C☆15Updated last year
- Train your own small bitnet model☆76Updated last year
- Stable Diffusion and Flux in pure C/C++☆24Updated this week
- LLM inference in C/C++☆23Updated last year
- Sherpa-onnx-tts-stt source for a Home Assistant add-on with Kroko ONNX streaming STT integration.☆23Updated last month
- 1.58-bit LLaMA model☆83Updated last year
- Inference of Mamba models in pure C☆196Updated last year
- Inference RWKV v7 in pure C.☆43Updated 3 months ago
- Experiments with BitNet inference on CPU☆55Updated last year
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆28Updated 5 months ago
- ☆62Updated 6 months ago
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆29Updated 5 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆63Updated 11 months ago
- A simple library for working with Hugging Face models.☆14Updated last year
- Thin wrapper around GGML to make life easier☆42Updated 2 months ago
- A ggml (C++) re-implementation of tortoise-tts☆193Updated last year
- Automatically quantize GGUF models☆220Updated 3 weeks ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆260Updated last year
- Lower Precision Floating Point Operations☆62Updated last week
- Efficient non-uniform quantization with GPTQ for GGUF☆57Updated 4 months ago
- Teaching AI to play the classic text adventure Zork using Large Language Models☆32Updated 3 weeks ago