pierrel55 / llama_st
Load and run Llama from safetensors files in C
☆12Updated 9 months ago
Alternatives and similar repositories for llama_st
Users who are interested in llama_st are comparing it to the libraries listed below.
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41Updated last year
- ☆113Updated last month
- entropix style sampling + GUI☆26Updated 9 months ago
- Video+code lecture on building nanoGPT from scratch☆69Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 8 months ago
- Tiny Llama model trained to play chess☆24Updated 2 weeks ago
- ☆57Updated last month
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆20Updated this week
- ☆134Updated 11 months ago
- ☆51Updated last year
- ☆9Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆20Updated last week
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆98Updated last month
- ☆93Updated last month
- 1.58-bit LLaMa model☆81Updated last year
- Lightweight continuous batching with OpenAI compatibility using HuggingFace Transformers, including T5 and Whisper.☆26Updated 4 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Updated last year
- Running Microsoft's BitNet via Electron, React & Astro☆43Updated 2 months ago
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…☆46Updated 3 months ago
- A pure and fast NumPy implementation of Mamba with cache support.☆17Updated last year
- ☆35Updated last week
- Course Project for COMP4471 on RWKV☆17Updated last year
- Experiments with BitNet inference on CPU☆54Updated last year
- run ollama & gguf easily with a single command☆52Updated last year
- ☆44Updated last month
- Inference RWKV v7 in pure C.☆37Updated 2 weeks ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 7 months ago
- 5X faster 60% less memory QLoRA finetuning☆21Updated last year
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…☆10Updated 9 months ago