Formatron empowers everyone to control the format of language models' output with minimal overhead.
☆236Jun 7, 2025Updated 11 months ago
Alternatives and similar repositories for formatron
Users that are interested in formatron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A high-performance constrained decoding engine based on context free grammar in Rust☆59May 22, 2025Updated last year
- ☆11Jul 23, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Enforce the output format (JSON Schema, Regex etc) of a language model☆2,012Apr 4, 2026Updated last month
- A highly customizable, full scale web backend for web-rwkv, built on axum with websocket protocol.☆28Apr 15, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆136Updated this week
- Large-scale LLM inference engine☆1,736May 8, 2026Updated 2 weeks ago
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆35Mar 9, 2026Updated 2 months ago
- A collection of various LLM sampling methods implemented in pure Pytorch☆30Dec 9, 2024Updated last year
- run ollama & gguf easily with a single command☆52May 15, 2024Updated 2 years ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆89Sep 22, 2024Updated last year
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,223Updated this week
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,531Mar 4, 2026Updated 2 months ago
- A Python Signal-Slot library inspired by Qt, featuring thread-safe communication, async support, and automatic connection type detection.…☆24Dec 29, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆14May 11, 2025Updated last year
- 🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers☆138Jan 13, 2026Updated 4 months ago
- ☆51May 31, 2024Updated last year
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- One Line To Build Zero-Data Classifiers in Minutes☆65Sep 25, 2024Updated last year
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Oct 12, 2025Updated 7 months ago
- Generate python documentation using LLMs☆70Jun 28, 2024Updated last year
- ☆21Oct 20, 2023Updated 2 years ago
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Efficient and general syntactical decoding for Large Language Models