Dan-wanna-M/formatron

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Dan-wanna-M/formatron)

Dan-wanna-M / formatron

Formatron empowers everyone to control the format of language models' output with minimal overhead.

☆237

Alternatives and similar repositories for formatron

Users that are interested in formatron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Dan-wanna-M / kbnf
View on GitHub
A high-performance constrained decoding engine based on context free grammar in Rust
☆59May 22, 2025Updated last year
npk48 / rwkv_cuda
View on GitHub
☆11Jul 23, 2023Updated 3 years ago
noamgat / lm-format-enforcer
View on GitHub
Enforce the output format (JSON Schema, Regex etc) of a language model
☆2,026Apr 4, 2026Updated 3 months ago
remichu-ai / gallama
View on GitHub
☆137Jun 30, 2026Updated 3 weeks ago
Prunoideae / web-rwkv-axum
View on GitHub
A highly customizable, full scale web backend for web-rwkv, built on axum with websocket protocol.
☆30Apr 15, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
shreyansh26 / LLM-Sampling
View on GitHub
A collection of various LLM sampling methods implemented in pure Pytorch
☆30Dec 9, 2024Updated last year
dphnAI / sonar
View on GitHub
Large-scale LLM inference engine
☆1,810Updated this week
monk1337 / auto-ollama
View on GitHub
run ollama & gguf easily with a single command
☆52May 15, 2024Updated 2 years ago
av / klmbr
View on GitHub
klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs
☆90Sep 22, 2024Updated last year
theroyallab / tabbyAPI
View on GitHub
The official API server for Exllama. OAI compatible, lightweight, and fast.
☆1,288Jul 18, 2026Updated last week
nyunAI / PruneGPT
View on GitHub
☆51May 31, 2024Updated 2 years ago
epfl-dlab / transformers-CFG
View on GitHub
🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers
☆139Jan 13, 2026Updated 6 months ago
HabermannR / Fantasy-Tribe-Game
View on GitHub
LLM backed Fantasy Tribe Game
☆19Nov 21, 2024Updated last year
turboderp-org / exllamav2
View on GitHub
A fast inference library for running LLMs locally on modern consumer-class GPUs
☆4,593Mar 4, 2026Updated 4 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
amitlevy / evolutionaryGPT
View on GitHub
Evolutionary Search for expert-level performance on any task with environmental feedback
☆14Oct 12, 2025Updated 9 months ago
kenhktsui / anyclassifier
View on GitHub
One Line To Build Zero-Data Classifiers in Minutes
☆65Sep 25, 2024Updated last year
shivamsanju / ragswift
View on GitHub
🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform
☆38Jan 29, 2024Updated 2 years ago
hyperfocAIs / Attend
View on GitHub
Attend - to what matters.
☆17Feb 22, 2025Updated last year
wdlctc / mini-s
View on GitHub
☆51Oct 29, 2024Updated last year
FarFetchd / sleepyllama
View on GitHub
an auto-sleeping and -waking framework around llama.cpp
☆13Feb 8, 2025Updated last year
BWStearns / circadian_tools
View on GitHub
☆21Oct 20, 2023Updated 2 years ago
aastroza / structured-generation-benchmark
View on GitHub
Structured Generation Evals
☆14Sep 25, 2024Updated last year
Joluck / MiSS
View on GitHub
MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…
☆35Mar 9, 2026Updated 4 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
honeyhiveai / realign
View on GitHub
Realign is a testing and simulation framework for AI applications.
☆19Dec 4, 2024Updated last year
leonsama / web-rwkv-realweb
View on GitHub
☆12Feb 20, 2026Updated 5 months ago
MananSoni42 / lmdocs
View on GitHub
Generate python documentation using LLMs
☆70Jun 28, 2024Updated 2 years ago
silphendio / sliced_llama
View on GitHub
Simple LLM inference server
☆20Jun 13, 2024Updated 2 years ago
charmandercha / ArchiDoc
View on GitHub
☆16Dec 16, 2024Updated last year
Joluck / RWKV-PEFT
View on GitHub
☆183Jan 13, 2026Updated 6 months ago
jukofyork / transplant-vocab
View on GitHub
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆54Oct 29, 2025Updated 8 months ago
RWKV-Vibe / RWKV-LM-V7
View on GitHub
RWKV-LM-V7(https://github.com/BlinkDL/RWKV-LM) Under Lightning Framework
☆62May 13, 2026Updated 2 months ago
VITA-Group / Q-GaLore
View on GitHub
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆206Jul 17, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
michaelfeil / embed
View on GitHub
A stable, fast and easy-to-use inference library with a focus on a sync-to-async API
☆48Sep 26, 2024Updated last year
ElleLeonne / Lightning-ReLoRA
View on GitHub
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆34Mar 2, 2024Updated 2 years ago
rooben-me / tone-changer-open
View on GitHub
This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…
☆91Jul 26, 2024Updated 2 years ago
ShelbyJenkins / llm_client
View on GitHub
The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes
☆255Aug 6, 2025Updated 11 months ago
NVIDIA / logits-processor-zoo
View on GitHub
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
☆388Jul 8, 2025Updated last year
Cornell-RelaxML / qtip
View on GitHub
☆181Jun 22, 2025Updated last year
zenoverflow / omnichain
View on GitHub
Efficient visual programming for AI language models
☆357May 13, 2025Updated last year