Formatron empowers everyone to control the format of language models' output with minimal overhead.
☆234Jun 7, 2025Updated 9 months ago
Alternatives and similar repositories for formatron
Users that are interested in formatron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A high-performance constrained decoding engine based on context free grammar in Rust☆58May 22, 2025Updated 10 months ago
- ☆11Jul 23, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,999Aug 24, 2025Updated 7 months ago
- A highly customizable, full scale web backend for web-rwkv, built on axum with websocket protocol.☆28Apr 15, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆134Mar 14, 2026Updated last week
- Large-scale LLM inference engine☆1,681Mar 12, 2026Updated 2 weeks ago
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆35Mar 9, 2026Updated 2 weeks ago
- A collection of various LLM sampling methods implemented in pure Pytorch☆29Dec 9, 2024Updated last year
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆86Sep 22, 2024Updated last year
- run ollama & gguf easily with a single command☆52May 15, 2024Updated last year
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,158Updated this week
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,476Mar 4, 2026Updated 3 weeks ago
- A Python Signal-Slot library inspired by Qt, featuring thread-safe communication, async support, and automatic connection type detection.…☆24Dec 29, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- 🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers☆134Jan 13, 2026Updated 2 months ago
- ☆13May 11, 2025Updated 10 months ago
- ☆51May 31, 2024Updated last year
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Oct 12, 2025Updated 5 months ago
- Generate python documentation using LLMs☆71Jun 28, 2024Updated last year
- Optimizing inference proxy for LLMs☆3,389Mar 19, 2026Updated last week
- Efficient and general syntactical decoding for Large Language Models☆330Jan 19, 2026Updated 2 months ago
- ☆53Oct 29, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Structured Generation Evals☆14Sep 25, 2024Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- Simple LLM inference server☆20Jun 13, 2024Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated 2 months ago
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…☆90Jul 26, 2024Updated last year
- 90% of what you need for LLM app development. Nothing you don't.☆274Aug 25, 2025Updated 7 months ago
- A modular framework for building massively parallel agentic systems☆31Sep 8, 2025Updated 6 months ago
- ☆17Dec 16, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆179Jan 13, 2026Updated 2 months ago
- A Lightweight Library for LLM I/O☆122Mar 12, 2026Updated 2 weeks ago
- A flexible python duration parser designed for human readable lengths of time.☆21Feb 18, 2026Updated last month
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆701Updated this week
- Structured Outputs☆13,588Updated this week
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆48Sep 26, 2024Updated last year
- Rust snippets and tips☆17Oct 20, 2021Updated 4 years ago