Super-fast Structured Outputs
☆706Mar 3, 2026Updated this week
Alternatives and similar repositories for llguidance
Users that are interested in llguidance are comparing it to the libraries listed below
Sorting:
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆67Apr 25, 2025Updated 10 months ago
- Faster structured generation☆277Jan 26, 2026Updated last month
- A high-performance constrained decoding engine based on context free grammar in Rust☆58May 22, 2025Updated 9 months ago
- Fast, Flexible and Portable Structured Generation☆1,567Updated this week
- ☆81Feb 12, 2026Updated 3 weeks ago
- Graph model execution API for Candle☆17Jul 27, 2025Updated 7 months ago
- AICI: Prompts as (Wasm) Programs☆2,063Jan 22, 2025Updated last year
- Structured Outputs☆13,488Mar 2, 2026Updated last week
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,992Aug 24, 2025Updated 6 months ago
- Fast, flexible LLM inference☆6,653Feb 27, 2026Updated last week
- A guidance language for controlling large language models.☆21,333Feb 13, 2026Updated 3 weeks ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM☆2,817Updated this week
- A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.☆383Jul 8, 2025Updated 8 months ago
- Easy and Efficient Quantization for Transformers☆206Jan 28, 2026Updated last month
- A language for constraint-guided and efficient LLM programming.☆4,155May 22, 2025Updated 9 months ago
- OO for LLMs☆898Updated this week
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙 Alternative to projects like llm-d, Docker Model R…☆1,474Updated this week
- ☆18Aug 19, 2024Updated last year
- Experimental compiler for deep learning models☆75Sep 18, 2025Updated 5 months ago
- SGLang is a high-performance serving framework for large language models and multimodal models.☆24,216Updated this week
- A blazing fast inference solution for text embeddings models☆4,553Feb 25, 2026Updated last week
- Supercharge Your LLM with the Fastest KV Cache Layer☆7,272Updated this week
- Serving multiple LoRA finetuned LLM as one☆1,144May 8, 2024Updated last year
- Plug n Play GBNF Compiler for llama.cpp☆28Nov 8, 2023Updated 2 years ago
- FlashInfer: Kernel Library for LLM Serving☆5,101Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,732May 21, 2025Updated 9 months ago
- Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, S…☆3,938Mar 2, 2026Updated last week
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,905Feb 24, 2024Updated 2 years ago
- The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)☆7,722Updated this week
- structured outputs for llms☆12,468Feb 25, 2026Updated last week
- A guide to structured generation using constrained decoding☆14Jun 9, 2024Updated last year
- Rust crate for some audio utilities☆27Mar 8, 2025Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inference☆63Sep 18, 2025Updated 5 months ago
- DSPy: The framework for programming—not prompting—language models☆32,519Updated this week
- ☆37May 5, 2025Updated 10 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,884Mar 2, 2026Updated last week
- A rust wrapper for HIP☆12Jun 10, 2025Updated 8 months ago
- Neural Search☆367Mar 11, 2025Updated 11 months ago
- ☆43Apr 22, 2025Updated 10 months ago