Super-fast Structured Outputs
☆717 · Mar 24, 2026 · Updated this week
Alternatives and similar repositories for llguidance
Users that are interested in llguidance are comparing it to the libraries listed below.
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust ☆69 · Apr 25, 2025 · Updated 11 months ago
- Fast, Flexible and Portable Structured Generation ☆1,595 · Updated this week
- A high-performance constrained decoding engine based on context free grammar in Rust ☆58 · May 22, 2025 · Updated 10 months ago
- Faster structured generation ☆278 · Mar 12, 2026 · Updated 2 weeks ago
- ☆83 · Feb 12, 2026 · Updated last month
- Graph model execution API for Candle ☆17 · Jul 27, 2025 · Updated 8 months ago
- AICI: Prompts as (Wasm) Programs ☆2,065 · Jan 22, 2025 · Updated last year
- Structured Outputs ☆13,588 · Mar 21, 2026 · Updated last week
- A guidance language for controlling large language models. ☆21,362 · Mar 18, 2026 · Updated last week
- Enforce the output format (JSON Schema, Regex etc) of a language model ☆1,999 · Aug 24, 2025 · Updated 7 months ago
- Fast, flexible LLM inference ☆6,733 · Mar 21, 2026 · Updated last week
- Experimental compiler for deep learning models ☆75 · Sep 18, 2025 · Updated 6 months ago
- Plug n Play GBNF Compiler for llama.cpp ☆28 · Nov 8, 2023 · Updated 2 years ago
- A language for constraint-guided and efficient LLM programming. ☆4,164 · May 22, 2025 · Updated 10 months ago
- llm_utils: Basic LLM tools, best practices, and minimal abstraction. ☆48 · Feb 18, 2025 · Updated last year
- Benchmark structured generation libraries ☆31 · Oct 25, 2024 · Updated last year
- A rust wrapper for HIP ☆12 · Jun 10, 2025 · Updated 9 months ago
- SGLang is a high-performance serving framework for large language models and multimodal models. ☆25,041 · Updated this week
- Structured Generation Evals ☆14 · Sep 25, 2024 · Updated last year
- A guide to structured generation using constrained decoding ☆14 · Jun 9, 2024 · Updated last year
- A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks. ☆385 · Jul 8, 2025 · Updated 8 months ago
- ☆12 · Sep 27, 2017 · Updated 8 years ago
- Use context-free grammars with an LLM ☆174 · Mar 21, 2024 · Updated 2 years ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face ☆47 · May 3, 2024 · Updated last year
- FlashInfer: Kernel Library for LLM Serving ☆5,231 · Updated this week
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM ☆2,928 · Updated this week
- OO for LLMs ☆901 · Updated this week
- Serving multiple LoRA finetuned LLM as one ☆1,148 · May 8, 2024 · Updated last year
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,915 · Feb 24, 2024 · Updated 2 years ago
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources. ☆333 · Jan 22, 2026 · Updated 2 months ago
- Easy and Efficient Quantization for Transformers ☆206 · Updated this week
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙 Alternative to projects like llm-d, Docker Model R… ☆1,496 · Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,740 · May 21, 2025 · Updated 10 months ago
- A blazing fast inference solution for text embeddings models ☆4,625 · Mar 23, 2026 · Updated last week
- 🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers ☆134 · Jan 13, 2026 · Updated 2 months ago
- ☆14 · Dec 21, 2025 · Updated 3 months ago
- Rust crate for some audio utilities ☆27 · Mar 8, 2025 · Updated last year
- DSPy: The framework for programming—not prompting—language models ☆33,038 · Mar 22, 2026 · Updated last week
- structured outputs for llms ☆12,589 · Updated this week