Super-fast Structured Outputs
☆793Jun 17, 2026Updated this week
Alternatives and similar repositories for llguidance
Users that are interested in llguidance are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆72Apr 25, 2025Updated last year
- Fast, Flexible and Portable Structured Generation☆1,746Jun 11, 2026Updated last week
- A high-performance constrained decoding engine based on context free grammar in Rust☆59May 22, 2025Updated last year
- Faster structured generation☆294Apr 21, 2026Updated last month
- ☆92Feb 12, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Graph model execution API for Candle☆18Jul 27, 2025Updated 10 months ago
- AICI: Prompts as (Wasm) Programs☆2,075Jan 22, 2025Updated last year
- Structured Outputs☆13,964May 18, 2026Updated last month
- Enforce the output format (JSON Schema, Regex etc) of a language model☆2,020Apr 4, 2026Updated 2 months ago
- A guidance language for controlling large language models.☆21,500May 21, 2026Updated 3 weeks ago
- Fast, flexible LLM inference☆7,282Jun 11, 2026Updated last week
- A language for constraint-guided and efficient LLM programming.☆4,185May 22, 2025Updated last year
- Experimental compiler for deep learning models☆75Sep 18, 2025Updated 9 months ago
- A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.☆391Jul 8, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- llm_utils: Basic LLM tools, best practices, and minimal abstraction.☆47Feb 18, 2025Updated last year
- Benchmark structured generation libraries☆31Oct 25, 2024Updated last year
- A rust wrapper for HIP☆13Jun 10, 2025Updated last year
- Plug n Play GBNF Compiler for llama.cpp☆32Nov 8, 2023Updated 2 years ago
- SGLang is a high-performance serving framework for large language models and multimodal models.☆28,978Updated this week
- Structured Generation Evals☆14Sep 25, 2024Updated last year
- A guide to structured generation using constrained decoding☆18Jun 9, 2024Updated 2 years ago
- GLiNER inference in JavaScript☆27Mar 2, 2025Updated last year
- ☆12Sep 27, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- FlashInfer: Kernel Library for LLM Serving☆5,791Updated this week
- Serving multiple LoRA finetuned LLM as one☆1,161May 8, 2024Updated 2 years ago
- Use context-free grammars with an LLM☆175Mar 21, 2024Updated 2 years ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆48May 3, 2024Updated 2 years ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM☆3,418Updated this week
- OO for LLMs☆911Jun 11, 2026Updated last week
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,926Feb 24, 2024Updated 2 years ago
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.☆350Jan 22, 2026Updated 4 months ago
- Easy and Efficient Quantization for Transformers☆205Mar 25, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,795May 28, 2026Updated 3 weeks ago
- A blazing fast inference solution for text embeddings models☆4,874Updated this week
- Open-source LLM/VLM load balancer and serving platform for self-hosting LLMs (and VLMs) at scale 🏓🦙 Alternative to projects like llm-d,…☆1,603Updated this week
- ☆14Dec 21, 2025Updated 5 months ago
- DSPy: The framework for programming—not prompting—language models☆35,064Updated this week
- Rust crate for some audio utilities☆28Mar 8, 2025Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆83,135Updated this week