guidance-ai / llguidanceView external linksLinks
Super-fast Structured Outputs
☆688Updated this week
Alternatives and similar repositories for llguidance
Users that are interested in llguidance are comparing it to the libraries listed below
Sorting:
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆67Apr 25, 2025Updated 9 months ago
- Faster structured generation☆275Jan 26, 2026Updated 3 weeks ago
- A high-performance constrained decoding engine based on context free grammar in Rust☆58May 22, 2025Updated 8 months ago
- Fast, Flexible and Portable Structured Generation☆1,548Updated this week
- ☆77Updated this week
- Graph model execution API for Candle☆17Jul 27, 2025Updated 6 months ago
- AICI: Prompts as (Wasm) Programs☆2,061Jan 22, 2025Updated last year
- Structured Outputs☆13,403Feb 6, 2026Updated last week
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,990Aug 24, 2025Updated 5 months ago
- Fast, flexible LLM inference☆6,580Updated this week
- A guidance language for controlling large language models.☆21,270Feb 6, 2026Updated last week
- A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.☆382Jul 8, 2025Updated 7 months ago
- Easy and Efficient Quantization for Transformers☆205Jan 28, 2026Updated 2 weeks ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM☆2,737Updated this week
- A language for constraint-guided and efficient LLM programming.☆4,148May 22, 2025Updated 8 months ago
- OO for LLMs☆892Feb 7, 2026Updated last week
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙 Alternative to projects like llm-d, Docker Model R…☆1,455Feb 6, 2026Updated last week
- ☆18Aug 19, 2024Updated last year
- Supercharge Your LLM with the Fastest KV Cache Layer☆6,871Updated this week
- Experimental compiler for deep learning models☆75Sep 18, 2025Updated 4 months ago
- SGLang is a high-performance serving framework for large language models and multimodal models.☆23,547Updated this week
- Serving multiple LoRA finetuned LLM as one☆1,139May 8, 2024Updated last year
- A blazing fast inference solution for text embeddings models☆4,495Updated this week
- Plug n Play GBNF Compiler for llama.cpp☆28Nov 8, 2023Updated 2 years ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,719May 21, 2025Updated 8 months ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,902Feb 24, 2024Updated last year
- Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, S…☆3,847Jan 27, 2026Updated 2 weeks ago
- FlashInfer: Kernel Library for LLM Serving☆4,935Feb 10, 2026Updated last week
- structured outputs for llms☆12,357Updated this week
- A guide to structured generation using constrained decoding☆14Jun 9, 2024Updated last year
- The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)☆7,549Feb 10, 2026Updated last week
- Rust crate for some audio utilities☆27Mar 8, 2025Updated 11 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference☆63Sep 18, 2025Updated 4 months ago
- ☆37May 5, 2025Updated 9 months ago
- DSPy: The framework for programming—not prompting—language models☆32,156Updated this week
- A rust wrapper for HIP☆12Jun 10, 2025Updated 8 months ago
- cursor logs with gpt-4o using litellm proxy☆14Sep 9, 2025Updated 5 months ago
- Neural Search☆367Mar 11, 2025Updated 11 months ago
- ☆43Apr 22, 2025Updated 9 months ago