guidance-ai / llguidanceLinks
Super-fast Structured Outputs
☆350Updated this week
Alternatives and similar repositories for llguidance
Users that are interested in llguidance are comparing it to the libraries listed below
Sorting:
- Faster structured generation☆237Updated 2 months ago
- Formatron empowers everyone to control the format of language models' output with minimal overhead.☆220Updated last month
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆57Updated 3 months ago
- ☆388Updated this week
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆192Updated 2 weeks ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆142Updated 3 weeks ago
- Inference server benchmarking tool☆87Updated 3 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆273Updated last week
- ☆209Updated last month
- Comparison of Language Model Inference Engines☆222Updated 7 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆152Updated 3 months ago
- Fast parallel LLM inference for MLX☆204Updated last year
- ☆154Updated 8 months ago
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆409Updated this week
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆100Updated 4 months ago
- ☆51Updated last month
- ☆219Updated last month
- High-Performance Engine for Multi-Vector Search☆132Updated last month
- ☆128Updated 3 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated last year
- Late Interaction Models Training & Retrieval☆511Updated 2 weeks ago
- A simple tool that let's you explore different possible paths that an LLM might sample.☆180Updated 3 months ago
- A Lightweight Library for AI Observability☆249Updated 5 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆186Updated 11 months ago
- Long context evaluation for large language models☆220Updated 5 months ago
- A high-performance constrained decoding engine based on context free grammar in Rust☆54Updated 2 months ago
- ☆231Updated this week
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆141Updated 2 months ago
- Simple UI for debugging correlations of text embeddings☆288Updated 2 months ago
- ☆130Updated last year