guidance-ai / llguidanceLinks

Super-fast Structured Outputs

☆350

Alternatives and similar repositories for llguidance

Users that are interested in llguidance are comparing it to the libraries listed below

Sorting:

dottxt-ai / outlines-core
Faster structured generation
☆237Updated 2 months ago
Dan-wanna-M / formatron
Formatron empowers everyone to control the format of language models' output with minimal overhead.
☆220Updated last month
guidance-ai / llgtrt
TensorRT-LLM server with Structured Outputs (JSON) built with Rust
☆57Updated 3 months ago
ScalingIntelligence / tokasaurus
☆388Updated this week
beowolx / rensa
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…
☆192Updated 2 weeks ago
mixedbread-ai / batched
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆142Updated 3 weeks ago
huggingface / inference-benchmarker
Inference server benchmarking tool
☆87Updated 3 months ago
SWE-agent / SWE-ReX
Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.
☆273Updated last week
pyember / ember
☆209Updated last month
lapp0 / lm-inference-engines
Comparison of Language Model Inference Engines
☆222Updated 7 months ago
jlscheerer / xtr-warp
XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.
☆152Updated 3 months ago
willccbb / mlx_parallm
Fast parallel LLM inference for MLX
☆204Updated last year
AnswerDotAI / fastdata
☆154Updated 8 months ago
EricLBuehler / candle-vllm
Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.
☆409Updated this week
Oxen-AI / GRPO-With-Cargo-Feedback
This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback
☆100Updated 4 months ago
guidance-ai / jsonschemabench
☆51Updated last month
google / lmeval
☆219Updated last month
lightonai / fast-plaid
High-Performance Engine for Multi-Vector Search
☆132Updated last month
QuixiAI / spectrum
☆128Updated 3 months ago
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆137Updated last year
lightonai / pylate
Late Interaction Models Training & Retrieval
☆511Updated 2 weeks ago
willkurt / token-explorer
A simple tool that let's you explore different possible paths that an LLM might sample.
☆180Updated 3 months ago
cfahlgren1 / observers
A Lightweight Library for AI Observability
☆249Updated 5 months ago
mixedbread-ai / baguetter
Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…
☆186Updated 11 months ago
magicproduct / hash-hop
Long context evaluation for large language models
☆220Updated 5 months ago
Dan-wanna-M / kbnf
A high-performance constrained decoding engine based on context free grammar in Rust
☆54Updated 2 months ago
run-ai / runai-model-streamer
☆231Updated this week
TheProxyCompany / proxy-structuring-engine
Guaranteed Structured Output from any Language Model via Hierarchical State Machines
☆141Updated 2 months ago
jina-ai / correlations
Simple UI for debugging correlations of text embeddings
☆288Updated 2 months ago
LaurentMazare / mamba.rs
☆130Updated last year