guidance-ai / llguidance
Super-fast Structured Outputs
☆657 Updated last month
Alternatives and similar repositories for llguidance
Users who are interested in llguidance are comparing it to the libraries listed below.
- Faster structured generation ☆270 Updated last week
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust ☆65 Updated 8 months ago
- ☆459 Updated last month
- Formatron empowers everyone to control the format of language models' output with minimal overhead. ☆232 Updated 7 months ago
- Fast, Flexible and Portable Structured Generation ☆1,476 Updated this week
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas… ☆225 Updated last week
- Comparison of Language Model Inference Engines ☆239 Updated last year
- Late Interaction Models Training & Retrieval ☆681 Updated last week
- High-Performance Engine for Multi-Vector Search ☆201 Updated this week
- Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from R… ☆539 Updated this week
- Inference server benchmarking tool ☆136 Updated 3 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆154 Updated 6 months ago
- ☆237 Updated last week
- Embeddable library or single binary for indexing and searching 1B vectors ☆351 Updated last week
- multilspy is an LSP client library in Python intended to be used to build applications around language servers. ☆517 Updated 4 months ago
- Inference engine for GLiNER models, in Rust ☆81 Updated last week
- Simple UI for debugging correlations of text embeddings ☆305 Updated 7 months ago
- ☆584 Updated last year
- Fast parallel LLM inference for MLX ☆241 Updated last year
- End-to-end Generative Optimization for AI Agents ☆704 Updated last month
- A high-performance constrained decoding engine based on context free grammar in Rust ☆58 Updated 7 months ago
- xet client tech, used in huggingface_hub ☆379 Updated this week
- ☆236 Updated last month
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆345 Updated last year
- Official inference library for pre-processing of Mistral models ☆846 Updated last week
- OO for LLMs ☆887 Updated last week
- A framework for optimizing DSPy programs with RL ☆304 Updated this week
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more. ☆404 Updated 2 weeks ago
- Harbor is a framework for running agent evaluations and creating and using RL environments. ☆381 Updated this week
- ☆476 Updated 2 years ago