guidance-ai / llguidanceLinks
Super-fast Structured Outputs
☆621Updated last week
Alternatives and similar repositories for llguidance
Users that are interested in llguidance are comparing it to the libraries listed below
Sorting:
- Faster structured generation☆262Updated last month
- ☆456Updated last week
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆62Updated 7 months ago
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆217Updated 2 months ago
- Inference server benchmarking tool☆130Updated 2 months ago
- Formatron empowers everyone to control the format of language models' output with minimal overhead.☆231Updated 5 months ago
- High-Performance Engine for Multi-Vector Search☆189Updated this week
- Late Interaction Models Training & Retrieval☆661Updated 3 weeks ago
- Fast, Flexible and Portable Structured Generation☆1,396Updated last week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆153Updated 4 months ago
- Comparison of Language Model Inference Engines☆236Updated 11 months ago
- ☆234Updated 5 months ago
- xet client tech, used in huggingface_hub☆340Updated this week
- OO for LLMs☆875Updated this week
- Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from R…☆519Updated last week
- Storing long contexts in tiny caches with self-study☆218Updated last month
- ☆473Updated last year
- End-to-end Generative Optimization for AI Agents☆680Updated 3 months ago
- Fast Semantic Text Deduplication & Filtering☆848Updated last month
- Super basic implementation (gist-like) of RLMs with REPL environments.☆273Updated last month
- multilspy is a lsp client library in Python intended to be used to build applications around language servers.☆494Updated 3 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆382Updated this week
- Simple UI for debugging correlations of text embeddings☆302Updated 6 months ago
- ☆234Updated last week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆760Updated this week
- PyTorch Single Controller☆916Updated this week
- Embeddable library or single binary for indexing and searching 1B vectors☆337Updated this week
- A framework for optimizing DSPy programs with RL☆293Updated 2 weeks ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆329Updated last year
- Fast parallel LLM inference for MLX☆234Updated last year