guidance-ai / llguidanceLinks
Super-fast Structured Outputs
☆281Updated this week
Alternatives and similar repositories for llguidance
Users that are interested in llguidance are comparing it to the libraries listed below
Sorting:
- Faster structured generation☆218Updated 2 weeks ago
- Comparison of Language Model Inference Engines☆217Updated 5 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆54Updated last month
- Formatron empowers everyone to control the format of language models' output with minimal overhead.☆200Updated this week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆136Updated last week
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆372Updated this week
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆153Updated 7 months ago
- Fast parallel LLM inference for MLX☆189Updated 10 months ago
- ☆198Updated last year
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆204Updated last week
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆147Updated this week
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆130Updated last month
- A Lightweight Library for AI Observability☆243Updated 3 months ago
- ☆129Updated last year
- Fast, Flexible and Portable Structured Generation☆993Updated this week
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆92Updated 2 months ago
- A high-performance constrained decoding engine based on context free grammar in Rust☆52Updated 2 weeks ago
- ☆178Updated last month
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆317Updated this week
- ☆152Updated 6 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated 10 months ago
- Simple UI for debugging correlations of text embeddings☆256Updated last week
- ☆121Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 3 months ago
- Long context evaluation for large language models☆213Updated 3 months ago
- smol models are fun too☆92Updated 6 months ago
- Start a server from the MLX library.☆187Updated 10 months ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆137Updated 10 months ago
- ☆149Updated 2 weeks ago
- Train your own SOTA deductive reasoning model☆92Updated 2 months ago