guidance-ai / llguidanceLinks
Super-fast Structured Outputs
☆561Updated last week
Alternatives and similar repositories for llguidance
Users that are interested in llguidance are comparing it to the libraries listed below
Sorting:
- Faster structured generation☆254Updated last week
- ☆443Updated 2 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆59Updated 6 months ago
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆210Updated 3 weeks ago
- High-Performance Engine for Multi-Vector Search☆173Updated 2 weeks ago
- Formatron empowers everyone to control the format of language models' output with minimal overhead.☆226Updated 4 months ago
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆496Updated this week
- Inference server benchmarking tool☆118Updated 3 weeks ago
- multilspy is a lsp client library in Python intended to be used to build applications around language servers.☆459Updated last month
- Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from R…☆508Updated this week
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆339Updated this week
- xet client tech, used in huggingface_hub☆302Updated this week
- Fast, Flexible and Portable Structured Generation☆1,323Updated this week
- Fast parallel LLM inference for MLX☆223Updated last year
- Simple UI for debugging correlations of text embeddings☆296Updated 4 months ago
- A high-performance constrained decoding engine based on context free grammar in Rust☆55Updated 5 months ago
- Late Interaction Models Training & Retrieval☆626Updated this week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆151Updated 3 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆325Updated last year
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆107Updated 7 months ago
- ☆572Updated last year
- Lightweight Nearest Neighbors with Flexible Backends☆311Updated 3 weeks ago
- Comparison of Language Model Inference Engines☆232Updated 10 months ago
- ☆229Updated 4 months ago
- ☆512Updated 2 weeks ago
- ☆63Updated 4 months ago
- A framework for optimizing DSPy programs with RL☆208Updated this week
- Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multis…☆275Updated last year
- Official inference library for pre-processing of Mistral models☆803Updated 2 weeks ago
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙☆1,337Updated 2 weeks ago