dottxt-ai / outlines-core
Faster structured generation
☆252Updated 4 months ago
Alternatives and similar repositories for outlines-core
Users who are interested in outlines-core are comparing it to the libraries listed below.
- Super-fast Structured Outputs☆539Updated last week
- A high-performance constrained decoding engine based on context free grammar in Rust☆55Updated 4 months ago
- ☆133Updated last year
- Inference engine for GLiNER models, in Rust☆71Updated 3 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆149Updated 2 months ago
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆204Updated 2 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆60Updated 5 months ago
- High-Performance Engine for Multi-Vector Search☆160Updated last week
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆105Updated 6 months ago
- Formatron empowers everyone to control the format of language models' output with minimal overhead.☆225Updated 3 months ago
- A DSPy rewrite to (not port) Rust☆104Updated this week
- ☆159Updated 10 months ago
- Fast parallel LLM inference for MLX☆220Updated last year
- ☆139Updated last year
- Efficient platform for inference and serving local LLMs including an OpenAI compatible API server.☆473Updated last week
- A framework for optimizing DSPy programs with RL☆185Updated last week
- Use context-free grammars with an LLM☆173Updated last year
- Simple UI for debugging correlations of text embeddings☆292Updated 4 months ago
- ☆61Updated 3 months ago
- A Lightweight Library for AI Observability☆251Updated 7 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆189Updated last year
- Late Interaction Models Training & Retrieval☆608Updated last week
- ☆199Updated last year
- code for training & evaluating Contextual Document Embedding models☆197Updated 4 months ago
- Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from R…☆485Updated this week
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆166Updated 5 months ago
- ☆136Updated last month
- ☆210Updated 3 months ago
- Experimental wasm32-unknown-wasi runtime for Python code execution☆37Updated 10 months ago
- ☆58Updated 2 years ago