dottxt-ai / benchmarks
Benchmark structured generation libraries
☆26 Updated 4 months ago
Alternatives and similar repositories for benchmarks:
Users who are interested in benchmarks are comparing it to the libraries listed below:
- Structured Generation Evals ☆12 Updated 5 months ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX ☆53 Updated last year
- ☆19 Updated 4 months ago
- Checks regexes for overlaps. Based on greenery by @qntm. ☆48 Updated 9 months ago
- NLP with Rust for Python 🦀🐍 ☆61 Updated 9 months ago
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs. ☆27 Updated 5 months ago
- Training code for Sparse Autoencoders on Embedding models ☆35 Updated 3 weeks ago
- ☆28 Updated 5 months ago
- A Framework For Intelligence Farming ☆13 Updated 10 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I… ☆84 Updated this week
- Chat Markup Language conversation library ☆55 Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated last year
- Tools to make language models a bit easier to use ☆39 Updated 2 weeks ago
- ☆22 Updated 10 months ago
- ☆27 Updated 4 months ago
- Leverage your LangChain trace data for fine tuning ☆41 Updated 7 months ago
- ☆48 Updated 4 months ago
- A framework for evaluating function calls made by LLMs ☆37 Updated 7 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆38 Updated 11 months ago
- Lightweight tools for quick and easy LLM demos ☆26 Updated 6 months ago
- Get language models to generate responses in a specific format reliably. Open source implementation of Synchromesh: Reliable code generat… ☆27 Updated last year
- Simple Model Similarities Analysis ☆21 Updated last year
- ☆24 Updated last year
- Minimal PyTorch implementation of BM25 (with sparse tensors) ☆97 Updated last year
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols ☆13 Updated 7 months ago
- Small, simple agent task environments for training and evaluation ☆18 Updated 4 months ago
- A fork of sqlite-utils with CLI etc. removed ☆14 Updated 3 months ago
- Verbosity control for AI agents ☆60 Updated 10 months ago