dottxt-ai / benchmarks
Benchmark structured generation libraries
☆26Updated 4 months ago
Alternatives and similar repositories for benchmarks:
Users that are interested in benchmarks are comparing it to the libraries listed below
- Structured Generation Evals☆12Updated 5 months ago
- Allows to check regexes for overlaps. Based on greenery by @qntm.☆48Updated 9 months ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆51Updated last year
- Training code for Sparse Autoencoders on Embedding models☆35Updated 2 weeks ago
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.☆27Updated 4 months ago
- NLP with Rust for Python 🦀🐍☆61Updated 9 months ago
- Chat Markup Language conversation library☆55Updated last year
- ☆18Updated 4 months ago
- The official evaluation suite and dynamic data release for MixEval.☆10Updated 5 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆31Updated 2 weeks ago
- ☆28Updated 5 months ago
- ☆22Updated last year
- Tools to make language models a bit easier to use☆37Updated last week
- Python tools☆12Updated last year
- An attribution library for LLMs☆37Updated 5 months ago
- ☆38Updated 7 months ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Updated 7 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated 11 months ago
- Using modal.com to process FineWeb-edu data☆20Updated last week
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆16Updated last year
- A miniature version of Modal☆20Updated 9 months ago
- Use sync mode Playwright interactively, inside a Jupyter notebook☆15Updated 3 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- PageRank for LLMs☆39Updated last week
- A Framework For Intelligence Farming☆13Updated 9 months ago
- Leverage your LangChain trace data for fine tuning☆41Updated 7 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆81Updated last week
- Run LLMs on Replicate with vLLM☆16Updated 5 months ago