dottxt-ai / benchmarks
Benchmark structured generation libraries
☆15Updated last week
Related projects: ⓘ
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.☆21Updated last month
- Allows to check regexes for overlaps. Based on greenery by @qntm.☆34Updated 3 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated 6 months ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆42Updated 7 months ago
- NLP with Rust for Python 🦀🐍☆57Updated 3 months ago
- Query language for blending SQL logic and LLM reasoning across multi-modal data. [Findings of ACL 2024]☆52Updated last week
- ☆23Updated 2 weeks ago
- Structured Generation Evals☆12Updated 4 months ago
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆73Updated last month
- Certified Reasoning with Language Models☆27Updated 9 months ago
- LLM sampling method for enforcing syntax adherence in generated output☆21Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆26Updated last year
- ☆65Updated 2 months ago
- An attribution library for LLMs☆31Updated this week
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆19Updated last month
- Efficient BM25 with DuckDB 🦆☆12Updated last week
- ☆29Updated last year
- spaCy entry points for Curated Transformers☆23Updated 2 weeks ago
- A file utility for accessing both local and remote files through a unified interface.☆36Updated last month
- A corpus of Python programs annotated with contracts☆20Updated 2 years ago
- ☆18Updated 5 months ago
- Get language models to generate responses in a specific format reliably. Open source implementation of Synchromesh: Reliable code generat…☆23Updated 6 months ago
- Tools to make language models a bit easier to use☆22Updated last week
- Tree-based indexes for neural-search☆28Updated 6 months ago
- High-performance tokenized language data-loader for Python C++ extension☆12Updated last month
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated 8 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- LLM prompt language based on Jinja☆52Updated 2 weeks ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆43Updated 3 weeks ago
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆17Updated 6 months ago