aastroza / structured-generation-benchmark
Structured Generation Evals
☆12Updated 7 months ago
Alternatives and similar repositories for structured-generation-benchmark
Users that are interested in structured-generation-benchmark are comparing it to the libraries listed below
Sorting:
- ☆53Updated this week
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated last month
- Training code for Sparse Autoencoders on Embedding models☆38Updated 2 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆84Updated 5 months ago
- Benchmark structured generation libraries☆27Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆101Updated last year
- ☆28Updated 7 months ago
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆32Updated 11 months ago
- ☆48Updated 6 months ago
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 7 months ago
- ☆62Updated 9 months ago
- Probabilistic LLM evaluations. [CogSci2023; ACL2023]☆73Updated 9 months ago
- Score LLM pretraining data with classifiers☆55Updated last year
- Crispy reranking models by Mixedbread☆31Updated 2 weeks ago
- ☆22Updated 11 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 6 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- ReLM is a Regular Expression engine for Language Models☆104Updated last year
- ☆20Updated last week
- ☆27Updated last year
- ☆82Updated 4 months ago
- LLM sampling method for enforcing syntax adherence in generated output☆25Updated last year
- NLP with Rust for Python 🦀🐍☆62Updated this week
- ☆43Updated 3 months ago
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆48Updated last week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 3 months ago
- Pre-train Static Word Embeddings☆60Updated last month