leap-laboratories / PIZZALinks
An attribution library for LLMs
☆41Updated 9 months ago
Alternatives and similar repositories for PIZZA
Users that are interested in PIZZA are comparing it to the libraries listed below
Sorting:
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 2 months ago
- ☆60Updated 3 weeks ago
- ☆47Updated 4 months ago
- Functional Benchmarks and the Reasoning Gap☆87Updated 8 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆81Updated 8 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆58Updated last month
- A framework for optimizing DSPy programs with RL☆75Updated last week
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆94Updated last week
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆80Updated last year
- Lightweight tools for quick and easy LLM demo's☆28Updated 9 months ago
- Sphynx Hallucination Induction☆54Updated 4 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 3 months ago
- Train your own SOTA deductive reasoning model☆94Updated 3 months ago
- An introduction to LLM Sampling☆78Updated 6 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆89Updated 6 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 7 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main)☆23Updated last month
- ☆85Updated 5 months ago
- ☆22Updated 2 weeks ago
- ☆47Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆104Updated 6 months ago
- Tools to make language models a bit easier to use☆47Updated this week
- ☆60Updated last week
- Analysis on the cost of encoder based models☆11Updated 4 months ago
- ☆84Updated 2 months ago
- ☆134Updated 2 months ago
- Writing Blog Posts with Generative Feedback Loops!☆48Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆70Updated last year