allenai / hybrid-preferencesLinks

Learning to route instances for Human vs AI Feedback (ACL Main '25)

☆25

Alternatives and similar repositories for hybrid-preferences

Users that are interested in hybrid-preferences are comparing it to the libraries listed below

Sorting:

ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated last year
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆89Updated last year
LoryPack / LLM-LieDetector
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆71Updated last year
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79Updated last year
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆168Updated last year
allenai / infinigram-api
☆82Updated this week
allenai / IFBench
☆88Updated last week
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29Updated 9 months ago
SALT-NLP / demonstrated-feedback
☆129Updated last year
tianyang-x / SaySelf
Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"
☆109Updated last year
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆91Updated last year
sher222 / LeReT
Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
☆51Updated last year
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆28Updated 10 months ago
OSU-NLP-Group / Middleware
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)
☆37Updated 10 months ago
oriyor / assistantbench
Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"
☆63Updated 11 months ago
SalesforceAIResearch / LaTRO
☆124Updated 9 months ago
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42Updated last year
arcee-ai / DAM
☆55Updated last year
Tebmer / Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…
☆27Updated 11 months ago
msclar / formatspread
Code accompanying "How I learned to start worrying about prompt formatting".
☆110Updated 5 months ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆44Updated last month
AbhilashaRavichander / information-probing
☆11Updated 6 months ago
mungg / FABLES
☆58Updated last year
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆151Updated 9 months ago
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54Updated last year
allenai / SciRIFF
Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.
☆44Updated 8 months ago
aryamanarora / causalgym
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
☆50Updated 11 months ago
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆79Updated 7 months ago
sfeucht / footprints
https://footprints.baulab.info
☆17Updated last year
hamishivi / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆75Updated last year