allenai / hybrid-preferences
Learning to route instances for Human vs AI Feedback
☆19 Updated last week
Alternatives and similar repositories for hybrid-preferences:
Users who are interested in hybrid-preferences are comparing it to the libraries listed below.
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆54 Updated 5 months ago
- ☆48 Updated 3 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 Updated last year
- ☆17 Updated 4 months ago
- Few-shot Learning with Auxiliary Data ☆26 Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs ☆16 Updated 11 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆42 Updated last year
- ☆21 Updated 3 weeks ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated 11 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators" ☆34 Updated 2 months ago
- PyTorch implementation for MRL ☆18 Updated 11 months ago
- ☆19 Updated 4 months ago
- Embedding Recycling for Language models ☆38 Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆31 Updated last year
- https://footprints.baulab.info ☆16 Updated 4 months ago
- Minimum Description Length probing for neural network representations ☆18 Updated 3 weeks ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆46 Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod… ☆14 Updated last year
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments. ☆25 Updated last week
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆24 Updated 2 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models ☆22 Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation ☆28 Updated 2 weeks ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training. ☆25 Updated 2 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆38 Updated 3 months ago
- ☆57 Updated 4 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024) ☆43 Updated last month
- ☆40 Updated last week
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆14 Updated 11 months ago
- Interview-based evaluation of LLMs ☆15 Updated last month