This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity
☆48Jan 19, 2024Updated 2 years ago
Alternatives and similar repositories for rlfh-gen-div
Users that are interested in rlfh-gen-div are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- ☆23Jun 22, 2025Updated 9 months ago
- Explore, Establish, Exploit: Red Teaming Language Models from Scratch☆13Jun 21, 2023Updated 2 years ago
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated last month
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence"☆22Nov 19, 2025Updated 4 months ago
- RewardBench: the first evaluation tool for reward models.☆705Feb 16, 2026Updated last month
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- ☆160Nov 23, 2024Updated last year
- Recipes to train reward model for RLHF.☆1,523Apr 24, 2025Updated 11 months ago
- This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…☆32Dec 5, 2024Updated last year
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆191Jan 16, 2025Updated last year
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models☆11Jan 23, 2024Updated 2 years ago
- Directed masked autoencoders☆14Mar 17, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Nov 22, 2021Updated 4 years ago
- JudgeLRM: Large Reasoning Models as a Judge☆41Jan 29, 2026Updated 2 months ago
- EWoK dataset generation framework☆10May 14, 2024Updated last year
- ☆14Jul 24, 2024Updated last year
- ☆12Mar 7, 2024Updated 2 years ago
- A recipe for online RLHF and online iterative DPO.☆543Dec 28, 2024Updated last year
- ☆25May 16, 2024Updated last year
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- ☆31Jul 14, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Apr 15, 2025Updated 11 months ago
- RLBench simulation project for autonomous bin picking using Pandas robot arm