facebookresearch/rlfh-gen-div

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/rlfh-gen-div)

facebookresearch / rlfh-gen-div

This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity

☆50

Alternatives and similar repositories for rlfh-gen-div

Users that are interested in rlfh-gen-div are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EsYoon7 / RLHF-TLCR
View on GitHub
[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
☆12Dec 6, 2024Updated last year
alexrame / rewardedsoups
View on GitHub
Rewarded soups official implementation
☆64Sep 27, 2023Updated 2 years ago
liziniu / policy_optimization
View on GitHub
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)
☆29Dec 19, 2023Updated 2 years ago
YichenZW / awesome-llm-diversity
View on GitHub
A curated collection of research papers exploring diversity in Large Language Model text generation. This repository tracks cutting-edge …
☆15Jun 19, 2026Updated last month
Algorithmic-Alignment-Lab / CommonClaim
View on GitHub
Explore, Establish, Exploit: Red Teaming Language Models from Scratch
☆15Jun 21, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dunzeng / MORE
View on GitHub
Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment
☆16Aug 6, 2024Updated last year
ernie-research / CD-RLHF
View on GitHub
[ACL'25] Official code of curiosity-driven RLHF
☆16Jun 22, 2025Updated last year
allenai / reward-bench
View on GitHub
RewardBench: the first evaluation tool for reward models.
☆727Feb 16, 2026Updated 5 months ago
HumanCompatibleAI / overcooked-hAI-exp
View on GitHub
Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)
☆13May 10, 2021Updated 5 years ago
Asap7772 / understanding-rlhf
View on GitHub
Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…
☆32Apr 20, 2024Updated 2 years ago
WeiXiongUST / Building-Math-Agents-with-Multi-Turn-Iterative-Preference-Learning
View on GitHub
This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…
☆32Dec 5, 2024Updated last year
vwxyzjn / summarize_from_feedback_details
View on GitHub
☆164Nov 23, 2024Updated last year
PKU-Alignment / aligner
View on GitHub
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
☆194Jan 16, 2025Updated last year
d223302 / Over-Reasoning-of-LLMs
View on GitHub
Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models
☆11Jan 23, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
facebookresearch / dmae_st
View on GitHub
Directed masked autoencoders
☆14Mar 25, 2026Updated 4 months ago
McGill-NLP / latent-translation
View on GitHub
Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"
☆17Nov 22, 2021Updated 4 years ago
sashrikap / context-steering
View on GitHub
Code for the paper "CoS: Enhancing Personalization and Mitigating Bias with Context Steering"
☆20Dec 13, 2024Updated last year
justincui03 / or-bench
View on GitHub
[ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"
☆29Mar 4, 2025Updated last year
yizhongw / truthfulqa_reeval
View on GitHub
☆12Mar 7, 2024Updated 2 years ago
MattYoon / reasoning-models-confidence
View on GitHub
[NeurIPS 2025] Reasoning Models Better Express Their Confidence"
☆23Nov 19, 2025Updated 8 months ago
donglixp / ICL_PaperList
View on GitHub
Paper List for In-context Learning 🌷
☆19Jan 3, 2023Updated 3 years ago
RLHFlow / Online-RLHF
View on GitHub
A recipe for online RLHF and online iterative DPO.
☆544Dec 28, 2024Updated last year
HITsz-TMG / ICL-State-Vector
View on GitHub
☆12Jul 4, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tangzhy / RealCritic
View on GitHub
☆15Jan 27, 2025Updated last year
vmicheli / lm-butlers
View on GitHub
☆12Aug 30, 2021Updated 4 years ago
GAIR-NLP / Preference-Dissection
View on GitHub
☆25May 16, 2024Updated 2 years ago
safety-research / SHADE-Arena
View on GitHub
☆26Jun 22, 2025Updated last year
google-deepmind / nao_top10
View on GitHub
☆19Mar 1, 2023Updated 3 years ago
thestephencasper / explore_establish_exploit_llms
View on GitHub
☆31Jul 14, 2023Updated 3 years ago
tml-epfl / sharpness-vs-generalization
View on GitHub
A modern look at the relationship between sharpness and generalization [ICML 2023]
☆44Sep 11, 2023Updated 2 years ago
amy-hyunji / Generative-Multihop-Retrieval
View on GitHub
☆33Mar 31, 2023Updated 3 years ago
zomux / lanmt-ebm
View on GitHub
lanmt ebm
☆12Jun 19, 2020Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
jimimvp / torch_rl
View on GitHub
Reinforcement learning library for PyTorch.
☆11Jun 15, 2018Updated 8 years ago
kaistAI / InstructIR
View on GitHub
IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32Jun 13, 2024Updated 2 years ago
hbin0701 / Self-Explore
View on GitHub
[𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…
☆52May 4, 2024Updated 2 years ago
RUCBM / ICLEval
View on GitHub
☆14Jun 24, 2024Updated 2 years ago
DavisPL / PCCC
View on GitHub
Proof-carrying code completions in Dafny
☆11Apr 4, 2025Updated last year
facebookresearch / gen_dgrl
View on GitHub
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆29Apr 8, 2026Updated 3 months ago
sygi / vic-tensorflow
View on GitHub
Implementation of Variational Intrinsic Control in tensorflow
☆11Apr 5, 2017Updated 9 years ago