allenai / FineGrainedRLHF
☆275 Updated 4 months ago
Alternatives and similar repositories for FineGrainedRLHF:
Users who are interested in FineGrainedRLHF are comparing it to the repositories listed below.
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO. ☆306 Updated 9 months ago
- RewardBench: the first evaluation tool for reward models. ☆562 Updated this week
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha… ☆123 Updated 11 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models ☆169 Updated 10 months ago
- A large-scale, fine-grained, diverse preference dataset (and models). ☆337 Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆260 Updated 7 months ago
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral) ☆262 Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆139 Updated 10 months ago
- All available datasets for Instruction Tuning of Large Language Models ☆250 Updated last year
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning ☆441 Updated 6 months ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning. ☆130 Updated last year
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022) ☆205 Updated 2 years ago
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning ☆395 Updated 11 months ago
- RLHF implementation details of OAI's 2019 codebase ☆186 Updated last year
- Self-Alignment with Principle-Following Reward Models ☆160 Updated last year
- Collection of papers for scalable automated alignment. ☆89 Updated 6 months ago
- ☆150 Updated 4 months ago
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA ☆212 Updated last year
- ☆137 Updated 5 months ago
- Simple next-token-prediction for RLHF ☆225 Updated last year
- ☆328 Updated 3 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆139 Updated 6 months ago
- ☆174 Updated 9 months ago
- DSIR large-scale data selection framework for language model training ☆246 Updated last year
- Source Code of Paper "GPTScore: Evaluate as You Desire" ☆246 Updated 2 years ago
- A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval. ☆356 Updated last year
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ☆162 Updated last year
- ☆66 Updated last year
- ☆45 Updated 2 months ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models" ☆482 Updated 3 months ago