voidful / TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (bloomz-176B/bloom/gpt/bart/T5/MetaICL)
☆543Updated 6 months ago
Related projects
Alternatives and complementary repositories for TextRL
- Code repository for supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03…☆517Updated 11 months ago
- A modular RL library to fine-tune language models to human preferences☆2,213Updated 8 months ago
- Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆457Updated 2 years ago
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆782Updated 4 months ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆169Updated last year
- ☆259Updated 11 months ago
- Accompanying repo for the RLPrompt paper☆300Updated 5 months ago
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆208Updated 6 months ago
- [NIPS2023] RRHF & Wombat☆798Updated last year
- Expanding natural instructions☆959Updated 11 months ago
- Reading list of Instruction-tuning. A trend starts from Natural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).☆756Updated last year
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI☆478Updated 6 months ago
- Crosslingual Generalization through Multitask Finetuning☆516Updated last month
- Original Implementation of Prompt Tuning from Lester et al., 2021☆657Updated 5 months ago
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,232Updated last year
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆466Updated 8 months ago
- A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.☆317Updated last year
- This repository contains a collection of papers and resources on Reasoning in Large Language Models.☆543Updated last year