andy-yangz / Awesome-RLHFView external linksLinks
Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD
☆23Dec 13, 2022Updated 3 years ago
Alternatives and similar repositories for Awesome-RLHF
Users that are interested in Awesome-RLHF are comparing it to the libraries listed below
Sorting:
- SIGIR 2021: Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals☆11Jul 30, 2021Updated 4 years ago
- [ACM MM 2022]: Multi-Modal Experience Inspired AI Creation☆21Nov 27, 2024Updated last year
- The official code of our paper at EMNLP 2022: Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Mo…☆16Feb 17, 2023Updated 3 years ago
- PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".☆24Sep 19, 2021Updated 4 years ago
- Embedding-based evaluation metrics for dialogue generation.☆15Jan 8, 2023Updated 3 years ago
- The sources codes of the DR-BERT model and baselines☆38Nov 17, 2021Updated 4 years ago
- TL;DR: We propose a large-scale cross-domain persuasion dataset covers 13,000 scenarios in 35 domains, with the developed PersuGPT model …☆17Feb 12, 2025Updated last year
- Pile Deduplication Code☆18May 15, 2023Updated 2 years ago
- [ACL'21] Dialogue Response Selection with Hierarchical Curriculum Learning☆21Nov 15, 2022Updated 3 years ago
- Paper, dataset and code list for multimodal dialogue.☆22Jan 2, 2025Updated last year
- The codebase for "Learning from Easy to Complex: Adaptive Multi-curricula Learning for Neural Dialogue Generation" (Cai et al., AAAI 2020…☆20Jun 18, 2024Updated last year
- A (somewhat) minimal library for finetuning language models with PPO on human feedback.☆90Nov 23, 2022Updated 3 years ago
- [MMM 2025 Best Paper] RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling☆22Aug 4, 2024Updated last year
- ☆21Aug 26, 2025Updated 5 months ago
- Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"☆57Sep 7, 2023Updated 2 years ago
- ☆63Jan 2, 2020Updated 6 years ago
- MultilingualShareGPT, the free multi-language corpus for LLM training☆73Apr 6, 2023Updated 2 years ago
- Code for ReviewRobot: Explainable Paper Review Generation based on Knowledge Synthesis☆30May 31, 2021Updated 4 years ago
- Open source implementation of InstructGPT (not finished)☆31Apr 13, 2023Updated 2 years ago
- ☆32Apr 24, 2024Updated last year
- Code, Models and Datasets for OpenViDial Dataset☆132Jan 22, 2022Updated 4 years ago
- An experimental implementation of the retrieval-enhanced language model☆75Dec 29, 2022Updated 3 years ago
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-…☆565May 9, 2024Updated last year
- A python library for making API calls to Bonsai BRAIN.☆14Oct 6, 2022Updated 3 years ago
- Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021☆95Jul 8, 2021Updated 4 years ago
- Python SIR-x model implementation☆10Dec 8, 2022Updated 3 years ago
- CBLUE 2/3 任务实现☆10Aug 1, 2024Updated last year
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Mar 8, 2023Updated 2 years ago
- The Document of WenLan API, which was used to obtain image and text feature.☆41Jan 10, 2023Updated 3 years ago
- Enable Comprehensive LLM Evaluation on Graph Reasoning☆75Jun 12, 2025Updated 8 months ago
- This repo contains codes and instructions for baselines in the VLUE benchmark.☆41Jul 16, 2022Updated 3 years ago
- ☆12Sep 8, 2020Updated 5 years ago
- Fault-Tolerant Pure Functional Programming Language For JVM☆10Apr 6, 2022Updated 3 years ago
- Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inference☆10Jul 10, 2023Updated 2 years ago
- Tools for Natural Language Processing☆12Feb 16, 2018Updated 8 years ago
- Low memory usage random access reader for csv and general files☆14Jun 16, 2022Updated 3 years ago
- ☆11Apr 10, 2023Updated 2 years ago
- ☆11Nov 23, 2024Updated last year
- A simple enigma machine in Go☆11Nov 14, 2022Updated 3 years ago