openai / following-instructions-human-feedback
☆1,195Updated 2 years ago
Alternatives and similar repositories for following-instructions-human-feedback:
Users that are interested in following-instructions-human-feedback are comparing it to the libraries listed below
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,682Updated last year
- Code for "Learning to summarize from human feedback"☆1,006Updated last year
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,282Updated last year
- Expanding natural instructions☆975Updated last year
- A modular RL library to fine-tune language models to human preferences☆2,276Updated 11 months ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,587Updated last year
- Dromedary: towards helpful, ethical and reliable LLMs.☆1,138Updated last year
- ☆1,497Updated last week
- Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.☆471Updated 11 months ago
- ☆443Updated last year
- Large-scale pretrained models for goal-directed dialog☆862Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆794Updated 7 months ago
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆461Updated 2 years ago
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆1,907Updated last year
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.☆760Updated 3 months ago
- Ask Me Anything language model prompting☆544Updated last year
- Original Implementation of Prompt Tuning from Lester, et al, 2021☆667Updated 2 months ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆820Updated 2 years ago
- ☆1,192Updated last year
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models☆2,962Updated 7 months ago
- Toolkit for creating, sharing and using natural language prompts.☆2,773Updated last year
- TruthfulQA: Measuring How Models Imitate Human Falsehoods☆672Updated last month
- ☆728Updated 8 months ago
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI☆2,002Updated 7 months ago
- Alpaca dataset from Stanford, cleaned and curated☆1,537Updated last year
- Reflexion: an autonomous agent with dynamic memory and self-reflection☆384Updated last year
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,623Updated last year
- PaL: Program-Aided Language Models (ICML 2023)☆481Updated last year
- ☆229Updated 2 years ago
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".☆1,109Updated last year