openai / following-instructions-human-feedbackLinks

☆1,231

Alternatives and similar repositories for following-instructions-human-feedback

Users that are interested in following-instructions-human-feedback are comparing it to the libraries listed below

Sorting:

openai / lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
☆1,350Updated 2 years ago
openai / summarize-from-feedback
Code for "Learning to summarize from human feedback"
☆1,036Updated last year
anthropics / hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
☆1,766Updated last month
lucidrains / PaLM-pytorch
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
☆823Updated 2 years ago
google-research / FLAN
☆1,532Updated 3 weeks ago
allenai / natural-instructions
Expanding natural instructions
☆1,010Updated last year
google / BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
☆3,091Updated last year
allenai / RL4LMs
A modular RL library to fine-tune language models to human preferences
☆2,332Updated last year
IBM / Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
☆1,148Updated 2 months ago
google-research / prompt-tuning
Original Implementation of Prompt Tuning from Lester, et al, 2021
☆689Updated 4 months ago
tatsu-lab / alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
☆821Updated last year
lucidrains / toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
☆2,042Updated last year
openai / prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
☆2,032Updated 2 years ago
openai / automated-interpretability
☆1,025Updated last year
bigscience-workshop / xmtf
Crosslingual Generalization through Multitask Finetuning
☆537Updated 10 months ago
openai / grade-school-math
☆1,303Updated last year
bigscience-workshop / promptsource
Toolkit for creating, sharing and using natural language prompts.
☆2,914Updated last year
conceptofmind / LaMDA-rlhf-pytorch
Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.
☆471Updated last year
HazyResearch / ama_prompting
Ask Me Anything language model prompting
☆547Updated 2 years ago
CarperAI / trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,688Updated last year
ruixiangcui / AGIEval
☆758Updated last year
lucidrains / RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
☆870Updated last year
conceptofmind / PaLM
An open-source implementation of Google's PaLM models
☆820Updated last year
bigscience-workshop / t-zero
Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
☆463Updated 2 years ago
bigscience-workshop / bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
☆1,006Updated last year
microsoft / GODEL
Large-scale pretrained models for goal-directed dialog
☆876Updated last year
booydar / recurrent-memory-transformer
[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
☆766Updated 9 months ago
yaodongC / awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
☆1,127Updated last year
reasoning-machines / pal
PaL: Program-Aided Language Models (ICML 2023)
☆502Updated 2 years ago
pengbaolin / LLM-Augmenter
☆444Updated 2 years ago