conceptofmind / LaMDA-rlhf-pytorch

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

☆471

Alternatives and similar repositories for LaMDA-rlhf-pytorch

Users who are interested in LaMDA-rlhf-pytorch are comparing it to the libraries listed below.

bigscience-workshop / xmtf
Crosslingual Generalization through Multitask Finetuning
☆537 Updated 10 months ago

mallorbc / Finetune_LLMs
Repo for fine-tuning Causal LLMs
☆457 Updated last year

zphang / minimal-llama
☆458 Updated last year

lucidrains / PaLM-pytorch
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
☆823 Updated 2 years ago

huggingface / transformers-bloom-inference
Fast Inference Solutions for BLOOM
☆563 Updated 9 months ago

Xirider / finetune-gpt2xl
Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…
☆437 Updated 2 years ago

voidful / TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-…
☆562 Updated last year

declare-lab / flan-alpaca
This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…
☆352 Updated 2 years ago

ypeleg / llama
User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.
☆339 Updated 2 years ago

linhduongtuan / BLOOM-LORA
Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…
☆184 Updated 2 years ago

xrsrke / instructGOOSE
Implementation of Reinforcement Learning from Human Feedback (RLHF)
☆171 Updated 2 years ago

microsoft / GODEL
Large-scale pretrained models for goal-directed dialog
☆876 Updated last year

feizc / MLE-LLaMA
Multi-language Enhanced LLaMA
☆301 Updated 2 years ago

henrywoo / minichatgpt
minichatgpt - To Train ChatGPT In 5 Minutes
☆169 Updated 2 years ago

bigscience-workshop / t-zero
Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
☆463 Updated 2 years ago

allenai / natural-instructions
Expanding natural instructions
☆1,010 Updated last year

galatolofederico / vanilla-llama
Plain pytorch implementation of LLaMA
☆188 Updated 2 years ago

LAION-AI / Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
☆208 Updated last year

lucidrains / RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
☆870 Updated last year

bigcode-project / Megatron-LM
Ongoing research training transformer models at scale
☆390 Updated 11 months ago

conceptofmind / PaLM
An open-source implementation of Google's PaLM models
☆820 Updated last year

GanjinZero / RRHF
[NIPS2023] RRHF & Wombat
☆811 Updated last year

hpcaitech / PaLM-colossalai
Scalable PaLM implementation in PyTorch
☆190 Updated 2 years ago

HazyResearch / ama_prompting
Ask Me Anything language model prompting
☆547 Updated 2 years ago

yxuansu / OpenAlpaca
OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA
☆302 Updated 2 years ago

huggingface / large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training large language models
☆478 Updated 2 years ago

google-research / FLAN
☆1,532 Updated 3 weeks ago

ethanyanjiali / minChatGPT
A minimum example of aligning language models with RLHF similar to ChatGPT
☆220 Updated last year

jackaduma / Vicuna-LoRA-RLHF-PyTorch
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…
☆219 Updated last year

bigscience-workshop / bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
☆1,006 Updated last year