HumanSignal / RLHFLinks

Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI models

☆223

Alternatives and similar repositories for RLHF

Users that are interested in RLHF are comparing it to the libraries listed below

Sorting:

tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
☆110Updated last year
microsoft / llm-data-creation
Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"
☆134Updated 2 years ago
pacman100 / LLM-Workshop
LLM Workshop by Sourab Mangrulkar
☆394Updated last year
alopatenko / LLMEvaluation
A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…
☆144Updated 2 weeks ago
rashmimarganiatgithub / LLMS_Library_2023
LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.
☆69Updated last year
patronus-ai / Lynx-hallucination-detection
☆43Updated last year
ayulockin / neurips-llm-efficiency-challenge
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
☆126Updated 2 years ago
davanstrien / awesome-synthetic-datasets
awesome synthetic (text) datasets
☆302Updated 3 months ago
rajshah4 / LLM-Evaluation
Sample notebooks and prompts for LLM evaluation
☆151Updated 2 weeks ago
xfactlab / orpo
Official repository for ORPO
☆464Updated last year
center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆289Updated 7 months ago
huggingface / data-is-better-together
Let's build better datasets, together!
☆262Updated 10 months ago
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆78Updated last year
glgh / awesome-llm-human-preference-datasets
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
☆381Updated 2 years ago
jongjyh / TrFr
Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning
☆46Updated last year
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆273Updated last year
huggingface / llm_training_handbook
An open collection of methodologies to help with successful training of large language models.
☆536Updated last year
SALT-NLP / demonstrated-feedback
☆128Updated last year
FourthBrain / Building-with-Instruction-Tuned-LLMs-A-Step-by-Step-Guide
Resources relating to the DLAI event: https://www.youtube.com/watch?v=eTieetk2dSw
☆188Updated 2 years ago
qcri / LLMeBench
Benchmarking Large Language Models
☆100Updated 4 months ago
tomekkorbak / pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences
☆180Updated last year
explodinggradients / nemesis
Reward Model framework for LLM RLHF
☆61Updated 2 years ago
ibm-self-serve-assets / SuperKnowa
Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…
☆113Updated last year
mzbac / llama2-fine-tune
Scripts for fine-tuning Llama2 via SFT and DPO.
☆204Updated 2 years ago
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256Updated 2 years ago
huggingface / large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training large language models
☆482Updated 2 years ago
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆107Updated last year
rasbt / LLM-finetuning-scripts
☆216Updated last year
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆250Updated 11 months ago
sileod / tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
☆188Updated 3 months ago