HumanSignal / RLHFLinks
Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI models
☆222Updated last year
Alternatives and similar repositories for RLHF
Users that are interested in RLHF are comparing it to the libraries listed below
Sorting:
- A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.☆367Updated last year
- Benchmarking library for RAG☆209Updated 2 weeks ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆268Updated last year
- Inquisitive Parrots for Search☆193Updated 3 weeks ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆135Updated last year
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆490Updated 8 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆186Updated 6 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆130Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆219Updated last year
- Reward Model framework for LLM RLHF☆61Updated 2 years ago
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆111Updated 11 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated 8 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆157Updated last year
- Scalable training for dense retrieval models.☆298Updated 2 weeks ago
- This repository contains a collection of papers and resources on Reasoning in Large Language Models.☆564Updated last year
- Fine-Tuning Embedding for RAG with Synthetic Data☆501Updated last year
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.☆69Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks☆130Updated 5 months ago
- ☆86Updated last year
- awesome synthetic (text) datasets☆282Updated 7 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆243Updated last year
- Scripts for fine-tuning Llama2 via SFT and DPO.☆201Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆112Updated 2 weeks ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆472Updated this week
- Notes and commented code for RLHF (PPO)☆96Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆111Updated 8 months ago
- Official repository for ORPO☆455Updated last year
- Generative Representational Instruction Tuning☆654Updated 3 months ago
- RewardBench: the first evaluation tool for reward models.☆604Updated 2 weeks ago
- ☆123Updated 8 months ago