ashishjamarkattel / reinforment-learning-with-human-feedback
☆15Updated last year
Alternatives and similar repositories for reinforment-learning-with-human-feedback
Users that are interested in reinforment-learning-with-human-feedback are comparing it to the libraries listed below
- Notes and commented code for RLHF (PPO)☆99Updated last year
- ☆84Updated last year
- Tutorial for how to build BERT from scratch☆96Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆180Updated last year
- Distributed training (multi-node) of a Transformer model☆75Updated last year
- a simplified version of Meta's Llama 3 model to be used for learning☆41Updated last year
- Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI m…☆224Updated 2 years ago
- Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectu…☆21Updated 11 months ago
- Welcome to the LLMs Interview Prep Guide! This GitHub repository offers a curated set of interview questions and answers tailored for Dat…☆145Updated last year
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)☆111Updated 2 months ago
- Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.☆43Updated last year
- LLM (Large Language Model) FineTuning☆546Updated 3 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆111Updated 2 years ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗☆216Updated 4 years ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed☆18Updated last year
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆172Updated 11 months ago
- Direct Preference Optimization from scratch in PyTorch☆102Updated 3 months ago
- Building LLaMA 4 MoE from Scratch☆57Updated 3 months ago
- A simplified LLAMA implementation for training and inference tasks.☆32Updated 2 weeks ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated last year
- Research projects built on top of Transformers☆65Updated 4 months ago
- A (somewhat) minimal library for finetuning language models with PPO on human feedback.☆85Updated 2 years ago
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA☆219Updated 2 years ago
- Scripts for fine-tuning Llama2 via SFT and DPO.☆200Updated last year
- minimal GRPO implementation from scratch☆92Updated 4 months ago
- ☆54Updated 5 months ago
- 1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition☆198Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated 9 months ago
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆295Updated 2 years ago
- This repository contains exhaustive coverage of a hands-on approach to PyTorch alongside powerful tools to accelerate model tuning an…☆123Updated 3 weeks ago