ashishjamarkattel / reinforment-learning-with-human-feedback
☆16 · Updated last year
Alternatives and similar repositories for reinforment-learning-with-human-feedback:
Users who are interested in reinforment-learning-with-human-feedback are comparing it to the libraries listed below.
- ☆80 · Updated last year
- Notes and commented code for RLHF (PPO) ☆79 · Updated last year
- a simplified version of Meta's Llama 3 model to be used for learning ☆41 · Updated 10 months ago
- Tutorial for how to build BERT from scratch ☆91 · Updated 10 months ago
- Welcome to the LLMs Interview Prep Guide! This GitHub repository offers a curated set of interview questions and answers tailored for Dat… ☆129 · Updated last year
- A simplified LLAMA implementation for training and inference tasks. ☆30 · Updated 4 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 · Updated last year
- Direct Preference Optimization from scratch in PyTorch ☆89 · Updated last year
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch ☆99 · Updated last year
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv… ☆12 · Updated 6 months ago
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`. ☆51 · Updated last year
- Distributed training (multi-node) of a Transformer model ☆63 · Updated 11 months ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗 ☆212 · Updated 4 years ago
- A (somewhat) minimal library for finetuning language models with PPO on human feedback. ☆85 · Updated 2 years ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀 ☆47 · Updated 10 months ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems ☆98 · Updated 2 months ago
- Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI m… ☆215 · Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆73 · Updated 5 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆124 · Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆137 · Updated 9 months ago
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra… ☆69 · Updated last year
- 1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition ☆174 · Updated 10 months ago
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA ☆81 · Updated last year
- ☆175 · Updated last year
- Define Transformers, T5 model and RoBERTa Encoder decoder model for product names generation ☆48 · Updated 3 years ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages ☆105 · Updated 5 months ago
- LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA ☆208 · Updated last year
- ☆17 · Updated 2 months ago
- Official implementation of paper "Autonomous Data Selection with Language Models for Mathematical Texts" (As Huggingface Daily Papers: ht… ☆80 · Updated 4 months ago
- ☆45 · Updated 3 years ago