jcolano / DPO
Direct Preference Optimization Implementation
β16Updated last year
Alternatives and similar repositories for DPO:
Users that are interested in DPO are comparing it to the libraries listed below
- Complete implementation of Llama2 with/without KV cache & inference πβ47Updated 11 months ago
- Building GPT ...β17Updated 5 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creationβ107Updated 7 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.β124Updated last year
- A collection of fine-tuning notebooks!β27Updated last year
- Sample notebooks and prompts for LLM evaluationβ124Updated 2 weeks ago
- β143Updated 9 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various useβ¦β115Updated last week
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility andβ¦β118Updated 2 weeks ago
- Fine-tune an LLM to perform batch inference and online serving.β110Updated this week
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.β69Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 9 months ago
- Various installation guides for Large Language Modelsβ69Updated last week
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeedβ17Updated 11 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultinβ¦β23Updated last year
- β56Updated 5 months ago
- minimal GRPO implementation from scratchβ87Updated last month
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb datasetβ¦β14Updated last month
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systemsβ104Updated 3 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paperβ¦β102Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ102Updated last month
- β36Updated 9 months ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"β59Updated 6 months ago
- β23Updated 6 months ago
- A collection of hand on notebook for LLMs practitionerβ47Updated 3 months ago
- Prune transformer layersβ69Updated 11 months ago
- Set of scripts to finetune LLMsβ37Updated last year
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β76Updated 6 months ago
- Material for the series of seminars on Large Language Modelsβ34Updated last year
- β25Updated 3 weeks ago