jcolano / DPO
Direct Preference Optimization Implementation
☆17 Updated last year
Alternatives and similar repositories for DPO
Users who are interested in DPO are comparing it to the repositories listed below.
- Complete implementation of Llama2 with/without KV cache & inference ☆48 Updated last year
- LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries. ☆69 Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆136 Updated this week
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆110 Updated 10 months ago
- Fine-tune an LLM to perform batch inference and online serving. ☆112 Updated 2 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆125 Updated last year
- ☆28 Updated 10 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin… ☆23 Updated last year
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and… ☆124 Updated 3 weeks ago
- Sample notebooks and prompts for LLM evaluation ☆138 Updated 2 months ago
- A comprehensive deep dive into the world of tokens ☆226 Updated last year
- Notes from the Latent Space paper club. Follow along or start your own! ☆235 Updated last year
- Just some stuff for interview questions, books, annotated papers, notes, cheat sheets, etc. related to ML, AI, Deep Learning and Data Sc… ☆118 Updated 3 months ago
- ☆145 Updated last year
- Material for the series of seminars on Large Language Models ☆34 Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs ☆314 Updated last month
- A collection of fine-tuning notebooks! ☆28 Updated last year
- A repository containing general tutorials I'd like to share with the world. ☆46 Updated last month
- Curated list of weekly published LLM papers ☆184 Updated 2 weeks ago
- Repository for ACM India Summer School on Generative AI for Text ☆13 Updated last year
- Resources relating to the DLAI event: https://www.youtube.com/watch?v=eTieetk2dSw ☆188 Updated 2 years ago
- ☆26 Updated 4 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆113 Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated last year
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages ☆107 Updated 10 months ago
- Following emerging Large Language Model Operations (LLM Ops) best practices in the industry, you'll learn all about the key technologies … ☆277 Updated last year
- awesome synthetic (text) datasets ☆293 Updated last month
- Building GPT ... ☆18 Updated 8 months ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM" ☆61 Updated 10 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆285 Updated 5 months ago