jcolano / DPOLinks
Direct Preference Optimization Implementation
β16Updated last year
Alternatives and similar repositories for DPO
Users that are interested in DPO are comparing it to the libraries listed below
Sorting:
- Complete implementation of Llama2 with/without KV cache & inference πβ47Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.β125Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creationβ110Updated 9 months ago
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.β69Updated last year
- Fine-tune an LLM to perform batch inference and online serving.β112Updated last month
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various useβ¦β123Updated this week
- β40Updated last month
- Sample notebooks and prompts for LLM evaluationβ135Updated last month
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelfβ198Updated last year
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultinβ¦β23Updated last year
- Just some stuff for Interview questions, books, annotated paper, notes, cheat sheets etc etc related to ML,AI, Deep Learning and Data Scβ¦β119Updated 2 months ago
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility andβ¦β124Updated 2 months ago
- awesome synthetic (text) datasetsβ286Updated last week
- Building GPT ...β18Updated 7 months ago
- Notebooks for fine tuning pali gemmaβ111Updated 3 months ago
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.β307Updated 3 months ago
- β27Updated 9 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated last year
- β144Updated 11 months ago
- Unlock the potential of finetuning Large Language Models (LLMs). Learn from industry expert, and discover when to apply finetuning, data β¦β60Updated last year
- Resources relating to the DLAI event: https://www.youtube.com/watch?v=eTieetk2dSwβ186Updated 2 years ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systemsβ123Updated 5 months ago
- Let's build better datasets, together!β260Updated 6 months ago
- β40Updated last year
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.β167Updated last year
- β204Updated last year
- Notes from the Latent Space paper club. Follow along or start your own!β234Updated 11 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ197Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalizationβ276Updated last year
- A comprehensive deep dive into the world of tokensβ224Updated last year