jcolano / DPOLinks
Direct Preference Optimization Implementation
β17Updated last year
Alternatives and similar repositories for DPO
Users that are interested in DPO are comparing it to the libraries listed below
Sorting:
- Complete implementation of Llama2 with/without KV cache & inference πβ48Updated last year
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.β69Updated last year
- Building GPT ...β18Updated 10 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.β126Updated 2 years ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various useβ¦β144Updated last week
- Just some stuff for Interview questions, books, annotated paper, notes, cheat sheets etc etc related to ML,AI, Deep Learning and Data Scβ¦β118Updated 2 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creationβ110Updated last year
- β29Updated last year
- Sample notebooks and prompts for LLM evaluationβ151Updated last week
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility andβ¦β134Updated 2 months ago
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelfβ198Updated last year
- β146Updated last year
- Fine-tune an LLM to perform batch inference and online serving.β113Updated 4 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languagesβ203Updated last year
- Various installation guides for Large Language Modelsβ74Updated 6 months ago
- Notes from the Latent Space paper club. Follow along or start your own!β239Updated last year
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.β168Updated last year
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultinβ¦β23Updated last year
- AI algorithmsβ141Updated last year
- Following emerging Large Language Model Operations (LLM Ops) best practices in the industry, youβll learn all about the key technologies β¦β281Updated last year
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"β134Updated 2 weeks ago
- Toolkit for attaching, training, saving and loading of new heads for transformer modelsβ289Updated 7 months ago
- A collection of fine-tuning notebooks!β29Updated 2 years ago
- LLM Prompting Engineering Simplified Bookβ121Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAGβ329Updated 11 months ago
- Leetcode Intensive tutorial and study guide generated by llama-index, networkx, scikit-learn and pydanticβ114Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β50Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMsβ314Updated 3 months ago
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging coβ¦β113Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systemsβ135Updated 9 months ago