jcolano / DPOLinks
Direct Preference Optimization Implementation
β16Updated last year
Alternatives and similar repositories for DPO
Users that are interested in DPO are comparing it to the libraries listed below
Sorting:
- Complete implementation of Llama2 with/without KV cache & inference πβ46Updated last year
- Fine-tune an LLM to perform batch inference and online serving.β111Updated 3 weeks ago
- LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.β69Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 10 months ago
- Sample notebooks and prompts for LLM evaluationβ128Updated last week
- A set of scripts and notebooks on LLM finetunning and dataset creationβ111Updated 8 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultinβ¦β23Updated last year
- Various installation guides for Large Language Modelsβ68Updated last month
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various useβ¦β118Updated 3 weeks ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systemsβ105Updated 4 months ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β77Updated 7 months ago
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.β167Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ99Updated last year
- A collection of fine-tuning notebooks!β27Updated last year
- β38Updated 10 months ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasksβ35Updated 11 months ago
- β143Updated 10 months ago
- This is the reproduction repository for my π€ Hugging Face blog post on synthetic dataβ68Updated last year
- Building GPT ...β17Updated 6 months ago
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging coβ¦β110Updated 10 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.β122Updated last year
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility andβ¦β122Updated last month
- Codebase accompanying the Summary of a Haystack paper.β78Updated 8 months ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"β59Updated 7 months ago
- β24Updated 7 months ago
- β87Updated last year
- β26Updated last month
- minimal GRPO implementation from scratchβ90Updated 2 months ago
- Just a bunch of benchmark logs for different LLMsβ119Updated 10 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ105Updated 2 months ago