jcolano / DPO
Direct Preference Optimization Implementation
☆16 Updated last year
Alternatives and similar repositories for DPO:
Users who are interested in DPO are comparing it to the libraries listed below:
- Complete implementation of Llama2 with/without KV cache & inference ☆47 Updated 10 months ago
- Fine-tune an LLM to perform batch inference and online serving. ☆104 Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 9 months ago
- ☆22 Updated 6 months ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆105 Updated 6 months ago
- Sample notebooks and prompts for LLM evaluation ☆124 Updated 4 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin… ☆23 Updated last year
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and… ☆111 Updated last week
- LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries. ☆69 Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆109 Updated this week
- ☆143 Updated 8 months ago
- Building GPT ... ☆17 Updated 4 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆124 Updated last year
- Building a chatbot powered by a RAG pipeline to read, summarize and quote the most relevant papers related to the user query. ☆166 Updated 11 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆101 Updated last week
- A collection of fine-tuning notebooks! ☆26 Updated last year
- minimal GRPO implementation from scratch ☆72 Updated 3 weeks ago
- Mistral + Haystack: build RAG pipelines that rock ☆103 Updated last year
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆101 Updated 11 months ago
- Optimized Large Language Models for Financial Applications: Efficient, Scalable, and Domain-Specific AI for Finance. ☆46 Updated last week
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data ☆68 Updated last year
- An index of all of our weekly concepts + code events for aspiring AI Engineers and Business Leaders!! ☆67 Updated this week
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems ☆100 Updated 2 months ago
- ☆34 Updated 9 months ago
- ☆56 Updated 4 months ago
- ☆20 Updated 5 months ago
- Repository containing awesome resources regarding Hugging Face tooling. ☆46 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆77 Updated 6 months ago
- This repository will contain the presentation and Python Jupyter notebooks for the DataHack Summit 2024 conference talk, Improving Real-w… ☆107 Updated 6 months ago