papers related to Direct Preference Optimization(DPO)
☆20Jul 16, 2024Updated last year
Alternatives and similar repositories for awesome-DPO
Users that are interested in awesome-DPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Arduino Library for ADXL362 Micropower 3-axis accelerometer☆18Nov 18, 2022Updated 3 years ago
- RLVR for LLMs in optimization modeling☆61Apr 15, 2026Updated 2 months ago
- ☆30Apr 28, 2026Updated 2 months ago
- [ECCV Workshop W-CODA 2024] Official code for ReGentS☆17Oct 22, 2024Updated last year
- [ICML 2023] Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optim…☆10Dec 19, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆35Jul 2, 2025Updated 11 months ago
- ☆13Jul 15, 2024Updated last year
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆58Aug 24, 2025Updated 10 months ago
- A client library for Rainbow Robotics' cobots☆18Apr 16, 2026Updated 2 months ago
- NLP☆14Oct 17, 2022Updated 3 years ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated last year
- ☆16May 22, 2025Updated last year
- From-Classification-to-Clinical☆13Apr 26, 2024Updated 2 years ago
- Algebraic value editing in pretrained language models☆70Nov 1, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A list of papers regarding generalization in (deep) reinforcement learning☆11Aug 13, 2023Updated 2 years ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs☆22Apr 24, 2025Updated last year
- 《多模态大模型部署微调指南》快速部署/微调多模态大模型☆14Dec 4, 2024Updated last year
- Record experiment data easily☆14Aug 13, 2022Updated 3 years ago
- FairGAN: GANs-based Fairness-aware Learning for Recommendations with Implicit Feedback☆15Oct 8, 2022Updated 3 years ago
- ☆18Oct 8, 2024Updated last year
- The collection of related papers and resources for the paper Time Series Analysis for Education: Methods, Applications, and Future Direct…☆20Apr 12, 2025Updated last year
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆17Apr 24, 2024Updated 2 years ago
- [SIGIR'25] Code of "Generative Recommender with End-to-End Learnable Item Tokenization".☆35Apr 17, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICLR 2026] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs☆46May 20, 2025Updated last year
- "AI Commit Message Tool uses AI to automatically generate concise and professional Git commit messages, which you can then edit and confi…☆14Jul 14, 2025Updated 11 months ago
- LLMAD code☆30Oct 31, 2024Updated last year
- ☆81Jun 8, 2026Updated 3 weeks ago
- ☆17Aug 1, 2025Updated 10 months ago
- Examples from the book Introduction to the Practice of Statistics☆22Mar 4, 2026Updated 3 months ago
- ☆29Jul 16, 2024Updated last year
- A Survey of Direct Preference Optimization (DPO)☆96Jul 4, 2025Updated 11 months ago
- ☆17Sep 5, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of paper "Do Wide and Deep Networks Learn the Same Things?"☆16Mar 15, 2022Updated 4 years ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆22Feb 26, 2025Updated last year
- A MBTI test on Large Language Model like GPT-3.☆28May 2, 2022Updated 4 years ago
- ☆18Jun 30, 2023Updated 2 years ago
- Scaling Agentic Environments Automatically.☆66Mar 26, 2026Updated 3 months ago
- Official code implementation of SKU, Accepted by ACL 2024 Findings☆20Dec 18, 2024Updated last year
- This repository hosts the DataAssistant, a robust Python class designed to integrate seamlessly with OpenAI's API. It facilitates the cre…☆13Jul 2, 2024Updated last year