[EMNLP 2024] Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lower-quality samples compared to those generated by the learning model
☆16Nov 27, 2024Updated last year
Alternatives and similar repositories for filtered-dpo
Users that are interested in filtered-dpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated last year
- ヒューリスティック探索入門☆20Dec 9, 2023Updated 2 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated 2 years ago
- Code for magnetic mirror descent.☆18Oct 5, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling☆17Nov 20, 2025Updated 6 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆101Aug 20, 2024Updated last year
- Documentation at☆14Mar 27, 2025Updated last year
- ☆14Aug 10, 2023Updated 2 years ago
- Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning☆14Aug 12, 2021Updated 4 years ago
- docker for UTH-BERT: https://ai-health.m.u-tokyo.ac.jp/uth-bert☆14Mar 24, 2023Updated 3 years ago
- Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202…☆13Oct 27, 2024Updated last year
- PyTorch implementation of Count-Based Exploration with Neural Density Models☆10Mar 22, 2018Updated 8 years ago
- [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models☆11Sep 19, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- ☆23Aug 10, 2022Updated 3 years ago
- Learning Safety Constraints for Large Language Models (ICML2025)☆35May 25, 2026Updated 3 weeks ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Dec 8, 2022Updated 3 years ago
- Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling☆49Apr 19, 2026Updated last month
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆41Apr 13, 2026Updated 2 months ago
- RIBES is an automatic evaluation metric for machine translation.☆13Sep 7, 2017Updated 8 years ago
- fast api with machine learning☆10Apr 23, 2023Updated 3 years ago
- Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation☆23Sep 24, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Reasoning-based Evaluation and Ranking of Translations.☆20Jun 2, 2026Updated last week
- [XLLM@ACL2025] Official Code for "Less is More: Enhancing Structured Multi-Agent Reasoning via Quality-Guided Distillation"☆22Jul 29, 2025Updated 10 months ago
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated 2 years ago
- The official Python SDK for FastLabel API, the Data Platform for AI☆16Jun 1, 2026Updated 2 weeks ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆30Aug 9, 2025Updated 10 months ago
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP☆15Mar 24, 2021Updated 5 years ago
- Store a better zsh history and search via interactive grep☆17Aug 25, 2020Updated 5 years ago
- Inference-time alignment for harmlessness through cross-model guidance (ACL 2024). Code + MM-Harmful Bench.☆38Oct 2, 2024Updated last year
- Ext-Oracle Summarization: extractive summarization that maximize ROUGE w.r.t. target☆11Jun 24, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ウェブサイト「サンプルで学ぶ Go 言語」のソースコード☆17Aug 17, 2024Updated last year
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".☆10Apr 30, 2023Updated 3 years ago
- [ACMMM 2025] ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆22Jun 20, 2025Updated 11 months ago
- The Prism Alignment Project☆93Apr 25, 2024Updated 2 years ago
- Codes for ICCV 2021 paper "AGKD-BML: Defense Against Adversarial Attack by Attention Guided Knowledge Distillation and Bi-directional Met…☆12Mar 3, 2022Updated 4 years ago
- ☆14Mar 1, 2019Updated 7 years ago
- Using conversational games to evaluate powerful LLMs☆18Sep 3, 2023Updated 2 years ago