This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).
☆18Feb 17, 2025Updated last year
Alternatives and similar repositories for odpo
Users that are interested in odpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Dec 13, 2022Updated 3 years ago
- ☆20Nov 19, 2023Updated 2 years ago
- ☆10Sep 28, 2018Updated 7 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- ☆12Dec 4, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- T-GD: Transferable GAN-generated Images Detection Framework. (ICML 2020)☆18May 12, 2021Updated 4 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆30Jan 10, 2026Updated 2 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 5 months ago
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- 统计微信朋友圈送出的赞票与得到的赞票人员比例☆11May 3, 2016Updated 9 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 3 months ago
- Code for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers☆32Jul 2, 2021Updated 4 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆39Jul 30, 2024Updated last year
- Source code of NeurIPS 2022 paper “Co-Modality Graph Contrastive Learning for Imbalanced Node Classification”☆21Jan 15, 2023Updated 3 years ago
- Noise Reduction Methods for Distantly Supervised Biomedical Relation Extraction☆11Oct 25, 2017Updated 8 years ago
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 7 months ago
- Neo4j 大规模 三元组 CVS 导入进数据库☆11Jul 31, 2020Updated 5 years ago
- Manipulate tensors with PackedSequence and CattedSequence☆12Jan 4, 2026Updated 2 months ago
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 11 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆948Feb 16, 2025Updated last year
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago
- Transfer learning for neural machine translation using cross-lingual word embeddings☆10Dec 17, 2025Updated 3 months ago
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"☆12Dec 8, 2024Updated last year
- A repo to design basic Policy Gradient labs☆12Jul 6, 2023Updated 2 years ago
- Repository containing the open source code of works published at the FBK MT unit.☆60Mar 19, 2026Updated last week
- Leveraging Local and Global Patterns for Self-Attention Networks☆12Jun 3, 2019Updated 6 years ago
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated last year
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Source code and dataset for ECML-PKDD 2020 paper "Hierarchical Interaction Networks with Rethinking Mechanism for Document-level Sentimen…☆10Jul 28, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- The official repository of Continuous Memory Representation for Anomaly Detection☆33Nov 27, 2024Updated last year
- Generic PyTorch implementation of einsum that supports different semirings☆50Dec 4, 2025Updated 3 months ago
- Classifying Relations by Ranking with Convolutional Neural Networks☆12May 22, 2019Updated 6 years ago
- ☆46Oct 26, 2021Updated 4 years ago
- NMT with ssp☆11Oct 28, 2021Updated 4 years ago