This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).
☆20Feb 17, 2025Updated last year
Alternatives and similar repositories for odpo
Users that are interested in odpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Nov 19, 2023Updated 2 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- Code Repository for "Please Mind the Root: Decoding Arborescences for Dependency Parsing" and "On Finding the K-best Non-projective Depen…☆20Dec 12, 2022Updated 3 years ago
- [EMNLP 2024] Enhancing High-order Interaction Awareness in LLM-based Recommender Model.☆13Jan 9, 2025Updated last year
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Automatic Generation of Scaffolding Questions for Learning Math, EMNLP 2022. RL, REINFORCE☆25Jun 30, 2023Updated 2 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 8 months ago
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- 统计微信朋友圈送出的赞票与得到的赞票人员比例☆11May 3, 2016Updated 10 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 5 months ago
- All-in-one benchmarking platform for evaluating LLM.☆15Nov 12, 2025Updated 7 months ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Source code of NeurIPS 2022 paper “Co-Modality Graph Contrastive Learning for Imbalanced Node Classification”☆21Jan 15, 2023Updated 3 years ago
- Noise Reduction Methods for Distantly Supervised Biomedical Relation Extraction☆11Oct 25, 2017Updated 8 years ago
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 9 months ago
- Neo4j 大规模 三元组 CVS 导入进数据库☆11Jul 31, 2020Updated 5 years ago
- Manipulate tensors with PackedSequence and CattedSequence☆12Jan 4, 2026Updated 5 months ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 3 years ago
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"☆12Dec 8, 2024Updated last year
- A repo to design basic Policy Gradient labs☆12Jul 6, 2023Updated 2 years ago
- Repository containing the open source code of works published at the FBK MT unit.☆60Mar 19, 2026Updated 2 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆11Sep 5, 2025Updated 9 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆51Oct 23, 2024Updated last year
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆32Jan 7, 2026Updated 5 months ago
- Leveraging Local and Global Patterns for Self-Attention Networks☆12Jun 3, 2019Updated 7 years ago
- A Python gradient-descent implementation of the Neighborhood Components Analysis (NCA) method for metric learning.☆16Jan 10, 2017Updated 9 years ago
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated 2 years ago
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Source code and dataset for ECML-PKDD 2020 paper "Hierarchical Interaction Networks with Rethinking Mechanism for Document-level Sentimen…☆11Jul 28, 2020Updated 5 years ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- Generic PyTorch implementation of einsum that supports different semirings☆50Dec 4, 2025Updated 6 months ago
- Classifying Relations by Ranking with Convolutional Neural Networks☆12May 22, 2019Updated 7 years ago
- Play games in the OpenAI gym using the keyboard☆16Nov 21, 2017Updated 8 years ago
- ☆11Feb 2, 2023Updated 3 years ago
- Multicultural Proverbs and Sayings☆13Jan 11, 2025Updated last year
- ☆10May 31, 2021Updated 5 years ago