This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).
☆18Feb 17, 2025Updated last year
Alternatives and similar repositories for odpo
Users that are interested in odpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Dec 13, 2022Updated 3 years ago
- ☆20Nov 19, 2023Updated 2 years ago
- ☆10Sep 28, 2018Updated 7 years ago
- A PyTorch implementation of the CorefQA Model.☆10Jun 27, 2020Updated 5 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code Repository for "Please Mind the Root: Decoding Arborescences for Dependency Parsing" and "On Finding the K-best Non-projective Depen…☆20Dec 12, 2022Updated 3 years ago
- A library for constrained RLHF.☆13Feb 19, 2024Updated 2 years ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- 本工具采用随机算法计算指定文件夹内两两 .docx 文件间的相似性。☆15Jun 15, 2020Updated 5 years ago
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.☆56Aug 13, 2024Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Mar 1, 2024Updated 2 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆31Jan 10, 2026Updated 3 months ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- Code for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers☆32Jul 2, 2021Updated 4 years ago
- ☆15Feb 26, 2025Updated last year
- Noise Reduction Methods for Distantly Supervised Biomedical Relation Extraction☆11Oct 25, 2017Updated 8 years ago
- Neo4j 大规模 三元组 CVS 导入进数据库☆11Jul 31, 2020Updated 5 years ago
- My Java implementation of RSA encryption.☆18Aug 28, 2013Updated 12 years ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆948Feb 16, 2025Updated last year
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Basic PyTorch Implementation of 'Neural Architecture Search with Reinforcement Learning' (https://arxiv.org/abs/1611.01578)☆13Feb 24, 2018Updated 8 years ago
- Create single file standalone Python scripts with builtin frozen file system☆18Apr 9, 2022Updated 4 years ago
- A repo to design basic Policy Gradient labs☆12Jul 6, 2023Updated 2 years ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆51Oct 23, 2024Updated last year
- Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"☆21Jan 31, 2026Updated 2 months ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆32Jan 7, 2026Updated 3 months ago
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Source code and dataset for ECML-PKDD 2020 paper "Hierarchical Interaction Networks with Rethinking Mechanism for Document-level Sentimen…☆10Jul 28, 2020Updated 5 years ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Aug 26, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- NMT with ssp☆11Oct 28, 2021Updated 4 years ago
- ☆11Feb 2, 2023Updated 3 years ago
- Multicultural Proverbs and Sayings☆13Jan 11, 2025Updated last year
- ☆13Oct 13, 2022Updated 3 years ago
- TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes☆14Jul 1, 2025Updated 9 months ago
- ☆14Apr 18, 2020Updated 5 years ago