rycolab/odpo

This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).

☆21

Alternatives and similar repositories for odpo

Users who are interested in odpo are comparing it to the libraries listed below.

YuxianMeng / CorefQA-pytorch
A PyTorch implementation of the CorefQA Model.
☆10 • Jun 27, 2020 • Updated 6 years ago
lyutyuh / structured-span-selector
A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…
☆21 • Jul 11, 2022 • Updated 4 years ago
zjlww / dsp
Digital Speech Processing in PyTorch.
☆15 • Aug 12, 2022 • Updated 3 years ago
Zechao-Guan / TopoDiT-3D
☆15 • May 13, 2025 • Updated last year
psky1111 / Tencent-TSSR
Official implementation of TSSR
☆16 • Mar 5, 2026 • Updated 4 months ago
maxreciprocate / offline
Offline RL experiments
☆15 • Oct 1, 2022 • Updated 3 years ago
CD-link / XSpecMesh
Official implementation of the paper: “XSpecMesh: Quality-Preserving Auto-Regressive Mesh Generation Acceleration via Multi-Head Speculat…
☆17 • Aug 7, 2025 • Updated 11 months ago
isri-aist / jvrc_mj_description
JVRC1 model files for MuJoCo
☆11 • Mar 24, 2026 • Updated 4 months ago
Acciente717 / check_docx_similarity
This tool uses a randomized algorithm to compute the pairwise similarity between .docx files in a specified folder.
☆15 • Jun 15, 2020 • Updated 6 years ago
thomasgauthier / LLM-self-play
Minimal implementation of the paper "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" (arXiv:2401.01335)
☆29 • Mar 1, 2024 • Updated 2 years ago
HarleyCoops / smolThinker-.5B
A Qwen .5B reasoning model trained on OpenR1-Math-220k
☆14 • Updated this week
cisnlp / mPLM-Sim
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
☆11 • Jan 19, 2024 • Updated 2 years ago
princeton-nlp / SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
☆956 • Feb 16, 2025 • Updated last year
TAU-VAILab / BlendedPC
Blended Point Cloud Diffusion for Localized Text-guided Shape Editing
☆17 • Jul 2, 2026 • Updated 3 weeks ago
sayakpaul / Adversarial-Examples-in-Deep-Learning
Shows how to create basic image adversaries, and train adversarially robust image classifiers (to some extent).
☆13 • Oct 14, 2020 • Updated 5 years ago
RL-VIG / DMNSP
[ICCV 2025] Official code of paper "Dynamic Multi-Layer Null Space Projection for Vision-Language Continual Learning"
☆27 • Sep 8, 2025 • Updated 10 months ago
intelligent-control-lab / StableLego
☆19 • Mar 10, 2026 • Updated 4 months ago
NAVER-INTEL-Co-Lab / gaudi-lavcap
☆15 • Jan 24, 2025 • Updated last year
jacky121298 / WLST
[ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection
☆12 • Feb 6, 2024 • Updated 2 years ago
ysy-phoenix / evalhub
All-in-one benchmarking platform for evaluating LLMs.
☆15 • Nov 12, 2025 • Updated 8 months ago
sony / mmaudiosep
☆16 • Apr 30, 2026 • Updated 2 months ago
MikaStars39 / StableMask
PyTorch implementation of StableMask (ICML'24)
☆15 • Jun 27, 2024 • Updated 2 years ago
speedcell4 / torchrua
Manipulate tensors with PackedSequence and CattedSequence
☆12 • Jan 4, 2026 • Updated 6 months ago
ai-wand / concise-reasoning
Concise Reasoning via Reinforcement Learning
☆13 • Apr 16, 2025 • Updated last year
ethanliuzhuo / Neo4j_Knowledge_Graph_csv_import
Import large-scale triples from CSV files into a Neo4j database
☆11 • Jul 31, 2020 • Updated 5 years ago
sail-sg / offbench
☆16 • Jun 1, 2023 • Updated 3 years ago
thu-ml / Noise-Contrastive-Alignment
Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)
☆59 • Nov 8, 2024 • Updated last year
rbawden / mt-bigscience
Evaluation results for Machine Translation within the BigScience project
☆11 • May 15, 2023 • Updated 3 years ago
MaxyLee / 3AM
Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"
☆12 • Dec 8, 2024 • Updated last year
NUS-HPC-AI-Lab / pytorch-lamb
PyTorch implementation of LAMB for ImageNet/ResNet-50 training
☆13 • May 13, 2021 • Updated 5 years ago
GeorgeVern / smala
Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".
☆13 • Sep 17, 2021 • Updated 4 years ago
princeton-nlp / unintentional-unalignment
[ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
☆32 • Jan 7, 2026 • Updated 6 months ago
GeorgeVern / lmcor
Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"
☆12 • Apr 20, 2024 • Updated 2 years ago
Jumpat / tigon
Official repository of Text-Image Conditioned 3D Generation (TIGON, CVPR 2026)
☆27 • Updated this week
scewiner / Leveraging
Leveraging Local and Global Patterns for Self-Attention Networks
☆12 • Jun 3, 2019 • Updated 7 years ago
terminal-agent / reptile
💻 Terminal-Agent with Human-in-the-Loop Learning
☆41 • Jan 16, 2026 • Updated 6 months ago
Unbabel / smaug
Python package to augment multilingual data
☆15 • Feb 15, 2023 • Updated 3 years ago
thu-ml / Efficient-Diffusion-Alignment
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆15 • Oct 29, 2024 • Updated last year
erlendd / Neighborhood-Components-Analysis-in-Python
A Python gradient-descent implementation of the Neighborhood Components Analysis (NCA) method for metric learning.
☆16 • Jan 10, 2017 • Updated 9 years ago