This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.
☆57Aug 13, 2024Updated last year
Alternatives and similar repositories for CPO_SIMPO
Users that are interested in CPO_SIMPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆954Feb 16, 2025Updated last year
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers☆14Jun 7, 2024Updated last year
- This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).☆19Feb 17, 2025Updated last year
- ☆25Oct 22, 2022Updated 3 years ago
- DPO, but faster 🚀☆52Dec 6, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is the repository for our EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".☆10Jun 2, 2023Updated 2 years ago
- ☆14Oct 11, 2023Updated 2 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Masking tokens to modify the predictions of a pretrained sentence classifier☆16Feb 4, 2020Updated 6 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Mar 3, 2025Updated last year
- State-of-the-art LLM-based translation models.☆584Apr 9, 2025Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- Scalable, structured, dynamically-scheduled hyperparameter optimization.☆19Oct 13, 2022Updated 3 years ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆78Aug 17, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- DyNet implementation of stack LSTM experiments by Grefenstette et al.☆21Oct 6, 2017Updated 8 years ago
- ☆16Feb 6, 2024Updated 2 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- ☆32Jul 11, 2024Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated 11 months ago
- Mirror for Java and PHP libraries and text resources to facilitate the use of Inuktitut in its written form on computers and the web☆10Aug 2, 2015Updated 10 years ago
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- ☆18Jul 30, 2018Updated 7 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆12Aug 15, 2023Updated 2 years ago
- String Distance using cython☆13Jan 19, 2020Updated 6 years ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆32Jan 7, 2026Updated 3 months ago
- ☆12Jun 19, 2024Updated last year
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 7 months ago
- ☆51Oct 28, 2024Updated last year
- Code for paper 《Drug-Drug Interaction Extraction via Recurrent Hybrid Convolutional Neural Networks with an Improved Focal Loss》☆14Mar 12, 2019Updated 7 years ago
- Evaluating Reward Models in Multilingual Settings (ACL Main '25)☆42May 16, 2025Updated 11 months ago
- 2024CCF国际AIOps挑战赛-赛道二(GLM4):基于检索增强的运维知识问答挑战赛解决方案分享。☆14Jul 5, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A pipeline for the automatic construction of geometry problems along with step-by-step solutions.☆17Aug 27, 2025Updated 8 months ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆90Sep 12, 2024Updated last year
- A different, but useful, textcat approach.☆18Jul 15, 2024Updated last year
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Jun 7, 2024Updated last year
- wePoker is a multi-player poker game for Android☆11Mar 20, 2013Updated 13 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 11 months ago
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆28Apr 23, 2026Updated last week