This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.
β56Aug 13, 2024Updated last year
Alternatives and similar repositories for CPO_SIMPO
Users that are interested in CPO_SIMPO are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Rewardβ946Feb 16, 2025Updated last year
- DPO, but faster πβ48Dec 6, 2024Updated last year
- First explanation metric (diagnostic report) for text generation evaluationβ62Mar 3, 2025Updated last year
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"β41Sep 24, 2024Updated last year
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Modelsβ11Jan 19, 2024Updated 2 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and vaβ¦β12Nov 6, 2023Updated 2 years ago
- β16Feb 6, 2024Updated 2 years ago
- Latent Large Language Modelsβ19Aug 24, 2024Updated last year
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformersβ14Jun 7, 2024Updated last year
- Code for paper γDrug-Drug Interaction Extraction via Recurrent Hybrid Convolutional Neural Networks with an Improved Focal Lossγβ14Mar 12, 2019Updated 6 years ago
- β14Oct 11, 2023Updated 2 years ago
- Masking tokens to modify the predictions of a pretrained sentence classifierβ16Feb 4, 2020Updated 6 years ago
- This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function callβ¦β17Apr 7, 2024Updated last year
- [BMVC 2024 Oral β¨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimizationβ20Sep 11, 2024Updated last year
- This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).β18Feb 17, 2025Updated last year
- Code for "MIM: Mutual Information Machine" paper.β15Nov 22, 2022Updated 3 years ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"β75May 20, 2025Updated 9 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flβ¦β78Aug 17, 2024Updated last year
- Text-2-SQLβ19Feb 21, 2025Updated last year
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"β30Jan 10, 2026Updated last month
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/β¦β28Apr 17, 2024Updated last year
- Hugging Face RoBERTa with Flash Attention 2β24Sep 14, 2025Updated 5 months ago
- Source Code for DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank Utterances (https://arxiv.org/pdf/2012.0β¦β79Jan 2, 2022Updated 4 years ago
- State-of-the-art LLM-based translation models.β579Apr 9, 2025Updated 10 months ago
- MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]β24Nov 3, 2024Updated last year
- Multi-task modelling extensions for huggingface transformersβ21Mar 3, 2023Updated 3 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generationβ27Jun 7, 2024Updated last year
- β56Nov 6, 2024Updated last year
- β51Oct 28, 2024Updated last year
- β22Oct 26, 2020Updated 5 years ago
- Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectanceβ23Jan 20, 2025Updated last year
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignmentβ56Jun 16, 2024Updated last year
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"β23Jun 28, 2024Updated last year
- β63Oct 3, 2024Updated last year
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram andβ¦β44Oct 10, 2025Updated 4 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Followingβ51Jan 5, 2026Updated 2 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Modelsβ28Mar 22, 2024Updated last year
- β32Jul 11, 2024Updated last year
- Official Code for M-Rα΄α΄‘α΄Κα΄ Bα΄Ι΄α΄Κ: Evaluating Reward Models in Multilingual Settings (ACL 2025 Main)β40May 16, 2025Updated 9 months ago