fe1ixxu / CPO_SIMPOView external linksLinks
This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.
β56Aug 13, 2024Updated last year
Alternatives and similar repositories for CPO_SIMPO
Users that are interested in CPO_SIMPO are comparing it to the libraries listed below
Sorting:
- DPO, but faster πβ47Dec 6, 2024Updated last year
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Rewardβ944Feb 16, 2025Updated 11 months ago
- First explanation metric (diagnostic report) for text generation evaluationβ62Mar 3, 2025Updated 11 months ago
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"β41Sep 24, 2024Updated last year
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Modelsβ11Jan 19, 2024Updated 2 years ago
- Repository for Skill Set Optimizationβ14Jul 26, 2024Updated last year
- β16Feb 6, 2024Updated 2 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and vaβ¦β12Nov 6, 2023Updated 2 years ago
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformersβ14Jun 7, 2024Updated last year
- Latent Large Language Modelsβ19Aug 24, 2024Updated last year
- Masking tokens to modify the predictions of a pretrained sentence classifierβ16Feb 4, 2020Updated 6 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encodersβ18May 23, 2025Updated 8 months ago
- β14Oct 11, 2023Updated 2 years ago
- Code for paper γDrug-Drug Interaction Extraction via Recurrent Hybrid Convolutional Neural Networks with an Improved Focal Lossγβ14Mar 12, 2019Updated 6 years ago
- This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function callβ¦β17Apr 7, 2024Updated last year
- [BMVC 2024 Oral β¨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimizationβ20Sep 11, 2024Updated last year
- Code for "MIM: Mutual Information Machine" paper.β15Nov 22, 2022Updated 3 years ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"β75May 20, 2025Updated 8 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flβ¦β78Aug 17, 2024Updated last year
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"β29Jan 10, 2026Updated last month
- Text-2-SQLβ19Feb 21, 2025Updated 11 months ago
- [ECCVβ24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"β21Mar 26, 2025Updated 10 months ago
- State-of-the-art LLM-based translation models.β577Apr 9, 2025Updated 10 months ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generationβ27Jun 7, 2024Updated last year
- Multi-task modelling extensions for huggingface transformersβ21Mar 3, 2023Updated 2 years ago
- β51Oct 28, 2024Updated last year
- Diffusion Reflectance Map: Single-Image Stochastic Inverse Rendering of Illumination and Reflectanceβ23Jan 20, 2025Updated last year
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignmentβ57Jun 16, 2024Updated last year
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"β23Jun 28, 2024Updated last year
- β63Oct 3, 2024Updated last year
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram andβ¦β42Oct 10, 2025Updated 4 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Modelsβ28Mar 22, 2024Updated last year
- [EMNLP 2025] Verification Engineering for RL in Instruction Followingβ50Jan 5, 2026Updated last month
- ANN Search through the COVID CORD-19 Dataset using SBERT.β26May 9, 2020Updated 5 years ago
- [ACL 2025] RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generationβ118Jan 23, 2025Updated last year
- [ICLR 2025] MiniPLM: Knowledge Distillation for Pre-Training Language Modelsβ73Nov 23, 2024Updated last year
- The official implementation of Self-Play Fine-Tuning (SPIN)β1,234May 8, 2024Updated last year
- β25Oct 22, 2022Updated 3 years ago
- β33May 12, 2023Updated 2 years ago