MinkaiXu / fPOLinks
f-PO: Generalizing Preference Optimization with f-divergence Minimization
☆11Updated 4 months ago
Alternatives and similar repositories for fPO
Users that are interested in fPO are comparing it to the libraries listed below
Sorting:
- Applies ROME and MEMIT on Mamba-S4 models☆14Updated last year
- Official implementation of Our NeurIPS 2024 Paper "Boundary Matters: A Bi-Level Active Finetuning Method"☆13Updated 6 months ago
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆45Updated last year
- Official implementation of AAAI24 paper "A Dual-way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking"☆8Updated 10 months ago
- ☆17Updated 7 months ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆18Updated 9 months ago
- Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"☆71Updated last year
- source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"☆30Updated last year
- ☆131Updated 2 weeks ago
- ☆19Updated 4 months ago
- Official PyTorch implementation of Rethinking Guidance Information to Utilize Unlabeled Samples: A Label-Encoding Perspective.☆19Updated 10 months ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆13Updated last year
- [ICML 2024] How Interpretable Are Interpretable Graph Neural Networks?☆14Updated last year
- Lightweight Adapting for Black-Box Large Language Models☆23Updated last year
- Code Repository for the NeurIPS 2022 paper: "Hyper-Representations as Generative Models: Sampling Unseen Neural Network Weights".☆16Updated last year
- ☆13Updated last month
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- Official implementation of MARIO: Model Agnostic Recipe for Improving OOD Generalization of Graph Contrastive Learning☆17Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024)☆16Updated last month
- ☆16Updated 4 months ago
- ☆12Updated last year
- exploring whether LLMs perform case-based or rule-based reasoning☆30Updated last year
- ☆11Updated last year
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Updated last year
- Differentiable Top-k Classification Learning☆83Updated 2 years ago
- Official repo of Progressive Data Expansion: data, code and evaluation☆29Updated last year
- OpenReivew Submission Visualization (ICLR 2024/2025)☆151Updated 9 months ago
- ☆55Updated last year
- [EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models☆81Updated last year
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆12Updated 6 months ago