[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion
☆14 Mar 17, 2025 Updated 11 months ago
Alternatives and similar repositories for WRPO
Users who are interested in WRPO are comparing it to the libraries listed below
- ☆18 Apr 18, 2025 Updated 10 months ago
- [ICLR2025 Spotlight] Advantage-Guided Distillation for Preference Alignment in Small Language Models ☆24 Feb 10, 2025 Updated last year
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… ☆29 Jul 24, 2025 Updated 7 months ago
- Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration ☆29 Nov 22, 2025 Updated 3 months ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations ☆18 Oct 18, 2025 Updated 4 months ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis" ☆34 Jun 13, 2025 Updated 8 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025 ☆33 May 1, 2025 Updated 9 months ago
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025 ☆16 Nov 24, 2024 Updated last year
- ☆21 Jul 25, 2025 Updated 7 months ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation ☆28 Feb 25, 2025 Updated last year
- Code for the paper "Cottention: Linear Transformers With Cosine Attention" ☆20 Nov 15, 2025 Updated 3 months ago
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆47 Jul 17, 2025 Updated 7 months ago
- Sequence Matters: Harnessing Video Model in 3D Super-Resolution ☆43 Jan 6, 2026 Updated last month
- [ICLR 2026] Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing ☆29 Feb 6, 2026 Updated 3 weeks ago
- Official repository of DialSim ☆29 Oct 31, 2025 Updated 4 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning" ☆29 Jun 3, 2025 Updated 8 months ago
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models" ☆20 Jan 16, 2025 Updated last year
- ☆19 Jan 8, 2025 Updated last year
- [ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models". ☆90 Feb 6, 2026 Updated 3 weeks ago
- ☆29 Apr 22, 2025 Updated 10 months ago
- [ICCV'25 Oral] The official implementation of Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion ☆63 Jul 24, 2025 Updated 7 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model ☆25 Mar 13, 2025 Updated 11 months ago
- ☆22 Jun 10, 2025 Updated 8 months ago
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1" ☆31 Updated this week
- ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements [CVPRW 2025] ☆24 Jan 31, 2026 Updated last month
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif… ☆28 Jan 21, 2025 Updated last year
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien… ☆27 Oct 10, 2025 Updated 4 months ago
- ☆19 Jan 10, 2025 Updated last year
- Official implementation of "Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving" ☆29 May 8, 2025 Updated 9 months ago
- [ACM MM 2025] MLLMs for Aesthetics Reasoning ☆23 Jan 5, 2026 Updated last month
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching ☆54 Apr 21, 2025 Updated 10 months ago
- ☆23 Jul 5, 2024 Updated last year
- ☆41 May 27, 2025 Updated 9 months ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o… ☆28 Jul 9, 2025 Updated 7 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment ☆64 Jul 22, 2025 Updated 7 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening ☆69 May 18, 2025 Updated 9 months ago
- ☆38 Jan 8, 2026 Updated last month
- ☆45 Nov 9, 2025 Updated 3 months ago
- A Text2SQL benchmark for evaluation of Large Language Models ☆41 Updated this week