SLIT-AI/WRPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SLIT-AI/WRPO)

SLIT-AI / WRPO

[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion

☆14

Alternatives and similar repositories for WRPO

Users that are interested in WRPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SLIT-AI / FuseChat-3.0
View on GitHub
☆18Apr 18, 2025Updated last year
SLIT-AI / ADPA
View on GitHub
[ICLR2025 Spotlight] Advantage-Guided Distillation for Preference Alignment in Small Language Models
☆26Feb 10, 2025Updated last year
fanqiwan / Explore-Instruct
View on GitHub
EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration
☆36Mar 10, 2024Updated 2 years ago
hedj17 / DST
View on GitHub
This project implements two dynamic spatiotemporal interpolation (DST) methods, i.e., coarse-grained DST (CGDST) and fine-grained DST (FG…
☆11Apr 15, 2022Updated 4 years ago
google / humanio
View on GitHub
Human I/O, published at CHI 2024, Honorable Mentions Award
☆18May 21, 2026Updated last month
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
pixeli99 / MixLN
View on GitHub
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆30Jul 24, 2025Updated 11 months ago
GX-XinGao / GRA
View on GitHub
The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"
☆34Jun 13, 2025Updated last year
Roblox / SmoothCache
View on GitHub
Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.
☆48Jul 17, 2025Updated last year
HHTheBest / Computer-Architecture-Review
View on GitHub
A Brief Review for Computer Architecture
☆19Apr 23, 2025Updated last year
opencity3d / opencity3d
View on GitHub
Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025
☆19Nov 24, 2024Updated last year
yinyueqin / DenseRewardRLHF-PPO
View on GitHub
This repository contains the code and released models for the paper Segmenting Text and Learning Their Rewards for Improved RLHF in Langu…
☆19Jan 8, 2025Updated last year
dongwonjo / FastKV
View on GitHub
[ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc…
☆32Apr 14, 2026Updated 3 months ago
symanto-research / merge-tokenizers
View on GitHub
Package to align tokens from different tokenizations.
☆16Mar 25, 2024Updated 2 years ago
g-luo / vlm_cross_modal_reps
View on GitHub
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆34May 1, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
wbs2788 / MTM
View on GitHub
Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…
☆28Jan 21, 2025Updated last year
showlab / TPDiff
View on GitHub
TPDiff: Temporal Pyramid Video Diffusion Model
☆25Mar 13, 2025Updated last year
danaesavi / ImageChain
View on GitHub
This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…
☆15Jun 4, 2025Updated last year
18907305772 / FuseAI
View on GitHub
FuseAI Project
☆93Jan 25, 2025Updated last year
OpenDFM / MobA
View on GitHub
🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien…
☆28Oct 10, 2025Updated 9 months ago
yigu1008 / Diffusion-RPO
View on GitHub
☆15Mar 30, 2025Updated last year
xydaytoy / EVA
View on GitHub
☆14Apr 16, 2024Updated 2 years ago
bigai-nlco / RuleReasoner
View on GitHub
[ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
☆39Feb 25, 2026Updated 4 months ago
imagination-research / distilled-decoding
View on GitHub
[ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching
☆55Apr 21, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
m-arda-aydn / ITACLIP
View on GitHub
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements [CVPRW 2025]
☆24Jan 31, 2026Updated 5 months ago
Gen-Verse / Diffusion-Sharpening
View on GitHub
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
☆72May 18, 2025Updated last year
wang8740 / MAP
View on GitHub
Documentation at
☆14Mar 27, 2025Updated last year
DHPark98 / SequenceMatters
View on GitHub
Sequence Matters : Harnessing Video Model in 3D Super-Resolution
☆45Jan 6, 2026Updated 6 months ago
JarvisPei / CMoE
View on GitHub
[ACL 2026 Main] Analytical FFN-to-MoE Restructuring via Activation Pattern Analysis
☆46Jun 30, 2026Updated 3 weeks ago
songrise / MLLM4Art
View on GitHub
[ACM MM 2025] MLLMs for Aesthetics Reasoning
☆26Jan 5, 2026Updated 6 months ago
kanishkg / boxing-gym
View on GitHub
☆11Jul 30, 2025Updated 11 months ago
KongLongGeFDU / TransferTOD
View on GitHub
The code repository of paper "TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities"
☆20May 12, 2026Updated 2 months ago
dadelani / sib-200
View on GitHub
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
☆26May 20, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
IIT-PAVIS / Positional_Diffusion
View on GitHub
Code for "Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models"
☆18Mar 21, 2023Updated 3 years ago
wangbohan97 / MPS
View on GitHub
☆13Jul 5, 2024Updated 2 years ago
NadavSc / Diff-Mamba
View on GitHub
☆22Jan 23, 2026Updated 5 months ago
shengliu66 / FractionalReason
View on GitHub
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17Jun 30, 2025Updated last year
IAAR-Shanghai / SEAP
View on GitHub
☆22Jun 10, 2025Updated last year
Aaron617 / text2world
View on GitHub
[ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation
☆29Feb 25, 2025Updated last year
gmongaras / Cottention_Transformer
View on GitHub
Code for the paper "Cottention: Linear Transformers With Cosine Attention"
☆20Nov 15, 2025Updated 8 months ago