[ICML 2026] a unified reinforcement learning toolbox for joint RL on language models and diffusion models
☆80Mar 31, 2026Updated last month
Alternatives and similar repositories for UniRL
Users that are interested in UniRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆24Mar 16, 2026Updated last month
- ☆44Jan 4, 2026Updated 4 months ago
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT☆134Jan 30, 2026Updated 3 months ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)☆128Jul 22, 2024Updated last year
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆22Apr 10, 2026Updated 3 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆69Jan 23, 2026Updated 3 months ago
- Consistent Autoregressive Video Generation with Long Context☆81Feb 6, 2026Updated 3 months ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- ☆11Mar 11, 2025Updated last year
- ☆27Jun 22, 2024Updated last year
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)☆27May 23, 2024Updated last year
- [ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process☆806Feb 10, 2026Updated 2 months ago
- [ICLR2026] The official code of "Weak-to-Strong Diffusion with Reflection".☆58Jan 28, 2026Updated 3 months ago
- Exploring Representation-Aligned Latent Space for Better Generation☆19Mar 17, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆45Dec 16, 2025Updated 4 months ago
- ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models☆191Sep 7, 2025Updated 8 months ago
- ☆77Apr 9, 2026Updated 3 weeks ago
- Evaluation for 3D reconstruction, includes monocular depth, video depth, relative camera pose & multi-view point map estimation.☆20Aug 26, 2025Updated 8 months ago
- Official implementation of "Opt-In Art: Learning Art Styles Only from Few Examples" (Accepted by NeurIPS 2025)☆33Nov 30, 2025Updated 5 months ago
- ☆114Feb 4, 2026Updated 3 months ago
- Official repository for the UAE paper, unified-GRPO, and unified-Bench☆164Sep 12, 2025Updated 7 months ago
- Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Transformer Train…☆32Mar 17, 2026Updated last month
- Code for the paper "Overconfidence is a Dangerous Thing: Mitigating Membership Inference Attacks by Enforcing Less Confident Prediction" …☆13Sep 6, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆110Apr 10, 2024Updated 2 years ago
- Official repository for “PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss”☆242Feb 3, 2026Updated 3 months ago
- [ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++☆221Apr 15, 2026Updated 3 weeks ago
- ☆18Apr 4, 2025Updated last year
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 3 months ago
- ☆86Oct 10, 2025Updated 6 months ago
- Enhance robot task understanding ability through visual semantic graph☆10May 20, 2021Updated 4 years ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated last year
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models☆43Oct 30, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆43Mar 11, 2025Updated last year
- Official codebase for the paper Latent Visual Reasoning☆152Oct 22, 2025Updated 6 months ago
- code for paper "Ju Xu, Zhanxing Zhu. Reinforced Continual Learning. NIPS 2018."☆37Feb 17, 2019Updated 7 years ago
- [NeurIPS 2025] HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models☆29Feb 19, 2026Updated 2 months ago
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆24Apr 13, 2026Updated 3 weeks ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆53Feb 27, 2025Updated last year
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆113Sep 19, 2025Updated 7 months ago