AlenjandroWang / UniReasonLinks
☆50Updated this week
Alternatives and similar repositories for UniReason
Users that are interested in UniReason are comparing it to the libraries listed below
Sorting:
- [Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Mul…☆101Updated 6 months ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better☆186Updated last week
- 🔥 OneThinker: All-in-one Reasoning Model for Image and Video☆388Updated last month
- 🦎 Yo'Chameleon: Your Personalized Chameleon (CVPR 2025)☆150Updated 8 months ago
- Official repository of DARE: dLLM Alignment and Reinforcement Executor☆159Updated last week
- Data and sample evaluation codes for Multimodal Rewardbench 2☆135Updated last month
- MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE☆1,092Updated this week
- [NeurIPS'2025] Official repository for "LiveStar: Live Streaming Assistant for Real-World Online Video Understanding"☆108Updated 2 months ago
- Logic-in-frames: Dynamic keyframe search via visual semantic-logical verification for long video understanding☆58Updated 2 months ago
- [AAAI 2026 Oral] Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic al…☆133Updated 2 months ago
- A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.☆708Updated 2 weeks ago
- **Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos.☆346Updated 3 months ago
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World☆178Updated 3 weeks ago
- First Video Deep Research Benchmark☆139Updated 2 weeks ago
- 🔥 [AAAI 2026 Oral] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptat…☆75Updated last year
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views☆181Updated 2 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆126Updated 2 months ago
- [Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide☆337Updated last month
- Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model☆934Updated last month
- Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression☆284Updated last week
- Explain Before You Answer: A Survey on Compositional Visual Reasoning☆306Updated 3 months ago
- [AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding☆299Updated this week
- Efficient DiT architecture for text2any tasks, ICLR2025☆447Updated 8 months ago
- OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation☆255Updated 4 months ago
- [AAAI 2026 🔥] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"☆176Updated 5 months ago
- [NeurIPS 2025🔥]Main source code of SRPO framework.☆186Updated 2 months ago
- ☆74Updated 10 months ago
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!☆136Updated 4 months ago
- Official repository of MMGenBench☆120Updated 11 months ago
- [NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling☆65Updated last year