High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning
☆53Jul 23, 2025Updated 7 months ago
Alternatives and similar repositories for MGPO
Users who are interested in MGPO are comparing it to the libraries listed below
- ☆57Updated this week
- FormulaOne: A dataset of algorithmic problems based on MSO formulas.☆24Aug 14, 2025Updated 6 months ago
- ☆18Aug 1, 2025Updated 7 months ago
- a ComfyUI plugin that provides a user interface of AudioMass, full-featured web-based audio & waveform editing tool☆27Feb 6, 2026Updated 3 weeks ago
- A framework that allows you to apply Sparse AutoEncoder on any models☆51Jul 11, 2025Updated 7 months ago
- VisPlay: Self-Evolving Vision-Language Models☆44Feb 12, 2026Updated 2 weeks ago
- [NeurIPS 2025 Spotlight] Fast-Slow Thinking GRPO for Large Vision-Language Model Reasoning☆49Jan 20, 2026Updated last month
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM☆20May 22, 2025Updated 9 months ago
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 6 months ago
- Egocentric Video Understanding Dataset (EVUD)☆33Jul 4, 2024Updated last year
- ☆14May 31, 2022Updated 3 years ago
- Code release for BiGS: Bidirectional Primitives for Relightable 3D Gaussian Splatting☆27Mar 16, 2025Updated 11 months ago
- Official implementation for Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis (CVPR 2025)☆35Nov 19, 2025Updated 3 months ago
- Scaffold Prompting to promote LMMs☆46Dec 16, 2024Updated last year
- Toolbox for GTA-Human Datasets☆25Oct 9, 2024Updated last year
- ☆33Jul 15, 2025Updated 7 months ago
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆48Sep 15, 2025Updated 5 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆39Jan 5, 2026Updated last month
- CoRL25-"AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies"☆43Aug 15, 2025Updated 6 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆34Sep 1, 2025Updated 6 months ago
- The official implementation of "Neural Point-based Volumetric Avatar: Surface-guided Neural Points for Efficient and Photorealistic Volum…☆23Mar 27, 2024Updated last year
- Memory Efficient Training Framework for Large Video Generation Model☆25Apr 22, 2024Updated last year
- [CoRL 2025] UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations☆76Dec 18, 2025Updated 2 months ago
- [ICCV 2025] Boosting MLLM Reasoning with Text-Debiased Hint-GRPO☆46Jul 1, 2025Updated 8 months ago
- ☆34Updated this week
- Official repository for code and information related to the HumanOLAT dataset (ICCV 2025).☆38Nov 17, 2025Updated 3 months ago
- Code for "Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos"☆28Oct 25, 2021Updated 4 years ago
- the official implementation of the paper: Neural Parameterization for Dynamic Human Head Editing☆31Feb 14, 2024Updated 2 years ago
- Scaling Spatial Intelligence with Multimodal Foundation Models☆177Feb 6, 2026Updated 3 weeks ago
- Official repository for ReasonGen-R1☆74Jun 23, 2025Updated 8 months ago
- ☆55Feb 2, 2026Updated last month
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆34Nov 5, 2024Updated last year
- [AAAI2025] GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians☆37Apr 2, 2025Updated 11 months ago
- A collection of existing public 3D Cloth Data☆35Jul 5, 2022Updated 3 years ago
- [CVPR 2025] SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens☆106Jan 26, 2026Updated last month
- ☆40Mar 3, 2024Updated last year
- Unlocking Iterative Reasoning for Any Image Editor☆89Jan 18, 2026Updated last month
- Official code for ECCV 2024 paper: Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation☆72Jul 11, 2024Updated last year