E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models
☆43 · Jan 5, 2026 · Updated 6 months ago
Alternatives and similar repositories for VisualGRPO
Users that are interested in VisualGRPO are comparing it to the libraries listed below.
- VLS: Steering Pretrained Robot Policies via Vision–Language Models ☆64 · Mar 29, 2026 · Updated 3 months ago
- Official Implementation of wd1 ☆31 · Sep 25, 2025 · Updated 9 months ago
- [CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval ☆28 · Mar 26, 2025 · Updated last year
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate… ☆15 · Dec 31, 2024 · Updated last year
- ☆68 · Aug 16, 2024 · Updated last year
- A survey for visual generation alignment ☆141 · Nov 9, 2025 · Updated 7 months ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models ☆24 · Mar 29, 2025 · Updated last year
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1" ☆35 · Feb 22, 2026 · Updated 4 months ago
- ☆46 · Jun 16, 2026 · Updated 2 weeks ago
- Official implementation of LaVin-DiT ☆53 · Jan 27, 2025 · Updated last year
- ☆60 · Jul 4, 2025 · Updated last year
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025] ☆21 · Aug 21, 2025 · Updated 10 months ago
- Official implementation of NeurIPS 2025 paper "SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent" ☆163 · Nov 13, 2025 · Updated 7 months ago
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach ☆17 · Apr 2, 2025 · Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning ☆36 · May 15, 2026 · Updated last month
- TPDiff: Temporal Pyramid Video Diffusion Model ☆25 · Mar 13, 2025 · Updated last year
- The official implementation of Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion [AAAI'2… ☆17 · Feb 2, 2026 · Updated 5 months ago
- Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content" ☆73 · Jun 9, 2025 · Updated last year
- ☆18 · May 13, 2025 · Updated last year
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training ☆54 · Dec 13, 2025 · Updated 6 months ago
- DreamCinema: Cinematic Transfer with Free Camera and 3D Character ☆96 · Jun 13, 2025 · Updated last year
- ☆98 · Apr 3, 2026 · Updated 3 months ago
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)" ☆37 · Mar 27, 2026 · Updated 3 months ago
- [CVPR 2025 Oral] PyTorch re-implementation for Autoregressive Distillation of Diffusion Transformers (ARD). ☆144 · Oct 1, 2025 · Updated 9 months ago
- Can 3D Vision-Language Models Truly Understand Natural Language? ☆20 · Mar 28, 2024 · Updated 2 years ago
- Nearest Neighbor Normalization (EMNLP 2024) ☆21 · Nov 1, 2024 · Updated last year
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality ☆22 · Oct 8, 2024 · Updated last year
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training" ☆35 · Apr 21, 2024 · Updated 2 years ago
- ☆31 · Sep 12, 2025 · Updated 9 months ago
- [CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps ☆13 · Mar 26, 2025 · Updated last year
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams ☆108 · Mar 15, 2026 · Updated 3 months ago
- ☆15 · Sep 17, 2023 · Updated 2 years ago
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision ☆30 · May 26, 2025 · Updated last year
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆75 · Jul 13, 2025 · Updated 11 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning ☆55 · Jul 23, 2025 · Updated 11 months ago
- SFT+RL boosts multimodal reasoning ☆47 · Jun 27, 2025 · Updated last year
- LLM deployment and fine-tuning ☆10 · May 18, 2023 · Updated 3 years ago
- ☆12 · Apr 22, 2025 · Updated last year
- ☆30 · Jun 25, 2021 · Updated 5 years ago