yunlong10 / Video-R4Links
Reinforcing Text-Rich Video Reasoning with Visual Rumination
☆27Updated last month
Alternatives and similar repositories for Video-R4
Users that are interested in Video-R4 are comparing it to the libraries listed below
Sorting:
- LVAS-Agent Code Base☆21Updated 8 months ago
- VideoCoF: Unified Video Editing with Temporal Reasoner☆122Updated last week
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''☆24Updated last week
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆51Updated this week
- An official implementation of SwapAnyone.☆72Updated 9 months ago
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO☆83Updated last month
- ☆33Updated 2 months ago
- VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning☆60Updated 2 months ago
- PICABench: How Far Are We from Physically Realistic Image Editing?☆33Updated 2 months ago
- ☆132Updated 6 months ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Updated 8 months ago
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆29Updated 3 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆52Updated 2 weeks ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (Arxiv 2025)☆38Updated 6 months ago
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models☆38Updated 6 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆70Updated 2 months ago
- ☆23Updated last year
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026☆36Updated last month
- ☆91Updated 4 months ago
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆118Updated 3 weeks ago
- ☆35Updated 3 weeks ago
- The official UniVerse-1 code.☆116Updated 2 months ago
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆52Updated 5 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆32Updated 4 months ago
- Make self forcing endless. Add cache purging. Add prompt controllability.☆68Updated 4 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆46Updated last month
- Test-time Scaling for VAR models☆29Updated 3 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆51Updated last year
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆30Updated 2 weeks ago
- The code implementation for the paper "DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation".☆29Updated 4 months ago