yunlong10/Video-R4

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yunlong10/Video-R4)

yunlong10 / Video-R4

Reinforcing Text-Rich Video Reasoning with Visual Rumination

☆28

Alternatives and similar repositories for Video-R4

Users that are interested in Video-R4 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yunlong10 / VidComposition
View on GitHub
[CVPR 2025] VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
☆30May 10, 2025Updated last year
jing-bi / awesome-M.LLM-reasoning
View on GitHub
☆20May 11, 2025Updated last year
yunlong10 / AVicuna
View on GitHub
[AAAI 2025] Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding
☆34Mar 21, 2025Updated last year
yunlong10 / CAT-V
View on GitHub
[AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…
☆67Jan 27, 2026Updated 5 months ago
yunlong10 / Awesome-Video-LMM-Post-Training
View on GitHub
🔥🔥🔥 Latest Papers, Codes and Datasets on Video-LMM Post-Training
☆296Mar 3, 2026Updated 4 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
WikiChao / DAVIS
View on GitHub
[🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …
☆33Mar 30, 2026Updated 3 months ago
liangsusan-git / AV-NeRF
View on GitHub
[NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis
☆36Feb 15, 2024Updated 2 years ago
hanghuacs / MMComposition
View on GitHub
☆17Jun 20, 2025Updated last year
WikiChao / FreSca
View on GitHub
[CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model
☆55May 31, 2025Updated last year
yunlong10 / Awesome-AI4Animation
View on GitHub
[ICCVW 2025] This repository includes latest papers, projects and datasets on GenAI for Cel-Animation. Accepted by ICCV 2025 AISTORY Wor…
☆206Jan 13, 2026Updated 6 months ago
hanghuacs / FineCaption
View on GitHub
☆39Jun 20, 2025Updated last year
WikiChao / ZeroSep
View on GitHub
[NeurIPS 2025] Separate Anything in Audio with Zero Training
☆60Nov 3, 2025Updated 8 months ago
WikiChao / ScalingConcept
View on GitHub
☆24Nov 1, 2024Updated last year
WikiChao / VisAH
View on GitHub
[CVPR 2025] Pytorch implementation of the paper "Learning to Highlight Audio by Watching Movies"
☆15Oct 1, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Songluchuan / Tri2plane
View on GitHub
[ECCV 2024] The repository for 'Tri$^{2}$-plane: Volumetric Avatar Reconstruction with Feature Pyramid'
☆141May 4, 2025Updated last year
yeates / Aurora
View on GitHub
Aurora: Unified Video Editing with a Tool-Using Agent
☆58Jun 16, 2026Updated last month
ZhangAIPI / YOPO_MLLM_Pruning
View on GitHub
Pruning the VLLMs
☆106Dec 9, 2024Updated last year
McGill-NLP / AURORA
View on GitHub
Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation
☆35Jun 30, 2025Updated last year
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
VainF / In-Video-Instructions
View on GitHub
[Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control
☆45Nov 25, 2025Updated 7 months ago
zihuixue / ProgCaptioner
View on GitHub
Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)
☆26Jul 16, 2025Updated last year
RomainCroze / Statistical-Image-Completion
View on GitHub
C/C++ -- Patchmatch/Graphcut
☆14Jan 3, 2014Updated 12 years ago
SaraGhazanfari / CoF
View on GitHub
Chain-of-Frames [CVPR 2026]
☆40Jul 2, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zhousheng97 / EgoTextVQA
View on GitHub
[CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
☆52Jun 19, 2025Updated last year
CUC-MIPG / IC-Effect
View on GitHub
Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"
☆43Jan 29, 2026Updated 5 months ago
Foruck / Kinematic-Phrases
View on GitHub
☆15Jun 2, 2025Updated last year
kai422 / SCALE
View on GitHub
[ICLR 2024] Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement.
☆15Mar 12, 2024Updated 2 years ago
YeolJ00 / Vector-Prism
View on GitHub
Official repository of "Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure" CVPR 2026 Highlight
☆26Dec 17, 2025Updated 7 months ago
oooolga / Ctrl-V
View on GitHub
👆Pytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"
☆35Jul 28, 2025Updated 11 months ago
ysy31415 / EffectMaker
View on GitHub
Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation
☆42Mar 6, 2026Updated 4 months ago
doc-doc / EgoBlind
View on GitHub
EgoBlind: Towards Egocentric Visual Assistance for the Blind (NeurIPS'25, D&B Track)
☆23Apr 20, 2026Updated 3 months ago
Songluchuan / StreamMEcode
View on GitHub
[Siggraph 2025] The code for "StreamME: Simplify 3D Gaussian Avatar within Live Stream"
☆24Jul 22, 2025Updated 11 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
guozinan126 / MUSAR
View on GitHub
☆30May 7, 2025Updated last year
heliossun / LaCoT
View on GitHub
[NeurIPS 2025] Official code for paper: Latent Chain-of-Thought for Visual Reasoning
☆36Oct 16, 2025Updated 9 months ago
WeChatCV / NovaEdit
View on GitHub
[CVPR26] Nova: Video Editing via single/multiple frame references
☆49Mar 4, 2026Updated 4 months ago
Sanyuan-Chen / CSS_with_EETransformer
View on GitHub
Code for the ICASSP-2021 paper: Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
☆12Sep 2, 2021Updated 4 years ago
LaVi-Lab / Rethink_CoT_Video
View on GitHub
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆21Dec 14, 2025Updated 7 months ago
HumanMLLM / IRG-MotionLLM
View on GitHub
(ECCV2026) Official repository of paper "IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Gene…
☆30Jul 1, 2026Updated 2 weeks ago
zjr2000 / LLMVA-GEBC
View on GitHub
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
☆29Jan 1, 2024Updated 2 years ago