Reinforcing Text-Rich Video Reasoning with Visual Rumination
☆27 · Nov 24, 2025 · Updated 3 months ago
Alternatives and similar repositories for Video-R4
Users who are interested in Video-R4 are comparing it to the repositories listed below.
- [CVPR 2025] VidComposition: Can MLLMs Analyze Compositions in Compiled Videos? ☆29 · May 10, 2025 · Updated 9 months ago
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference ☆36 · Oct 29, 2025 · Updated 4 months ago
- ☆37 · Oct 29, 2025 · Updated 4 months ago
- [CVPR 2026] Official PyTorch implementation of "ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding" ☆17 · Dec 17, 2025 · Updated 2 months ago
- ☆17 · Jun 20, 2025 · Updated 8 months ago
- ☆35 · Dec 16, 2025 · Updated 2 months ago
- A node for ComfyUI that adjusts a latent image before the VAE decoding step in order to improve your image quality. ☆35 · Dec 30, 2025 · Updated last month
- ☆20 · May 11, 2025 · Updated 9 months ago
- ☆26 · Jan 4, 2025 · Updated last year
- [AAAI 2025] Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding ☆34 · Mar 21, 2025 · Updated 11 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing ☆58 · Dec 26, 2025 · Updated 2 months ago
- ☆21 · Dec 14, 2025 · Updated 2 months ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas… ☆10 · Dec 19, 2024 · Updated last year
- Martingale posterior neural networks for fast sequential decision making @ NeurIPS 2025 ☆23 · Nov 13, 2025 · Updated 3 months ago
- Software to enable data-rich collaboration from high-resolution display walls to your laptop ☆16 · Feb 19, 2026 · Updated last week
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory. ☆29 · Updated this week
- A free and open-source focus stacking software that supports multi-focus image alignment and fusion. ☆19 · Feb 5, 2026 · Updated 3 weeks ago
- Animate Any Character in Any World ☆90 · Jan 9, 2026 · Updated last month
- ☆92 · Dec 30, 2025 · Updated 2 months ago
- ☆43 · Dec 1, 2025 · Updated 2 months ago
- ☆31 · Feb 3, 2026 · Updated 3 weeks ago
- ComfyUI custom node implementation of VideoMaMa for video matting with mask conditioning. ☆34 · Feb 9, 2026 · Updated 2 weeks ago
- Fast, free, easy, and object-agnostic video anonymization ☆11 · Dec 12, 2020 · Updated 5 years ago
- PainterVRAM lets you reserve a slice of GPU memory before ComfyUI starts processing, preventing out-of-memory crashes. Switch between man… ☆27 · Jan 2, 2026 · Updated last month
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO ☆92 · Dec 1, 2025 · Updated 2 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence. ☆28 · Dec 30, 2025 · Updated 2 months ago
- Auction Theory Toolbox – Computer Verified Auctions ☆14 · Jul 12, 2016 · Updated 9 years ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations. ☆24 · Feb 20, 2026 · Updated last week
- Are Video Models Ready as Zero-shot Reasoners? ☆84 · Nov 24, 2025 · Updated 3 months ago
- ☆24 · Dec 19, 2025 · Updated 2 months ago
- [ISBI 2024] Official PyTorch implementation of Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Seg… ☆11 · Aug 12, 2024 · Updated last year
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV … ☆24 · Dec 4, 2025 · Updated 2 months ago
- ☆13 · Oct 21, 2024 · Updated last year
- MCP server for Grok AI API integration ☆19 · Jun 2, 2025 · Updated 8 months ago
- Code of Rags2riches ☆20 · May 26, 2025 · Updated 9 months ago
- ☆12 · Nov 5, 2024 · Updated last year
- Get aid from local LLMs right in your PowerShell ☆15 · May 2, 2025 · Updated 9 months ago
- ☆19 · Dec 1, 2025 · Updated 2 months ago
- [ICLR 2026] [NeurIPS 2025] ViPRA: Video Prediction for Robot Actions ☆25 · Jan 27, 2026 · Updated last month