[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO
☆113Feb 28, 2026Updated 2 months ago
Alternatives and similar repositories for VANS
Users that are interested in VANS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆27Apr 28, 2025Updated last year
- Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.☆37Jan 16, 2026Updated 3 months ago
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆58Oct 16, 2025Updated 6 months ago
- [CVPR 2026] FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning☆52Mar 26, 2026Updated last month
- ☆26Jun 20, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆15Mar 30, 2025Updated last year
- Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"☆201Dec 29, 2025Updated 4 months ago
- Audio-video joint generation☆57Nov 27, 2025Updated 5 months ago
- PICABench: How Far Are We from Physically Realistic Image Editing?☆36Nov 5, 2025Updated 6 months ago
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance☆619Jan 5, 2026Updated 4 months ago
- official repo for `thinking with images through-self-calling`☆26Dec 28, 2025Updated 4 months ago
- The official implementation of StereoPilot☆112Dec 19, 2025Updated 4 months ago
- StableWorld: Towards Stable and Consistent Long Interactive Video Generation☆96Mar 18, 2026Updated last month
- Consistent Autoregressive Video Generation with Long Context☆81Feb 6, 2026Updated 3 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit☆143Apr 5, 2026Updated last month
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆22Dec 2, 2025Updated 5 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆176Sep 1, 2025Updated 8 months ago
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?☆92Jul 13, 2025Updated 9 months ago
- ☆147Feb 28, 2026Updated 2 months ago
- [CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding☆59Mar 16, 2026Updated last month
- [WACV2025] Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field☆14Nov 3, 2024Updated last year
- [CVPR25] IAR☆17Jun 13, 2025Updated 10 months ago
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023)☆65Sep 28, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆43Jan 29, 2026Updated 3 months ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆41May 30, 2025Updated 11 months ago
- UniVid: The Open-Source Unified Video Model☆32Oct 13, 2025Updated 6 months ago
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆46Apr 15, 2026Updated 3 weeks ago
- Official Repo for paper: Scaling Behavior Cloning Improves Causal Reasoning: An Open Model for Real-Time Video Game Playing☆143Feb 6, 2026Updated 3 months ago
- ☆70Apr 21, 2026Updated 2 weeks ago
- ICML2025☆64Aug 28, 2025Updated 8 months ago
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"☆18Aug 27, 2025Updated 8 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆192Jan 30, 2026Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆51Jun 4, 2025Updated 11 months ago
- Official Repo for the Paper Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control☆43Dec 30, 2024Updated last year
- Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey.☆48Sep 2, 2025Updated 8 months ago
- VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation☆19Jun 2, 2025Updated 11 months ago
- [ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation☆33Aug 18, 2025Updated 8 months ago
- ☆34Dec 29, 2025Updated 4 months ago
- Official repository of the paper InstructBrush: Learning Attention-based Instruction Optimization for Image Editing☆16Apr 14, 2024Updated 2 years ago