A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using autoregressive diffusion.
β302Dec 15, 2025Updated 2 months ago
Alternatives and similar repositories for RealVideo
Users that are interested in RealVideo are comparing it to the libraries listed below
Sorting:
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editingβ58Dec 26, 2025Updated 2 months ago
- [CVPR 2026] π Dataset and Benchmark code for EgoEditβ106Feb 21, 2026Updated last week
- β21Dec 12, 2025Updated 2 months ago
- β64Dec 16, 2025Updated 2 months ago
- Animate Any Character in Any Worldβ90Jan 9, 2026Updated last month
- β86Feb 4, 2026Updated 3 weeks ago
- Official implementation of "VideoMaMa: Mask-Guided Video Matting via Generative Prior", CVPR 2026β279Feb 7, 2026Updated 3 weeks ago
- A Unified Visual Generator with Interleaved OmniModal Contextβ185Feb 10, 2026Updated 2 weeks ago
- β37Oct 29, 2025Updated 4 months ago
- Reinforcing Text-Rich Video Reasoning with Visual Ruminationβ27Nov 24, 2025Updated 3 months ago
- Pusa: Thousands Timesteps Video Diffusion Modelβ671Feb 13, 2026Updated 2 weeks ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"β69Updated this week
- Code2Worlds: Empowering Coding LLMs for 4D World Generationβ65Feb 16, 2026Updated last week
- β109Sep 3, 2025Updated 5 months ago
- [CVPR 2026] OmniTransfer: All-in-one Framework for Spatio-temporal Video Transferβ219Feb 21, 2026Updated last week
- Overworld's local world client interface to run Waypoint world modelsβ46Updated this week
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Controlβ167Dec 11, 2025Updated 2 months ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"β39Jan 29, 2026Updated last month
- β187Dec 10, 2025Updated 2 months ago
- [CVPR2026]We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videβ¦β446Feb 21, 2026Updated last week
- Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"β184Dec 29, 2025Updated 2 months ago
- Dungeon procedural generator similar to whatabou's "One Page Dungeon"β48Jan 4, 2026Updated last month
- PISCO: Precise Video Instance Insertion with Sparse Controlβ46Feb 13, 2026Updated 2 weeks ago
- [CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Timeβ97Jan 1, 2026Updated last month
- Official pytorch implementation for SingleInsertβ28Apr 19, 2024Updated last year
- A tool for running and customizing real-time, interactive generative AI pipelines and modelsβ238Updated this week
- β257Jan 2, 2026Updated last month
- ICLR 2025 paper X-NeMo & Project X-Portrati2β115Aug 7, 2025Updated 6 months ago
- FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generationβ¦β302Jan 7, 2026Updated last month
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generationβ1,205Oct 15, 2025Updated 4 months ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)]β75Aug 2, 2025Updated 6 months ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Modelsβ154Sep 24, 2025Updated 5 months ago
- SpotEdit:Selective Region Editing in Diffusion Transformersβ170Jan 5, 2026Updated last month
- Official Pytorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation - NeurIPS 2024β113Dec 23, 2024Updated last year
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".β133Dec 18, 2025Updated 2 months ago
- CogVideoX-LoRAs is a centralized repository for all LoRA models created for CogVideoX, filling the gap for a unified sharing space. With β¦β80Dec 4, 2024Updated last year
- Minimalist stable-diffusion desktop application with only one executable file writen with golang ( No python ).β18Apr 16, 2025Updated 10 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"β20Jan 26, 2025Updated last year
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidanceβ577Jan 5, 2026Updated last month