OpenImagingLab / FlashVSR
Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional decoder.
☆1,291Updated last month
Alternatives and similar repositories for FlashVSR
Users who are interested in FlashVSR compare it to the repositories listed below.
- [NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulatio…☆616Updated 3 months ago
- MoCha: End-to-End Video Character Replacement without Structural Guidance☆620Updated 3 weeks ago
- Official code for StoryMem: Multi-shot Long Video Storytelling with Memory☆638Updated 2 weeks ago
- UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer☆832Updated 9 months ago
- Repo for SeedVR2 (ICLR2026) & SeedVR (CVPR2025 Highlight)☆961Updated last week
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.☆725Updated last month
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) and UltraViCo (IC…☆782Updated last week
- SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation☆569Updated last month
- ObjectClear: Complete Object Removal via Object-Effect Attention☆532Updated 2 months ago
- [ICLR2026] SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training☆623Updated last week
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆564Updated 3 months ago
- [CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition☆803Updated 5 months ago
- Official Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations☆806Updated last month
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆1,127Updated last week
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆498Updated 5 months ago
- ☆714Updated 2 months ago
- DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer☆497Updated 3 weeks ago
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥☆620Updated last month
- [NeurIPS'25] One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution☆335Updated 2 months ago
- The official repository of paper "Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion"☆278Updated 3 weeks ago
- Pusa: Thousands Timesteps Video Diffusion Model☆672Updated 5 months ago
- Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives☆627Updated 2 months ago
- ☆546Updated 2 months ago
- Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference☆1,247Updated 3 months ago
- [NeurIPS 2024] Generalizable Implicit Motion Modeling for Video Frame Interpolation☆378Updated 8 months ago
- Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"☆433Updated 2 months ago
- 🔥 [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUX☆280Updated 6 months ago
- ☆1,782Updated 6 months ago
- ☆1,046Updated 8 months ago
- [ICLR 2026] ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆667Updated 2 months ago