StableWorld: Towards Stable and Consistent Long Interactive Video Generation
☆86Mar 18, 2026Updated this week
Alternatives and similar repositories for StableWorld
Users that are interested in StableWorld are comparing it to the libraries listed below
Sorting:
- ☆33Nov 26, 2025Updated 3 months ago
- Official pytorch implementation of "Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use"☆20Sep 16, 2025Updated 6 months ago
- Implementation of paper "SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model"☆93Jan 1, 2026Updated 2 months ago
- [ECCV 2024] Dual-Camera Smoooth Zoom on Mobile Phones☆75Nov 18, 2024Updated last year
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆173Dec 11, 2025Updated 3 months ago
- ☆30Apr 24, 2025Updated 10 months ago
- Official repository for "Vid2World: Crafting Video Diffusion Models to Interactive World Models" (ICLR 2026), https://arxiv.org/abs/2505.…☆44Jan 27, 2026Updated last month
- Official implementation of "MV-TAP: Tracking Any Point in Multi-View Videos"☆39Mar 10, 2026Updated last week
- Official implementation of "Repurposing Video Diffusion Transformers for Robust Point Tracking"☆41Dec 24, 2025Updated 2 months ago
- A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration☆17Jul 22, 2022Updated 3 years ago
- [ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models☆33Dec 27, 2025Updated 2 months ago
- ☆19Jul 7, 2023Updated 2 years ago
- [ICCV 2025] Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction☆26Oct 1, 2025Updated 5 months ago
- ☆30Dec 12, 2024Updated last year
- [CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding☆43Updated this week
- ☆37Mar 21, 2025Updated last year
- Cut2Next: Generating Next Shot via In-Context Tuning☆31Aug 21, 2025Updated 7 months ago
- This is the official PyTorch implementation of TBSR. Our team received 2nd place (real data track) and 3rd place (synthetic track) in NTI…☆14Jun 11, 2022Updated 3 years ago
- [ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation☆204Jun 8, 2025Updated 9 months ago
- [TPAMI 2024] Self-Supervised Learning for Real-World Super-Resolution from Dual and Multiple Zoomed Observations☆41May 9, 2024Updated last year
- [ICLR 2026] Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation☆794Oct 2, 2025Updated 5 months ago
- [NeurIPS 2025] PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation☆121Feb 22, 2026Updated last month
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆42Jan 9, 2026Updated 2 months ago
- [ICLR 2026] Implementation of the paper "Learning Unified Representation of 3D Gaussian Splatting". Rethinking 3DGS representation in neu…☆42Feb 11, 2026Updated last month
- [CVPR 2022] Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis☆36Oct 31, 2022Updated 3 years ago
- The official implementation of "Compositional Generative Model of Unbounded 4D Cities". (TPAMI 2026)☆135Dec 6, 2025Updated 3 months ago
- [IEEE TMM 2024] NIR-Assisted Image Denoising: A Selective Fusion Approach and A Real-World Benchmark Dataset☆21Feb 23, 2025Updated last year
- One-shot 3D Object Canonicalization based on Geometric and Semantic Consistency(CVPR highlight 2025)☆73Dec 15, 2025Updated 3 months ago
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations ".☆25Feb 14, 2026Updated last month
- Consistent Autoregressive Video Generation with Long Context☆75Feb 6, 2026Updated last month
- [NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"☆413Sep 19, 2025Updated 6 months ago
- Enhanced Generative Structure Prior for Text Image Super-Resolution [TPAMI]☆68Aug 20, 2025Updated 7 months ago
- ☆41Dec 15, 2023Updated 2 years ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆46Mar 5, 2024Updated 2 years ago
- [IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation☆15Feb 13, 2026Updated last month
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers☆41Jul 23, 2025Updated 7 months ago
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆36Oct 15, 2025Updated 5 months ago
- [PRCV 2023] RBSR: Efficient and Flexible Recurrent Network for Burst Super-Resolution☆49Oct 16, 2024Updated last year
- Official repo for: Epipolar Geometry Improves Video Generation Models☆81Oct 28, 2025Updated 4 months ago