Long-range camera-conditioned scene generation from a single image.
☆107Dec 23, 2025Updated 3 months ago
Alternatives and similar repositories for WorldWarp
Users that are interested in WorldWarp are comparing it to the libraries listed below.
- Pytorch implementation of: "Continual Semantic Segmentation via Structure Preserving and Projected Feature Alignment", ECCV22☆11Jul 22, 2022Updated 3 years ago
- [Technical Report] A Comprehensive Evaluation of Nano Banana Pro on 14 Low-Level Vision Tasks and 40 Datasets☆72Dec 24, 2025Updated 3 months ago
- Code implementation for: From Virtual Games to Real-World Play☆46Jun 23, 2025Updated 9 months ago
- Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing☆150Mar 5, 2026Updated 3 weeks ago
- Vision Bridge Transformer at Scale☆139Dec 1, 2025Updated 3 months ago
- SpotEdit: Selective Region Editing in Diffusion Transformers☆176Jan 5, 2026Updated 2 months ago
- [SIGGRAPH Asia'25] Enabling Reference-based Camera Control via Context without Explicit 3D Estimation☆156Jan 18, 2026Updated 2 months ago
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆19Dec 28, 2024Updated last year
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆45Nov 25, 2025Updated 4 months ago
- [ECCV 2024] Official code for: SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer☆113Jun 30, 2025Updated 8 months ago
- [ICCV 2025] LightSwitch: Multi-view Relighting with Material-guided Diffusion☆62Aug 13, 2025Updated 7 months ago
- PISCO: Precise Video Instance Insertion with Sparse Control☆53Feb 13, 2026Updated last month
- [ICCV2025] LONG3R: Long Sequence Streaming 3D Reconstruction☆41Jul 25, 2025Updated 8 months ago
- MMD viewer powered by Babylon.js and babylon-mmd☆16Aug 2, 2025Updated 7 months ago
- Official implementation of Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse Videos☆40Sep 30, 2025Updated 5 months ago
- Adaptive and Temporally Consistent Gaussian Surfels for Multi-view Dynamic Reconstruction☆88Jun 11, 2025Updated 9 months ago
- HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing☆238Mar 18, 2026Updated last week
- Official repo for UAE☆172Dec 29, 2025Updated 2 months ago
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆60Feb 22, 2026Updated last month
- [ICCV2025] Extrapolated Urban View Synthesis Benchmark☆48Oct 1, 2025Updated 5 months ago
- code release for HouseCrafter (ICCV 2025 Highlight)☆74Oct 23, 2025Updated 5 months ago
- [NeurIPS'25 Spotlight] GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction☆169Jan 1, 2026Updated 2 months ago
- Code for our TVCG paper "DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera".☆19Aug 22, 2025Updated 7 months ago
- Official implementation for the AAAI2025 paper "PIXELS - Progressive Image Xemplar-based Editing with Latent Surgery"☆11Dec 17, 2024Updated last year
- ☆25Mar 30, 2025Updated 11 months ago
- [ICCV 2025] Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models☆581Feb 12, 2026Updated last month
- FlashTex: Fast Relightable Mesh Texturing with LightControlNet☆166Dec 12, 2024Updated last year
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated last year
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated 11 months ago
- Dexterous World Models☆76Updated this week
- Open source impl of **MV-DUSt3R+ Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds** from Meta Reality Labs. Project page …☆581Updated this week
- Gaga: Group Any Gaussians via 3D-aware Memory Bank☆402Aug 4, 2025Updated 7 months ago
- [ECCV 2024] Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation☆38Mar 3, 2025Updated last year
- [ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints☆682May 23, 2025Updated 10 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆314Dec 23, 2024Updated last year
- [ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks☆30Feb 5, 2026Updated last month
- ☆60Jun 8, 2025Updated 9 months ago
- ☆16Sep 16, 2025Updated 6 months ago
- [ICLR 2026 oral] Official code for VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator☆118Mar 5, 2026Updated 3 weeks ago