worv-ai / D2E
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI
☆46Updated this week
Alternatives and similar repositories for D2E
Users who are interested in D2E are comparing it to the repositories listed below.
- Official implementation of "DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training".☆103Updated this week
- ☆219Updated 3 months ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆169Updated 2 months ago
- ☆179Updated 2 months ago
- One-shot and Few-shot 3D Editing without Per-Scene Optimization☆158Updated 2 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models☆89Updated last month
- ☆318Updated 2 months ago
- The implementation of Extreme Viewpoint 4D Video Generation☆243Updated last month
- ToonOut, a fork of BiRefNet focused on background removal for anime images. We open-source our dataset & our weights. See our paper at: h…☆68Updated last month
- Lynx: Towards High-Fidelity Personalized Video Generation☆260Updated 3 weeks ago
- ☆119Updated last week
- Any-to-Bokeh is a novel one-step video bokeh framework that converts arbitrary input videos into temporally coherent, depth-aware bokeh e…☆112Updated 3 months ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆138Updated 3 weeks ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)]☆71Updated 2 months ago
- Official Implementation of DRA-Ctrl (Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis)☆118Updated 2 months ago
- project for skyreels-a3☆74Updated 2 months ago
- [SIGGRAPH-ASIA 2025] Official implementation of "VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Model…☆104Updated last week
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆27Updated last week
- Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation☆147Updated last week
- The official implementation of "RepVideo: Rethinking Cross-Layer Representation for Video Generation"☆121Updated 8 months ago
- Official Implementation of "Instance Segmentation of Scene Sketches Using Natural Image Priors" (SIGGRAPH 2025)☆78Updated last month
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆46Updated 2 months ago
- High-Quality Text-to-Video Generation with Alpha Channel☆244Updated 3 weeks ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".☆147Updated 2 weeks ago
- OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions (NeurIPS 2025)☆80Updated last month
- PlayerOne: Egocentric World Simulator☆147Updated 4 months ago
- ☆94Updated 4 months ago
- ☆104Updated last month
- [SIGGRAPH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation☆84Updated 2 months ago
- [ACM MM 2025] HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation☆136Updated last month