worv-ai / D2E
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI
☆66 Updated 3 weeks ago
Alternatives and similar repositories for D2E
Users that are interested in D2E are comparing it to the libraries listed below
- Scaling Zero-Shot Reference-to-Video Generation ☆59 Updated last month
- Animate Any Character in Any World ☆82 Updated 2 weeks ago
- One-shot and Few-shot 3D Editing without Per-Scene Optimization ☆160 Updated 4 months ago
- 👋 Dataset and Benchmark code for EgoEdit ☆99 Updated last month
- ☆227 Updated 5 months ago
- 🎨 A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space ☆151 Updated last month
- iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation ☆182 Updated last month
- ☆298 Updated this week
- ☆82 Updated 2 weeks ago
- [ArXiv 2025] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models ☆121 Updated 2 weeks ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning" ☆38 Updated 3 weeks ago
- The implementation of Extreme Viewpoint 4D Video Generation ☆250 Updated 4 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing ☆49 Updated 2 weeks ago
- SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction ☆261 Updated 2 weeks ago
- FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation… ☆292 Updated this week
- Official implementation of "LoFA: Learning to Predict Personalized Prior for Fast Adaptation of Visual Generative Models". ☆30 Updated 3 weeks ago
- Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing ☆115 Updated last week
- Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation" ☆346 Updated 2 months ago
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation ☆304 Updated 6 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models ☆88 Updated 4 months ago
- PlayerOne: Egocentric World Simulator ☆181 Updated 6 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video ☆44 Updated 4 months ago
- ☆61 Updated 3 weeks ago
- Official repository of paper "ProEdit: Inversion-based Editing From Prompts Done Right" ☆91 Updated last week
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion ☆294 Updated 5 months ago
- VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning ☆60 Updated 2 months ago
- ☆91 Updated 4 months ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation" ☆220 Updated 4 months ago
- The official implementation of "RepVideo: Rethinking Cross-Layer Representation for Video Generation" ☆123 Updated 11 months ago
- [AAAI 2026] UltraGen ☆79 Updated 2 months ago