worv-ai / D2E
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI
☆68 · Jan 15, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for D2E
Users who are interested in D2E are comparing it to the repositories listed below.
- Scaling Zero-Shot Reference-to-Video Generation ☆62 · Dec 11, 2025 · Updated 2 months ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars" ☆41 · Nov 20, 2025 · Updated 2 months ago
- Official implementation of "DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training". ☆211 · Nov 5, 2025 · Updated 3 months ago
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control ☆166 · Dec 11, 2025 · Updated 2 months ago
- Any-to-Bokeh is a novel one-step video bokeh framework that converts arbitrary input videos into temporally coherent, depth-aware bokeh e… ☆122 · Feb 4, 2026 · Updated last week
- 👋 Dataset and Benchmark code for EgoEdit ☆106 · Dec 11, 2025 · Updated 2 months ago
- End2End Virtual Try-on with Visual Reference ☆57 · Nov 19, 2025 · Updated 2 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization". ☆168 · Feb 4, 2026 · Updated last week
- [ICLR 2026] Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation ☆379 · Jan 28, 2026 · Updated 2 weeks ago
- VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control ☆269 · Updated this week
- ☆26 · Aug 6, 2025 · Updated 6 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation" ☆70 · Jan 10, 2026 · Updated last month
- Animate Any Character in Any World ☆88 · Jan 9, 2026 · Updated last month
- ☆12 · Nov 1, 2023 · Updated 2 years ago
- ☆17 · Aug 6, 2025 · Updated 6 months ago
- ToonOut, a fork of BiRefNet focused on background removal for anime images. We open-source our dataset & our weights. See our paper at: h… ☆81 · Sep 10, 2025 · Updated 5 months ago
- ☆11 · Jan 7, 2026 · Updated last month
- ☆11 · Apr 30, 2023 · Updated 2 years ago
- This repository is the project page for "Point Anywhere: Directed Object Estimation from Omnidirectional Images", including source code … ☆12 · Aug 25, 2023 · Updated 2 years ago
- The implementation of Extreme Viewpoint 4D Video Generation ☆253 · Sep 6, 2025 · Updated 5 months ago
- [AAAI 2026] UltraGen ☆78 · Feb 1, 2026 · Updated last week
- Simple in-browser stable diffusion prompt reader ☆40 · Oct 7, 2023 · Updated 2 years ago
- SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time ☆96 · Jan 1, 2026 · Updated last month
- This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning". ☆14 · Dec 4, 2025 · Updated 2 months ago
- Just wanna see what type and how many GPUs/TPUs are used in CVPR 2025 oral papers. Fun vibe coding with LLMs. ☆12 · Apr 24, 2025 · Updated 9 months ago
- ☆12 · Sep 19, 2021 · Updated 4 years ago
- The official baseline implementations for Chronocept ☆10 · Dec 21, 2025 · Updated last month
- [ICCV 2025] Official repo of "EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow" ☆26 · Oct 16, 2025 · Updated 3 months ago
- ☆12 · May 4, 2023 · Updated 2 years ago
- ☆11 · May 22, 2024 · Updated last year
- ComfyUI custom node to convert latent to RGB ☆12 · Jun 23, 2024 · Updated last year
- Resilient multi-LLM orchestration with in-built failure handling, rate limits, retries, and circuit breaker. ☆29 · Feb 4, 2026 · Updated last week
- ☆11 · Jan 18, 2024 · Updated 2 years ago
- [NIPS 2025] FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens ☆20 · Oct 12, 2025 · Updated 4 months ago
- Provides an interface for extensions to use language models directly in the browser. ☆15 · Feb 7, 2026 · Updated last week
- FormulaOne: A dataset of algorithmic problems based on MSO formulas. ☆24 · Aug 14, 2025 · Updated 6 months ago
- The Infinite Jukebox algorithm extracted from https://github.com/UnderMybrella/EternalJukebox. Decoupled from the code that does renderin… ☆13 · Sep 28, 2023 · Updated 2 years ago
- Official code for the paper: Depth Anything At Any Condition ☆322 · Aug 21, 2025 · Updated 5 months ago
- [ICCV 2025] Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models ☆566 · Updated this week