worv-ai/D2E

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]

☆89

Alternatives and similar repositories for D2E

Users who are interested in D2E are comparing it to the repositories listed below.

Kiloforge / kiloforge
1,000x Productivity. Command AI agent swarms and ship code at the speed of thought.
☆17 · Mar 22, 2026 · Updated 3 months ago
worv-ai / CostNav
CostNav: A Navigation Benchmark for Real-World Economic-Cost Evaluation of Physical AI Agents
☆22 · May 14, 2026 · Updated 2 months ago
open-world-agents / open-world-agents
Everything you need to build a state-of-the-art multimodal desktop foundation agent, end-to-end.
☆43 · Jul 9, 2026 · Updated last week
open-world-agents / ocap
High-performance desktop recorder for Windows. Captures screen, audio, keyboard, mouse, and window events.
☆36 · Jul 9, 2026 · Updated last week
worv-ai / canvas
CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction [ICRA 2025]
☆18 · Oct 20, 2025 · Updated 9 months ago
worv-ai / LightManager
An advanced Omniverse Isaac Sim extension for dynamic management and real-time animation of scene lighting, enabling more flexible, inter…
☆15 · Aug 29, 2025 · Updated 10 months ago
open-world-agents / desktop-env
A real-time, high-frequency, real-world desktop environment that is suitable for desktop-based ML development (agents, world models, etc.…
☆14 · Jan 23, 2025 · Updated last year
alohays / openai-tool2mcp
MCP wrapper for OpenAI built-in tools
☆12 · Mar 13, 2025 · Updated last year
elefant-ai / open-p2p
Official repo for the paper "Scaling Behavior Cloning Improves Causal Reasoning: An Open Model for Real-Time Video Game Playing"
☆169 · Feb 6, 2026 · Updated 5 months ago
felixtaubner / mvp4d
Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"
☆43 · Mar 24, 2026 · Updated 3 months ago
riiid / PPAP
Official PyTorch implementation of "Towards Practical Plug-and-Play Diffusion Models" (CVPR 2023)
☆22 · Jul 22, 2023 · Updated 2 years ago
hyeon-cho / Tangential-Amplifying-Guidance
[ICML 2026] Official Implementation of "TAG: Tangential Amplifying Guidance for Hallucination-Resistant Sampling"
☆42 · Jul 6, 2026 · Updated 2 weeks ago
gohyojun15 / ANT_diffusion
[NeurIPS 2023] Official PyTorch implementation of "Addressing Negative Transfer in Diffusion Models"
☆24 · Jul 4, 2024 · Updated 2 years ago
cloneisyou / HEVEC
A Vector Database Powered by Homomorphic Encryption
☆19 · Feb 3, 2026 · Updated 5 months ago
elefant-ai / recap
☆16 · Jan 6, 2026 · Updated 6 months ago
cvlab-kaist / WorldCam
Code Implementation of "WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation"
☆175 · May 9, 2026 · Updated 2 months ago
xbyym / StableWorld
StableWorld: Towards Stable and Consistent Long Interactive Video Generation
☆97 · Mar 18, 2026 · Updated 4 months ago
eric-zqwang / CLiFT
Code for the paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)]
☆78 · Jul 6, 2026 · Updated 2 weeks ago
delta0-inc / heimdall
Open Source Observability Platform for MCP Servers & Apps
☆30 · Feb 4, 2026 · Updated 5 months ago
MilkClouds / vla0-trl
Unofficial reimplementation of VLA-0 using TRL's SFTTrainer.
☆79 · Feb 20, 2026 · Updated 5 months ago
KangLiao929 / Puffin
[ICLR 2026] Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
☆419 · Feb 18, 2026 · Updated 5 months ago
franciszzj / Saber
[CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation
☆76 · Apr 28, 2026 · Updated 2 months ago
AIGeeksGroup / Code2Worlds
[ICML 2026] Code2Worlds: Empowering Coding LLMs for 4D World Generation
☆118 · Jun 3, 2026 · Updated last month
cvlab-kaist / MATRIX
Official implementation of "MATRIX: Mask Track Alignment for Interaction-aware Video Generation" (ICLR 2026)
☆43 · Apr 2, 2026 · Updated 3 months ago
hletrd / kiwi-paper
🥝 Turns dry papers and API docs into Namuwiki-style articles. Supports Claude Code, OpenCode, Codex, and Gemini CLI.
☆15 · Mar 28, 2026 · Updated 3 months ago
byeongjun-park / SteerX
[ICCV 2025] Official PyTorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"
☆50 · Mar 20, 2025 · Updated last year
cvlab-kaist / TETO
Official implementation of "TETO: Tracking Events with Teacher Observation for Motion Estimation and Frame Interpolation".
☆17 · Mar 25, 2026 · Updated 3 months ago
YS-IMTech / PermaVid
[Official Code] PermaVid: Consistent Video Generation Across Edits via Disentangled Context Memory
☆43 · Jun 17, 2026 · Updated last month
solaris-wm / solaris-engine
Scalable Minecraft multiplayer data collection engine
☆139 · Apr 23, 2026 · Updated 2 months ago
cvlab-kaist / SpikeMatch
☆19 · Sep 29, 2025 · Updated 9 months ago
cvlab-kaist / ReNoV
☆21 · Feb 14, 2026 · Updated 5 months ago
isle-dev / MetricEval
MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…
☆12 · Nov 6, 2023 · Updated 2 years ago
showlab / Olaf-World
[ICML 2026] Orienting Latent Actions for Video World Modeling
☆116 · Apr 20, 2026 · Updated 3 months ago
cvlab-kaist / DA-Flow
Official implementation of "DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models"
☆25 · Mar 26, 2026 · Updated 3 months ago
seahl0119 / ImprovedMeanFlow
☆20 · Nov 24, 2025 · Updated 7 months ago
ZheningHuang / SpaceTimePilot
[CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time
☆122 · May 17, 2026 · Updated 2 months ago
TingtingLiao / mimix
☆83 · Oct 13, 2025 · Updated 9 months ago
CIntellifusion / MultiWorld
Official implementation of "MultiWorld: Scalable Multi-Agent Multi-View Video World Models"
☆247 · May 12, 2026 · Updated 2 months ago
cvlab-kaist / V-Warper
Official implementation of "V-Warper: Appearance-Consistent Video Diffusion Personalization via Value Warping"
☆21 · Jun 4, 2026 · Updated last month