[CVPR 2026] Dataset and Benchmark code for EgoEdit
★152 · Apr 5, 2026 · Updated 2 months ago
Alternatives and similar repositories for EgoEdit
Users that are interested in EgoEdit are comparing it to the libraries listed below.
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset ★113 · Feb 25, 2026 · Updated 4 months ago
- ★90 · May 13, 2026 · Updated last month
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation" ★75 · Feb 26, 2026 · Updated 4 months ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using … ★334 · Dec 15, 2025 · Updated 6 months ago
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation ★76 · Apr 28, 2026 · Updated 2 months ago
- [CVPR'26 Highlight] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset ★609 · Jun 1, 2026 · Updated last month
- [CVPR2026] Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentangle… ★210 · May 12, 2026 · Updated last month
- A unified and fully open-source framework for instruction-guided and reference-guided video editing using natural language. ★297 · May 13, 2026 · Updated last month
- OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer ★233 · Apr 15, 2026 · Updated 2 months ago
- [CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO ★120 · Feb 28, 2026 · Updated 4 months ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models ★161 · Mar 4, 2026 · Updated 4 months ago
- Batch video captioning using Qwen3-VL-8B vision-language model ★80 · Apr 19, 2026 · Updated 2 months ago
- OmniShotCut is a sensitive and more informative SoTA method for the Shot Boundary Detection task. ★242 · Jun 1, 2026 · Updated last month
- A framework for camera-controllable image editing using unified geometric guidance and video models. ★66 · Jun 25, 2026 · Updated last week
- [ICML 2026][Ultra Powerful Few-Step Diffusion RL] TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward ★107 · May 25, 2026 · Updated last month
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars" ★43 · Mar 24, 2026 · Updated 3 months ago
- ★67 · Aug 8, 2025 · Updated 10 months ago
- Official code for paper Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models ★87 · Apr 23, 2026 · Updated 2 months ago
- Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video ★218 · May 30, 2026 · Updated last month
- A web UI that simplifies AI video generation using the Hunyuan Video Diffusion Model ★17 · Dec 27, 2024 · Updated last year
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026] ★87 · Mar 3, 2026 · Updated 4 months ago
- The official repository of EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model (CVPRW 2026) ★50 · Apr 19, 2026 · Updated 2 months ago
- Official training code for MUG-V 10B video generation model. Built on Megatron-LM (v0.14.0) with production-ready distributed training fo… ★20 · Oct 20, 2025 · Updated 8 months ago
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction” ★34 · Apr 3, 2026 · Updated 3 months ago
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance ★638 · Jan 5, 2026 · Updated 5 months ago
- End2End Virtual Try-on with Visual Reference, CVPR2026 ★68 · Apr 18, 2026 · Updated 2 months ago
- ★103 · Mar 13, 2026 · Updated 3 months ago
- Audio-video joint generation ★58 · Nov 27, 2025 · Updated 7 months ago
- ★48 · Oct 29, 2025 · Updated 8 months ago
- [ECCV 2026] DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer ★653 · May 22, 2026 · Updated last month
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models ★94 · Sep 11, 2025 · Updated 9 months ago
- [AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing ★24 · Nov 20, 2025 · Updated 7 months ago
- SpotEdit: Selective Region Editing in Diffusion Transformers ★196 · Jan 5, 2026 · Updated 6 months ago
- Official repository for the paper "Instance-Wise Holistic Order Prediction in Natural Scenes". ★29 · Jan 11, 2024 · Updated 2 years ago
- Official PyTorch Implementation of Ctrl-Crash 🔥 ★53 · Jun 3, 2025 · Updated last year
- Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding (ECCV 2026) ★72 · Updated this week
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization". ★179 · Feb 4, 2026 · Updated 5 months ago
- UniMesh: Unifying 3D Mesh Understanding and Generation ★57 · May 8, 2026 · Updated last month
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning ★90 · Mar 26, 2026 · Updated 3 months ago