[CVPR 2026] Dataset and Benchmark code for EgoEdit
★150 · Apr 5, 2026 · Updated 2 months ago
Alternatives and similar repositories for EgoEdit
Users who are interested in EgoEdit are comparing it to the libraries listed below.
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset ★110 · Feb 25, 2026 · Updated 3 months ago
- ★89 · May 13, 2026 · Updated last month
- [CVPR 2026] FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning ★56 · Mar 26, 2026 · Updated 2 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation" ★75 · Feb 26, 2026 · Updated 3 months ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using … ★327 · Dec 15, 2025 · Updated 5 months ago
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation ★75 · Apr 28, 2026 · Updated last month
- [CVPR'26 Highlight] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset ★605 · Jun 1, 2026 · Updated last week
- [CVPR2026] Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentangle… ★201 · May 12, 2026 · Updated last month
- A unified and fully open-source framework for instruction-guided and reference-guided video editing using natural language. ★286 · May 13, 2026 · Updated last month
- OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer ★232 · Apr 15, 2026 · Updated last month
- [CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO ★119 · Feb 28, 2026 · Updated 3 months ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models ★159 · Mar 4, 2026 · Updated 3 months ago
- Batch video captioning using Qwen3-VL-8B vision-language model ★80 · Apr 19, 2026 · Updated last month
- OmniShotCut is a sensitive and more informative SoTA on Shot Boundary Detection task. ★221 · Jun 1, 2026 · Updated last week
- A framework for camera-controllable image editing using unified geometric guidance and video models. ★66 · Apr 28, 2026 · Updated last month
- [ICML 2026][Ultra Powerful Few-Step Diffusion RL] TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward ★103 · May 25, 2026 · Updated 2 weeks ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars" ★43 · Mar 24, 2026 · Updated 2 months ago
- Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video ★210 · May 30, 2026 · Updated 2 weeks ago
- Official code for paper Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models ★84 · Apr 23, 2026 · Updated last month
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026] ★83 · Mar 3, 2026 · Updated 3 months ago
- The official repository of EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model (CVPRW 2026) ★49 · Apr 19, 2026 · Updated last month
- Official training code for MUG-V 10B video generation model. Built on Megatron-LM (v0.14.0) with production-ready distributed training fo… ★20 · Oct 20, 2025 · Updated 7 months ago
- Official PyTorch implementation of paper "InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction" ★34 · Apr 3, 2026 · Updated 2 months ago
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance ★631 · Jan 5, 2026 · Updated 5 months ago
- End2End Virtual Try-on with Visual Reference, CVPR2026 ★68 · Apr 18, 2026 · Updated last month
- ★101 · Mar 13, 2026 · Updated 3 months ago
- Audio-video joint generation ★58 · Nov 27, 2025 · Updated 6 months ago
- ★47 · Oct 29, 2025 · Updated 7 months ago
- DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer ★647 · May 22, 2026 · Updated 3 weeks ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models ★94 · Sep 11, 2025 · Updated 9 months ago
- [AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing ★24 · Nov 20, 2025 · Updated 6 months ago
- SpotEdit: Selective Region Editing in Diffusion Transformers ★192 · Jan 5, 2026 · Updated 5 months ago
- Official repository for the paper "Instance-Wise Holistic Order Prediction in Natural Scenes". ★29 · Jan 11, 2024 · Updated 2 years ago
- Official PyTorch Implementation of Ctrl-Crash ★53 · Jun 3, 2025 · Updated last year
- Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding ★70 · Apr 29, 2026 · Updated last month
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization". ★178 · Feb 4, 2026 · Updated 4 months ago
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning ★88 · Mar 26, 2026 · Updated 2 months ago
- UniMesh: Unifying 3D Mesh Understanding and Generation ★57 · May 8, 2026 · Updated last month
- [arXiv 2512.17796] Animate Any Character in Any World ★96 · Mar 10, 2026 · Updated 3 months ago