[CVPR 2026] π Dataset and Benchmark code for EgoEdit
β147Apr 5, 2026Updated last month
Alternatives and similar repositories for EgoEdit
Users that are interested in EgoEdit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Phantom-Data: Towards a General Subject-Consistent Video Generation Datasetβ110Feb 25, 2026Updated 3 months ago
- β89May 13, 2026Updated last week
- [CVPR 2026] FaceCam: Portrait Video Camera Control via Scale-Aware Conditioningβ54Mar 26, 2026Updated last month
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"β73Feb 26, 2026Updated 2 months ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using β¦β327Dec 15, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generationβ72Apr 28, 2026Updated 3 weeks ago
- [CVPR'26 Highlight] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Datasetβ603Oct 29, 2025Updated 6 months ago
- [CVPR2026] Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentangleβ¦β196May 12, 2026Updated last week
- A unified and fully open-source framework for instruction-guided and reference-guided video editing using natural language.β275May 13, 2026Updated last week
- Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Videoβ121May 17, 2026Updated last week
- OmniTransfer: All-in-one Framework for Spatio-temporal Video Transferβ232Apr 15, 2026Updated last month
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Modelsβ159Mar 4, 2026Updated 2 months ago
- Batch video captioning using Qwen3-VL-8B vision-language modelβ80Apr 19, 2026Updated last month
- OmniShotCut is a sensitive and more informative SoTA on Shot Boundary Detection task.β198May 4, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A framework for camera-controllable image editing using unified geometric guidance and video models.β65Apr 28, 2026Updated 3 weeks ago
- [Ultra Powerful Few-Step Diffusion RL] TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Rewardβ94Mar 11, 2026Updated 2 months ago
- β67Aug 8, 2025Updated 9 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]β82Mar 3, 2026Updated 2 months ago
- Official code for paper Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Modelsβ78Apr 23, 2026Updated last month
- A Web UI simplify the AI videos generation using Hunyuan Video Diffusion Modelβ18Dec 27, 2024Updated last year
- The official repository of EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model (CVPRW 2026)β48Apr 19, 2026Updated last month
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidanceβ628Jan 5, 2026Updated 4 months ago
- End2End Virtual Try-on with Visual Reference, CVPR2026β64Apr 18, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- β101Mar 13, 2026Updated 2 months ago
- Audio-video joint generationβ57Nov 27, 2025Updated 5 months ago
- Official repository for the paper "Instance-Wise Holistic Order Prediction in Natural Scenes".β27Jan 11, 2024Updated 2 years ago
- β46Oct 29, 2025Updated 6 months ago
- DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformerβ639Updated this week
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion modelsβ94Sep 11, 2025Updated 8 months ago
- [AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editingβ24Nov 20, 2025Updated 6 months ago
- SpotEdit:Selective Region Editing in Diffusion Transformersβ190Jan 5, 2026Updated 4 months ago
- Official PyTorch Implementation of Ctrl-Crash π₯β52Jun 3, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understandingβ69Apr 29, 2026Updated 3 weeks ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".β178Feb 4, 2026Updated 3 months ago
- [arXiv 2025.12] Animate Any Character in Any Worldβ97Mar 10, 2026Updated 2 months ago
- UniMesh: Unifying 3D Mesh Understanding and Generationβ56May 8, 2026Updated 2 weeks ago
- Resilient multi-LLM orchestration with in-built failure handing, rate limits, retries, and circuit breaker.β45Mar 23, 2026Updated 2 months ago
- [CVPR2025] Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editingβ26Aug 23, 2025Updated 9 months ago
- Krea Realtime 14B. An open-source realtime AI video model.β549Nov 13, 2025Updated 6 months ago