[CVPR 2026] π Dataset and Benchmark code for EgoEdit
β107Feb 21, 2026Updated last week
Alternatives and similar repositories for EgoEdit
Users that are interested in EgoEdit are comparing it to the libraries listed below
Sorting:
- β86Feb 4, 2026Updated last month
- Scaling Zero-Shot Reference-to-Video Generationβ62Dec 11, 2025Updated 2 months ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using β¦β307Dec 15, 2025Updated 2 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"β69Updated this week
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"β41Nov 20, 2025Updated 3 months ago
- End2End Virtual Try-on with Visual Reference, CVPR2026β58Nov 19, 2025Updated 3 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]β72Jan 15, 2026Updated last month
- [CVPR 2026] OmniTransfer: All-in-one Framework for Spatio-temporal Video Transferβ224Feb 21, 2026Updated last week
- Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentanglement.β168Dec 18, 2025Updated 2 months ago
- Official PyTorch Implementation of Ctrl-Crash π₯β51Jun 3, 2025Updated 9 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".β172Feb 4, 2026Updated last month
- β39Oct 29, 2025Updated 4 months ago
- Resilient multi-LLM orchestration with in-built failure handing, rate limits, retries, and circuit breaker.β29Updated this week
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Modelsβ154Sep 24, 2025Updated 5 months ago
- Phantom-Data: Towards a General Subject-Consistent Video Generation Datasetβ105Feb 25, 2026Updated last week
- SpotEdit:Selective Region Editing in Diffusion Transformersβ173Jan 5, 2026Updated 2 months ago
- Reflection Removal through Efficient Adaptation of Diffusion Transformersβ121Dec 5, 2025Updated 2 months ago
- Any-to-Bokeh is a novel one-step video bokeh framework that converts arbitrary input videos into temporally coherent, depth-aware bokeh eβ¦β123Feb 4, 2026Updated last month
- [AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editingβ25Nov 20, 2025Updated 3 months ago
- PICABench: How Far Are We from Physically Realistic Image Editing?β36Nov 5, 2025Updated 3 months ago
- Code for the paper Proactive Hearing Assistants that Isolate Egocentric Conversationsβ43Nov 19, 2025Updated 3 months ago
- Official implementation of "VideoMaMa: Mask-Guided Video Matting via Generative Prior", CVPR 2026β279Feb 7, 2026Updated 3 weeks ago
- VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Controlβ315Updated this week
- Official PyTorch implementation of paper βInsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Constructionββ33Jul 28, 2025Updated 7 months ago
- An example of ADK Agent for Long-form video generation with Veo 3.1 and Geminiβ91Nov 13, 2025Updated 3 months ago
- OmniGAIA: Towards Native Omni-Modal AI Agentsβ46Updated this week
- A transformers implementation of csm-streamingβ27May 16, 2025Updated 9 months ago
- Animate Any Character in Any Worldβ90Jan 9, 2026Updated last month
- Mobius: Text to Seamless Looping Video Generation via Latent Shiftβ174May 8, 2025Updated 9 months ago
- [CVPR2026] Code Release of MVInverse: Feedforward Multi-view Inverse Rendering in Secondsβ137Jan 22, 2026Updated last month
- Video Content Customization Using First Frameβ168Feb 25, 2026Updated last week
- β321Jan 24, 2026Updated last month
- ComfyUI-HiggsAudio is now available in ComfyUI, Higgs Audio v2 is a text-audio foundation model from Boson AI.β22Jul 26, 2025Updated 7 months ago
- [AAAI 2026] UltraGenβ77Feb 1, 2026Updated last month
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPOβ92Dec 1, 2025Updated 3 months ago
- DreamStyle: A Unified Framework for Video Stylizationβ109Jan 7, 2026Updated last month
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidanceβ581Jan 5, 2026Updated last month
- A Unified Visual Generator with Interleaved OmniModal Contextβ192Feb 10, 2026Updated 3 weeks ago
- SkyReels-A2: Compose anything in video diffusion transformersβ704Jun 3, 2025Updated 9 months ago