VideoDirector [CVPR 2025]
☆36 · Nov 25, 2025 · Updated 7 months ago
Alternatives and similar repositories for Video_Director
Users that are interested in Video_Director are comparing it to the libraries listed below.
- ☆28 · Apr 4, 2025 · Updated last year
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video … ☆158 · Mar 24, 2025 · Updated last year
- ☆13 · Dec 16, 2024 · Updated last year
- [ACMMM 2025] ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies ☆22 · Jun 20, 2025 · Updated last year
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model ☆13 · Dec 29, 2024 · Updated last year
- Audio-video joint generation ☆58 · Nov 27, 2025 · Updated 7 months ago
- CVPR 2025' Instruct-4DGS: Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation ☆33 · Sep 21, 2025 · Updated 9 months ago
- [CVPR 2025] Official code of "DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting" ☆56 · Sep 5, 2025 · Updated 9 months ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing ☆26 · Dec 8, 2024 · Updated last year
- [NeurIPS 2025 Spotlight] Official implementation for DNAEdit: Direct Noise Alignment for Text-Guided Rectified Flow Editing ☆32 · Jan 23, 2026 · Updated 5 months ago
- [ICCV 2025] Edicho: Consistent Image Editing in the Wild ☆127 · Oct 22, 2025 · Updated 8 months ago
- [ICLR 2025] FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise ☆14 · Mar 5, 2025 · Updated last year
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026 ☆37 · Nov 24, 2025 · Updated 7 months ago
- [CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies ☆72 · Aug 22, 2025 · Updated 10 months ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos' ☆13 · Jun 16, 2024 · Updated 2 years ago
- CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion. ☆58 · Apr 28, 2025 · Updated last year
- [CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering ☆52 · Jun 19, 2025 · Updated last year
- Human-centric environment representations from egocentric video ☆15 · Feb 5, 2026 · Updated 4 months ago
- Training recipe for SpatialReasoner [NeurIPS 2025] ☆45 · Apr 5, 2026 · Updated 2 months ago
- Official implementation of "Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting". ☆24 · Oct 18, 2025 · Updated 8 months ago
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation ☆33 · Mar 16, 2024 · Updated 2 years ago
- 🦊 Quiet Node's portfolio | Full-stack Web 3.0 Software Developer | Blockchain Smart Contract Enthusiast ☆13 · Dec 19, 2023 · Updated 2 years ago
- ☆13 · May 27, 2025 · Updated last year
- Probabilistic lifElong Test-time Adaptation with seLf-training prior (PETAL) ☆13 · Sep 8, 2023 · Updated 2 years ago
- Official repository of "KDFAS: Multi-Stage Knowledge Distillation Vision Transformer for Face Anti-Spoofing". ☆10 · Oct 9, 2024 · Updated last year
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 · Oct 8, 2023 · Updated 2 years ago
- Codes of PostEdit ☆23 · Apr 28, 2025 · Updated last year
- [CVPR 2023] Regularizing Second-Order Influences for Continual Learning ☆38 · May 19, 2023 · Updated 3 years ago
- OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models ☆21 · Feb 20, 2025 · Updated last year
- A semi-weakly supervised object detection technique based on monte carlo sampling for pseudo GT boxes ☆12 · Apr 10, 2022 · Updated 4 years ago
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation". ☆15 · Feb 24, 2025 · Updated last year
- HD-EPIC Python script to download the entire datasets or parts of it ☆22 · Oct 7, 2025 · Updated 8 months ago
- [NeurIPS2025] The official implementation of MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO ☆141 · Oct 15, 2025 · Updated 8 months ago
- RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins ☆12 · Sep 20, 2024 · Updated last year
- ☆18 · Sep 27, 2025 · Updated 9 months ago
- LightGaussian tailored for large-scale scene. Used by https://github.com/DekuLiuTesla/CityGaussian ☆12 · Oct 9, 2024 · Updated last year
- Code for the paper "IFFNeRF: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model" ☆12 · May 26, 2024 · Updated 2 years ago
- code for "EMS: 3D Eyebrow Modeling from Single-view Images"(SIGGRAPH Asia 2023) ☆14 · May 3, 2025 · Updated last year
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation ☆36 · Mar 10, 2026 · Updated 3 months ago