VideoDirector [CVPR 2025]
☆35 · Nov 25, 2025 · Updated 4 months ago
Alternatives and similar repositories for Video_Director
Users who are interested in Video_Director are comparing it to the libraries listed below.
- ☆28 · Apr 4, 2025 · Updated last year
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video … ☆158 · Mar 24, 2025 · Updated last year
- [ECCV 2024] DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing ☆127 · Jul 19, 2025 · Updated 8 months ago
- [ACMMM 2025] ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies ☆22 · Jun 20, 2025 · Updated 9 months ago
- ☆13 · Dec 16, 2024 · Updated last year
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model ☆13 · Dec 29, 2024 · Updated last year
- [CVPR'25] DropoutGS: Dropouting out Gaussian for Better Sparse-view Rendering ☆53 · Nov 24, 2025 · Updated 4 months ago
- [NeurIPS 2025] Official implementation of "CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting" ☆34 · Mar 30, 2026 · Updated 2 weeks ago
- Audio-video joint generation ☆56 · Nov 27, 2025 · Updated 4 months ago
- Materialist: Physically Based Editing Using Single-Image Inverse Rendering ☆26 · Oct 24, 2025 · Updated 5 months ago
- ☆28 · Sep 29, 2024 · Updated last year
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing ☆25 · Dec 8, 2024 · Updated last year
- [ICCV 2025] Edicho: Consistent Image Editing in the Wild ☆125 · Oct 22, 2025 · Updated 5 months ago
- [ICLR 2025] FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise ☆14 · Mar 5, 2025 · Updated last year
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026 ☆37 · Nov 24, 2025 · Updated 4 months ago
- [ECCV'24] CoR-GS: Sparse-View 3D Gaussian Splatting via Co-Regularization ☆132 · Apr 2, 2025 · Updated last year
- [ICLR 2025] InstantSwap: This repo is the official implementation of "InstantSwap: Fast Customized Concept Swapping across Sharp Shape Di… ☆111 · Mar 16, 2025 · Updated last year
- A framework for Longitudinal Radiology Report Generation ☆27 · Aug 10, 2024 · Updated last year
- [CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies ☆72 · Aug 22, 2025 · Updated 7 months ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos' ☆13 · Jun 16, 2024 · Updated last year
- CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion. ☆54 · Apr 28, 2025 · Updated 11 months ago
- [CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering ☆47 · Jun 19, 2025 · Updated 9 months ago
- Human-centric environment representations from egocentric video ☆14 · Feb 5, 2026 · Updated 2 months ago
- Training recipe for SpatialReasoner [NeurIPS 2025] ☆41 · Apr 5, 2026 · Updated last week
- ☆13 · May 27, 2025 · Updated 10 months ago
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models" ☆19 · Feb 1, 2026 · Updated 2 months ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 · Oct 8, 2023 · Updated 2 years ago
- Codes of PostEdit ☆23 · Apr 28, 2025 · Updated 11 months ago
- [ICASSP 2025] Official code of "Exploring Kolmogorov-Arnold networks for realistic image sharpness assessment" ☆25 · Aug 31, 2025 · Updated 7 months ago
- OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models ☆19 · Feb 20, 2025 · Updated last year
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation". ☆13 · Feb 24, 2025 · Updated last year
- ☆139 · Oct 15, 2025 · Updated 5 months ago
- HD-EPIC Python script to download the entire datasets or parts of it ☆19 · Oct 7, 2025 · Updated 6 months ago
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors" ☆11 · Apr 30, 2023 · Updated 2 years ago
- Official Implementation of DMT: Dual Mean-Teacher in PyTorch. ☆10 · Oct 27, 2023 · Updated 2 years ago
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation ☆32 · Mar 10, 2026 · Updated last month
- RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins ☆12 · Sep 20, 2024 · Updated last year
- LightGaussian tailored for large-scale scene. Used by https://github.com/DekuLiuTesla/CityGaussian ☆12 · Oct 9, 2024 · Updated last year
- code for "EMS: 3D Eyebrow Modeling from Single-view Images" (SIGGRAPH Asia 2023) ☆13 · May 3, 2025 · Updated 11 months ago