VisualComputingInstitute / diffusion-e2e-ftLinks
[WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
☆489Updated 11 months ago
Alternatives and similar repositories for diffusion-e2e-ft
Users that are interested in diffusion-e2e-ft are comparing it to the libraries listed below
Sorting:
- Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction☆759Updated last week
- ☆301Updated last year
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆214Updated 10 months ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors☆275Updated 9 months ago
- [MM24] Official codes and datasets for ACM MM24 paper "Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models"…☆279Updated last year
- [CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis☆360Updated last year
- [CVPR 2025] Code for Segment Any Motion in Videos☆439Updated 5 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆508Updated last year
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆273Updated 2 weeks ago
- Orient Anything, ICML 2025☆349Updated last month
- High-quality and editable surfel 3D Gaussian generation through native 3D diffusion (ICLR 2025)☆376Updated 6 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆305Updated 3 months ago
- Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"☆308Updated 8 months ago
- [T-PAMI 2025] V3D: Video Diffusion Models are Effective 3D Generators☆508Updated last year
- ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).☆377Updated 8 months ago
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi …☆323Updated 11 months ago
- [CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step☆328Updated 5 months ago
- [CVPR 2025] RollingDepth: Video Depth without Video Models☆590Updated 8 months ago
- Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).☆362Updated 8 months ago
- [NeurIPS 2024] Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models☆328Updated 10 months ago
- [arXiv 2023] DreamGaussian4D: Generative 4D Gaussian Splatting☆589Updated last year
- 🍳 [CVPR'24 Highlight] Pytorch implementation of "Taming Stable Diffusion for Text to 360° Panorama Image Generation"☆249Updated last year
- [ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image☆927Updated 11 months ago
- Official implementation of L-MAGIC☆133Updated 3 months ago
- [ICCV 2025] VistaDream: Sampling multiview consistent images for single-view scene reconstruction☆511Updated 5 months ago
- [ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors☆413Updated 2 months ago
- [ICLR 2025] 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting☆255Updated last year
- [ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction☆386Updated 6 months ago
- [TPAMI 2025, NeurIPS 2024] Video4DGen: Enhancing Video and 4D Generation through Mutual Optimization☆366Updated 10 months ago
- AllTracker is a model for tracking all pixels in a video.☆372Updated 3 months ago