VisualComputingInstitute / diffusion-e2e-ftLinks
[WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
☆494Updated last year
Alternatives and similar repositories for diffusion-e2e-ft
Users that are interested in diffusion-e2e-ft are comparing it to the libraries listed below
Sorting:
- Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction☆768Updated 3 weeks ago
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆215Updated 11 months ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors☆275Updated 10 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆509Updated last year
- [CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis☆361Updated last year
- [CVPR 2025] Code for Segment Any Motion in Videos☆448Updated 6 months ago
- [MM24] Official codes and datasets for ACM MM24 paper "Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models"…☆281Updated last year
- [T-PAMI 2025] V3D: Video Diffusion Models are Effective 3D Generators☆512Updated last year
- Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"☆308Updated 8 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆306Updated this week
- ☆303Updated last year
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆278Updated last month
- Orient Anything, ICML 2025☆359Updated 2 months ago
- High-quality and editable surfel 3D Gaussian generation through native 3D diffusion (ICLR 2025)☆380Updated 7 months ago
- ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).☆377Updated 8 months ago
- [CVPR 2025] RollingDepth: Video Depth without Video Models☆594Updated 9 months ago
- Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).☆368Updated 9 months ago
- [ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi …☆326Updated last year
- [NeurIPS 2024] HDR 3D Scene Editing!☆231Updated last month
- [ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image☆930Updated last year
- [ICCV 2025] VistaDream: Sampling multiview consistent images for single-view scene reconstruction☆518Updated 5 months ago
- Official code for the paper: Depth Anything At Any Condition☆312Updated 4 months ago
- [ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors☆417Updated 2 months ago
- [CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step☆328Updated 5 months ago
- [AAAI 2025] Elevating Flow-Guided Video Inpainting with Reference Generation☆88Updated 6 months ago
- 🍳 [CVPR'24 Highlight] Pytorch implementation of "Taming Stable Diffusion for Text to 360° Panorama Image Generation"☆250Updated 3 weeks ago
- [ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction☆392Updated 6 months ago
- Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image☆291Updated 6 months ago
- [NeurIPS 2024] Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models☆331Updated 11 months ago
- AllTracker is a model for tracking all pixels in a video.☆378Updated 3 months ago