[WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
☆504 · Jan 26, 2026 · Updated last month
Alternatives and similar repositories for diffusion-e2e-ft
Users who are interested in diffusion-e2e-ft are comparing it to the repositories listed below.
- Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction ☆785 · Nov 28, 2025 · Updated 3 months ago
- [ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image ☆934 · Dec 7, 2024 · Updated last year
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation ☆3,102 · Dec 10, 2025 · Updated 3 months ago
- [CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos ☆1,523 · Nov 30, 2025 · Updated 3 months ago
- [SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal ☆757 · Aug 2, 2025 · Updated 7 months ago
- [ICCV 2025] VistaDream: Sampling multiview consistent images for single-view scene reconstruction ☆528 · Jul 2, 2025 · Updated 8 months ago
- Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion" ☆1,344 · Jun 16, 2025 · Updated 9 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025) ☆510 · Dec 4, 2024 · Updated last year
- Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data (CVPR 2025) ☆200 · May 20, 2025 · Updated 10 months ago
- [CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation ☆895 · Jul 10, 2024 · Updated last year
- Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos" ☆1,254 · Jan 5, 2026 · Updated 2 months ago
- The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foun… ☆2,130 · Mar 13, 2025 · Updated last year
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors ☆277 · Feb 27, 2025 · Updated last year
- [ICCV 2025] Zero-Shot Monocular Depth Completion with Guided Diffusion ☆240 · Oct 31, 2025 · Updated 4 months ago
- [CVPR 2025 Highlight] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos ☆456 · Apr 4, 2025 · Updated 11 months ago
- [TIP 2026] ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model ☆709 · Nov 9, 2024 · Updated last year
- [TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis ☆1,525 · Dec 13, 2025 · Updated 3 months ago
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control ☆812 · Jun 9, 2025 · Updated 9 months ago
- PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage ☆50 · Mar 12, 2026 · Updated last week
- [ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes. ☆515 · Apr 1, 2025 · Updated 11 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation ☆283 · Nov 18, 2025 · Updated 4 months ago
- StableRecon: Making Video to 3D easy ☆76 · Oct 19, 2024 · Updated last year
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models ☆1,574 · Mar 3, 2026 · Updated 2 weeks ago
- [SIGGRAPH'24] Implementations for "High-quality Surface Reconstruction using Gaussian Surfels". ☆673 · Jun 17, 2025 · Updated 9 months ago
- [ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias" ☆508 · Aug 4, 2025 · Updated 7 months ago
- Universal Monocular Metric Depth Estimation ☆1,153 · May 18, 2025 · Updated 10 months ago
- [ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation. ☆2,049 · Aug 20, 2024 · Updated last year
- Code for SPAD : Spatially Aware Multiview Diffusers, CVPR 2024 ☆178 · Feb 8, 2025 · Updated last year
- [CVPR 2025] RollingDepth: Video Depth without Video Models ☆605 · Mar 18, 2025 · Updated last year
- DUSt3R: Geometric 3D Vision Made Easy ☆7,011 · Sep 24, 2025 · Updated 5 months ago
- ☆709 · May 1, 2025 · Updated 10 months ago
- [CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis ☆367 · Sep 10, 2024 · Updated last year
- ☆1,242 · Aug 2, 2025 · Updated 7 months ago
- [NeurIPS 2024] Official code for "Neural Gaffer: Relighting Any Object via Diffusion" ☆340 · Jun 9, 2025 · Updated 9 months ago
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models ☆220 · Jan 24, 2025 · Updated last year
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think ☆1,581 · Mar 16, 2025 · Updated last year
- An open-source impl. of Large Reconstruction Models ☆1,210 · May 6, 2024 · Updated last year
- ☆66 · Nov 27, 2024 · Updated last year
- ☆306 · Sep 26, 2024 · Updated last year