[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
β219Jan 24, 2025Updated last year
Alternatives and similar repositories for GenPercept
Users that are interested in GenPercept are comparing it to the libraries listed below
Sorting:
- [ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Imageβ931Dec 7, 2024Updated last year
- [ICCV2023] π§FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Modelsβ131Aug 23, 2024Updated last year
- [AAAI 2025, Oral] DepthFM: Fast Monocular Depth Estimation with Flow Matchingβ739May 6, 2025Updated 9 months ago
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimationβ3,086Dec 10, 2025Updated 2 months ago
- A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.β65Aug 29, 2024Updated last year
- Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Predictionβ783Nov 28, 2025Updated 3 months ago
- ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPOβ78Nov 17, 2025Updated 3 months ago
- [SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normalβ753Aug 2, 2025Updated 7 months ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)β48Apr 14, 2025Updated 10 months ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priorsβ277Feb 27, 2025Updated last year
- This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing.β49Mar 19, 2024Updated last year
- β273May 31, 2024Updated last year
- Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction Models with Synthesized Dataβ153Oct 7, 2024Updated last year
- β131Feb 7, 2024Updated 2 years ago
- [NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perceptionβ295Sep 21, 2025Updated 5 months ago
- [CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimationβ893Jul 10, 2024Updated last year
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Predictionβ80Apr 19, 2024Updated last year
- Intrinsic Image Diffusion for Single-view Material Estimationβ226Nov 28, 2025Updated 3 months ago
- [ICCV 2025] Amodal Depth Anything: Amodal Depth Estimation in the Wildβ39Feb 21, 2026Updated last week
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervisionβ2,312Nov 2, 2025Updated 3 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Modelsβ349Dec 1, 2025Updated 3 months ago
- β703May 1, 2025Updated 10 months ago
- The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Founβ¦β2,112Mar 13, 2025Updated 11 months ago
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS modelβ314Jul 13, 2024Updated last year
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)β510Dec 4, 2024Updated last year
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)β138Apr 6, 2025Updated 10 months ago
- β70Oct 19, 2023Updated 2 years ago
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Thinkβ499Jan 26, 2026Updated last month
- [CVPR'25] DepthSplat: Connecting Gaussian Splatting and Depthβ1,153Apr 27, 2025Updated 10 months ago
- Instant-angelo: Build high-fidelity Digital Twin within 20 Minutes!β458Oct 26, 2024Updated last year
- [SIGGRAPH Asia'24 & TOG] Gaussian Opacity Fields: Efficient Adaptive Surface Reconstruction in Unbounded Scenesβ983Nov 15, 2024Updated last year
- A niche toolkit for 3D computer vision tasks.β319Feb 3, 2026Updated 3 weeks ago
- [ECCV 2024] Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditionsβ92Sep 28, 2024Updated last year
- [ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Modelsβ839Dec 17, 2025Updated 2 months ago
- [SIGGRAPH Asia 2022] A Neural Galerkin Solver for Accurate Surface Reconstructionβ99Dec 21, 2022Updated 3 years ago
- Official implementation for "DMesh: A Differentiable Representation for General Meshes" (NeurIPS 2024)β319Oct 3, 2024Updated last year
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modelingβ573Oct 26, 2025Updated 4 months ago
- β476Jul 24, 2024Updated last year
- Code for ICCV'2025 "Real3D: Scaling Up Large Reconstruction Models with Real-World Images"β206Jul 28, 2025Updated 7 months ago