[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
β220Jan 24, 2025Updated last year
Alternatives and similar repositories for GenPercept
Users that are interested in GenPercept are comparing it to the libraries listed below
Sorting:
- [ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Imageβ934Dec 7, 2024Updated last year
- [ICCV2023] π§FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Modelsβ131Aug 23, 2024Updated last year
- ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPOβ79Nov 17, 2025Updated 4 months ago
- A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.β65Aug 29, 2024Updated last year
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)β49Apr 14, 2025Updated 11 months ago
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimationβ3,102Dec 10, 2025Updated 3 months ago
- [AAAI 2025, Oral] DepthFM: Fast Monocular Depth Estimation with Flow Matchingβ739May 6, 2025Updated 10 months ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Predictionβ80Apr 19, 2024Updated last year
- [NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perceptionβ298Sep 21, 2025Updated 6 months ago
- Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Predictionβ788Nov 28, 2025Updated 3 months ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priorsβ277Feb 27, 2025Updated last year
- [SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normalβ757Aug 2, 2025Updated 7 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)β510Dec 4, 2024Updated last year
- β70Oct 19, 2023Updated 2 years ago
- β274May 31, 2024Updated last year
- [CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimationβ897Jul 10, 2024Updated last year
- The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Founβ¦β2,130Mar 13, 2025Updated last year
- This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing.β49Mar 19, 2024Updated 2 years ago
- β131Feb 7, 2024Updated 2 years ago
- Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction Models with Synthesized Dataβ154Oct 7, 2024Updated last year
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervisionβ2,367Nov 2, 2025Updated 4 months ago
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Thinkβ504Jan 26, 2026Updated last month
- β709May 1, 2025Updated 10 months ago
- Intrinsic Image Diffusion for Single-view Material Estimationβ225Nov 28, 2025Updated 3 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Modelsβ348Dec 1, 2025Updated 3 months ago
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS modelβ314Jul 13, 2024Updated last year
- [ECCV 2024] Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimationβ66Jan 5, 2025Updated last year
- [CVPR'2024] Official implementation of the paper "ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation"β206Nov 20, 2025Updated 4 months ago
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)β138Apr 6, 2025Updated 11 months ago
- Repo of HawkLlama.β16Jan 2, 2025Updated last year
- [ECCV 2024] Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditionsβ93Sep 28, 2024Updated last year
- [CVPR 2024] Official implementation of "SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration"β190Mar 31, 2025Updated 11 months ago
- [ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Modelsβ847Dec 17, 2025Updated 3 months ago
- [ICCV 2025] Amodal Depth Anything: Amodal Depth Estimation in the Wildβ40Feb 21, 2026Updated last month
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modelingβ580Oct 26, 2025Updated 4 months ago
- [ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactionsβ44Mar 10, 2025Updated last year
- Instant-angelo: Build high-fidelity Digital Twin within 20 Minutes!β459Oct 26, 2024Updated last year
- [CVPR 2025] Prompt Depth Anythingβ1,082Jan 29, 2026Updated last month
- Boosting Multi-view Stereo with Late Cost Aggregationβ13Jan 24, 2024Updated 2 years ago