[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
β228Jan 24, 2025Updated last year
Alternatives and similar repositories for GenPercept
Users that are interested in GenPercept are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Imageβ936Dec 7, 2024Updated last year
- [ICCV2023] π§FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Modelsβ131Aug 23, 2024Updated last year
- [ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPOβ81Apr 30, 2026Updated 2 months ago
- A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.β65Aug 29, 2024Updated last year
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)β52Apr 14, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimationβ3,171Dec 10, 2025Updated 6 months ago
- [AAAI 2025, Oral] DepthFM: Fast Monocular Depth Estimation with Flow Matchingβ755May 6, 2025Updated last year
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Predictionβ81Apr 19, 2024Updated 2 years ago
- [NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perceptionβ316Sep 21, 2025Updated 9 months ago
- Official implementation of "Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction"β807Nov 28, 2025Updated 7 months ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priorsβ279Feb 27, 2025Updated last year
- [SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normalβ777Aug 2, 2025Updated 10 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)β518Dec 4, 2024Updated last year
- β70Oct 19, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β285May 31, 2024Updated 2 years ago
- [CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimationβ917Jul 10, 2024Updated last year
- The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Founβ¦β2,241Mar 13, 2025Updated last year
- This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing.β49Mar 19, 2024Updated 2 years ago
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervisionβ2,588Nov 2, 2025Updated 7 months ago
- β132Feb 7, 2024Updated 2 years ago
- Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction Models with Synthesized Dataβ154Oct 7, 2024Updated last year
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Thinkβ517Jan 26, 2026Updated 5 months ago
- β720May 1, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Intrinsic Image Diffusion for Single-view Material Estimationβ232Nov 28, 2025Updated 7 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Modelsβ350Dec 1, 2025Updated 7 months ago
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS modelβ316Jul 13, 2024Updated last year
- [ECCV 2024] Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimationβ68Jan 5, 2025Updated last year
- [CVPR'2024] Official implementation of the paper "ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation"β223Nov 20, 2025Updated 7 months ago
- Repo of HawkLlama.β16Jan 2, 2025Updated last year
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)β149Apr 6, 2025Updated last year
- [ECCV 2024] Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditionsβ94Sep 28, 2024Updated last year
- [CVPR 2024] Official implementation of "SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration"β194Mar 31, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Modelsβ856Dec 17, 2025Updated 6 months ago
- [ICCV 2025] Amodal Depth Anything: Amodal Depth Estimation in the Wildβ43Feb 21, 2026Updated 4 months ago
- [ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactionsβ45Mar 10, 2025Updated last year
- Instant-angelo: Build high-fidelity Digital Twin within 20 Minutes!β462Oct 26, 2024Updated last year
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modelingβ598Oct 26, 2025Updated 8 months ago
- Boosting Multi-view Stereo with Late Cost Aggregationβ13Jan 24, 2024Updated 2 years ago
- [CVPR 2025] Prompt Depth Anythingβ1,131Jan 29, 2026Updated 5 months ago