[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
☆221 · Jan 24, 2025 · Updated last year
Alternatives and similar repositories for GenPercept
Users that are interested in GenPercept are comparing it to the libraries listed below.
- [ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image ☆935 · Dec 7, 2024 · Updated last year
- [ICCV2023] FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models ☆131 · Aug 23, 2024 · Updated last year
- [ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO ☆80 · Apr 30, 2026 · Updated 3 weeks ago
- A toolbox for benchmarking SOTA discriminative and generative geometry estimation models. ☆65 · Aug 29, 2024 · Updated last year
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews) ☆51 · Apr 14, 2025 · Updated last year
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation ☆3,137 · Dec 10, 2025 · Updated 5 months ago
- [AAAI 2025, Oral] DepthFM: Fast Monocular Depth Estimation with Flow Matching ☆748 · May 6, 2025 · Updated last year
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction ☆80 · Apr 19, 2024 · Updated 2 years ago
- [NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception ☆311 · Sep 21, 2025 · Updated 8 months ago
- Official implementation of "Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction" ☆802 · Nov 28, 2025 · Updated 5 months ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors ☆277 · Feb 27, 2025 · Updated last year
- [SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal ☆773 · Aug 2, 2025 · Updated 9 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025) ☆516 · Dec 4, 2024 · Updated last year
- ☆70 · Oct 19, 2023 · Updated 2 years ago
- ☆279 · May 31, 2024 · Updated last year
- [CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation ☆905 · Jul 10, 2024 · Updated last year
- The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foun… ☆2,193 · Mar 13, 2025 · Updated last year
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision ☆2,470 · Nov 2, 2025 · Updated 6 months ago
- This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing. ☆49 · Mar 19, 2024 · Updated 2 years ago
- ☆131 · Feb 7, 2024 · Updated 2 years ago
- Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction Models with Synthesized Data ☆154 · Oct 7, 2024 · Updated last year
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think ☆512 · Jan 26, 2026 · Updated 3 months ago
- ☆716 · May 1, 2025 · Updated last year
- Intrinsic Image Diffusion for Single-view Material Estimation ☆227 · Nov 28, 2025 · Updated 5 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models ☆348 · Dec 1, 2025 · Updated 5 months ago
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS model ☆314 · Jul 13, 2024 · Updated last year
- [ECCV 2024] Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation ☆67 · Jan 5, 2025 · Updated last year
- [CVPR'2024] Official implementation of the paper "ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation" ☆207 · Nov 20, 2025 · Updated 6 months ago
- Repo of HawkLlama. ☆16 · Jan 2, 2025 · Updated last year
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025) ☆146 · Apr 6, 2025 · Updated last year
- [ECCV 2024] Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions ☆94 · Sep 28, 2024 · Updated last year
- [CVPR 2024] Official implementation of "SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration" ☆194 · Mar 31, 2025 · Updated last year
- [ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models ☆855 · Dec 17, 2025 · Updated 5 months ago
- [ICCV 2025] Amodal Depth Anything: Amodal Depth Estimation in the Wild ☆42 · Feb 21, 2026 · Updated 3 months ago
- [ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions ☆43 · Mar 10, 2025 · Updated last year
- Instant-angelo: Build high-fidelity Digital Twin within 20 Minutes! ☆462 · Oct 26, 2024 · Updated last year
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling ☆595 · Oct 26, 2025 · Updated 6 months ago
- [CVPR 2025] Prompt Depth Anything ☆1,108 · Jan 29, 2026 · Updated 3 months ago
- Boosting Multi-view Stereo with Late Cost Aggregation ☆13 · Jan 24, 2024 · Updated 2 years ago