[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
★226 · Jan 24, 2025 · Updated last year
Alternatives and similar repositories for GenPercept
Users that are interested in GenPercept are comparing it to the libraries listed below.
- [ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image · ★936 · Dec 7, 2024 · Updated last year
- [ICCV2023] FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models · ★131 · Aug 23, 2024 · Updated last year
- [ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO · ★81 · Apr 30, 2026 · Updated last month
- A toolbox for benchmarking SOTA discriminative and generative geometry estimation models. · ★65 · Aug 29, 2024 · Updated last year
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews) · ★52 · Apr 14, 2025 · Updated last year
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation · ★3,157 · Dec 10, 2025 · Updated 6 months ago
- [AAAI 2025, Oral] DepthFM: Fast Monocular Depth Estimation with Flow Matching · ★751 · May 6, 2025 · Updated last year
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction · ★80 · Apr 19, 2024 · Updated 2 years ago
- [NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception · ★315 · Sep 21, 2025 · Updated 8 months ago
- Official implementation of "Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction" · ★804 · Nov 28, 2025 · Updated 6 months ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors · ★278 · Feb 27, 2025 · Updated last year
- [SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal · ★774 · Aug 2, 2025 · Updated 10 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025) · ★517 · Dec 4, 2024 · Updated last year
- ★70 · Oct 19, 2023 · Updated 2 years ago
- ★282 · May 31, 2024 · Updated 2 years ago
- [CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation · ★911 · Jul 10, 2024 · Updated last year
- The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foun… · ★2,222 · Mar 13, 2025 · Updated last year
- This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing. · ★49 · Mar 19, 2024 · Updated 2 years ago
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision · ★2,514 · Nov 2, 2025 · Updated 7 months ago
- ★131 · Feb 7, 2024 · Updated 2 years ago
- Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction Models with Synthesized Data · ★154 · Oct 7, 2024 · Updated last year
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think · ★516 · Jan 26, 2026 · Updated 4 months ago
- ★717 · May 1, 2025 · Updated last year
- Intrinsic Image Diffusion for Single-view Material Estimation · ★228 · Nov 28, 2025 · Updated 6 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models · ★350 · Dec 1, 2025 · Updated 6 months ago
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS model · ★316 · Jul 13, 2024 · Updated last year
- [ECCV 2024] Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation · ★67 · Jan 5, 2025 · Updated last year
- [CVPR'2024] Official implementation of the paper "ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation" · ★212 · Nov 20, 2025 · Updated 6 months ago
- Repo of HawkLlama. · ★16 · Jan 2, 2025 · Updated last year
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025) · ★147 · Apr 6, 2025 · Updated last year
- [ECCV 2024] Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions · ★94 · Sep 28, 2024 · Updated last year
- [CVPR 2024] Official implementation of "SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration" · ★193 · Mar 31, 2025 · Updated last year
- [ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models · ★854 · Dec 17, 2025 · Updated 5 months ago
- [ICCV 2025] Amodal Depth Anything: Amodal Depth Estimation in the Wild · ★43 · Feb 21, 2026 · Updated 3 months ago
- [ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions · ★44 · Mar 10, 2025 · Updated last year
- Instant-angelo: Build high-fidelity Digital Twin within 20 Minutes! · ★462 · Oct 26, 2024 · Updated last year
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling · ★595 · Oct 26, 2025 · Updated 7 months ago
- Boosting Multi-view Stereo with Late Cost Aggregation · ★13 · Jan 24, 2024 · Updated 2 years ago
- [CVPR 2025] Prompt Depth Anything · ★1,121 · Jan 29, 2026 · Updated 4 months ago