aim-uofa / GenPerceptLinks
[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
☆207Updated 8 months ago
Alternatives and similar repositories for GenPercept
Users that are interested in GenPercept are comparing it to the libraries listed below
Sorting:
- Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"☆305Updated 6 months ago
- official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"☆148Updated this week
- "Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Li…☆320Updated 8 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆269Updated 3 weeks ago
- ☆75Updated 4 months ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors☆271Updated 7 months ago
- This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing.☆49Updated last year
- [ICLR 2025] 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting☆249Updated 10 months ago
- Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE"☆175Updated 3 months ago
- [ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency☆215Updated 5 months ago
- [NeurIPS 2024] MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing☆127Updated 10 months ago
- [NeurIPS 2024] Official code for "Splatter a Video: Video Gaussian Representation for Versatile Processing"☆151Updated 8 months ago
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos☆229Updated last year
- AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers☆134Updated 2 weeks ago
- These scripts are used to download RealEstate10K dataset.☆93Updated last year
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)☆181Updated 3 months ago
- Seeing World Dynamics in a Nutshell☆109Updated 6 months ago
- [CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation☆127Updated 3 months ago
- [CVPR2025] MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model☆116Updated 4 months ago
- Generative Omnimatte (CVPR 2025)☆139Updated 4 months ago
- Gaussian splatting implementation of Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions☆109Updated last year
- ☆38Updated last year
- Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation (ICCV 2025)☆160Updated last month
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS model☆315Updated last year
- BrightDreamer: Generic 3D Gaussian Generative Framework for Fast Text-to-3D Synthesis☆89Updated 10 months ago
- Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image☆265Updated 4 months ago
- Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking☆56Updated 8 months ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆79Updated last year
- [ECCV 2024] Implementation of latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction☆226Updated 6 months ago
- 🎞️ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views☆280Updated 10 months ago