aim-uofa / GenPercept
[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
☆180 · Updated 2 months ago
Alternatives and similar repositories for GenPercept:
Users who are interested in GenPercept are comparing it to the libraries listed below.
- official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model" ☆118 · Updated 3 weeks ago
- Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE" ☆135 · Updated 2 weeks ago
- Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer" ☆289 · Updated 2 weeks ago
- [NeurIPS 2024] MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing ☆109 · Updated 5 months ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors ☆228 · Updated last month
- Aether: Geometric-Aware Unified World Modeling ☆253 · Updated 2 weeks ago
- [NeurIPS 2024] Official code for "Splatter a Video: Video Gaussian Representation for Versatile Processing" ☆129 · Updated 3 months ago
- Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency ☆148 · Updated last week
- Seeing World Dynamics in a Nutshell ☆102 · Updated last month
- [ICLR 2025] 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting ☆209 · Updated 4 months ago
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos ☆224 · Updated 6 months ago
- [NeurIPS 2024] Geometry-Aware Large Reconstruction Model for Efficient and High-Quality 3D Generation ☆162 · Updated 6 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation ☆241 · Updated 5 months ago
- "Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Li… ☆288 · Updated 2 months ago
- This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing. ☆49 · Updated last year
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS model ☆309 · Updated 9 months ago
- Gaussian splatting implementation of Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions ☆92 · Updated last year
- CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control ☆166 · Updated 4 months ago
- Official Implementation for STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians ☆186 · Updated 8 months ago
- AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers ☆91 · Updated 3 weeks ago
- [CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step ☆203 · Updated last week
- [CVPR 2024] EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion ☆122 · Updated 7 months ago
- Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation ☆142 · Updated last month
- Code for SPAD: Spatially Aware Multiview Diffusers, CVPR 2024 ☆165 · Updated 2 months ago
- Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking ☆44 · Updated 3 months ago
- [ArXiv 2025] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction ☆199 · Updated this week
- Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation ☆89 · Updated 3 months ago