aim-uofa / GenPercept
GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
☆127Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for GenPercept
- Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"☆255Updated 3 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆73Updated 2 months ago
- [NeurIPS 2024] Geometry-Aware Large Reconstruction Model for Efficient and High-Quality 3D Generation☆135Updated last month
- This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing.☆48Updated 8 months ago
- 🎞️ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views☆112Updated this week
- Gaussian splatting implementation of Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions☆75Updated 7 months ago
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS model☆277Updated 4 months ago
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos☆184Updated last month
- [CVPR 2024] MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation☆110Updated 6 months ago
- [CVPR'24] Consistent Novel View Synthesis without 3D Representation☆137Updated 2 months ago
- ☆123Updated 3 weeks ago
- Code for "Real3D: Scaling Up Large Reconstruction Models with Real-World Images"☆145Updated 5 months ago
- [ECCV 2024] DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing☆86Updated 3 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆183Updated 3 weeks ago
- Code for SPAD: Spatially Aware Multiview Diffusers, CVPR 2024☆144Updated last month
- Official code for TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts (SIGGRAPH 2024 & TOG)☆96Updated 4 months ago
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆73Updated 7 months ago
- [NeurIPS 2024] MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing☆76Updated last week
- ☆93Updated 3 weeks ago
- [ECCV 2024] Implementation of latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction☆164Updated 4 months ago
- [CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.☆166Updated 8 months ago
- 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting☆180Updated 5 months ago
- GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation☆157Updated 6 months ago
- [ECCV 2024] GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing☆81Updated 2 weeks ago
- [NeurIPS 2024] Official code for "Splatter a Video: Video Gaussian Representation for Versatile Processing"☆86Updated this week
- Official Implementation for STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians☆154Updated 3 months ago
- [CVPR 2024 Highlight] ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models☆149Updated 3 months ago
- (CVPR 2024) NViST: In the wild New View Synthesis from a Single Image with Transformers☆33Updated last month
- [CVPR 2024 Highlight] Code release for "The More You See in 2D, the More You Perceive in 3D"☆60Updated last month
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models☆103Updated 3 months ago