aim-uofa / GenPercept
[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
☆220 Updated last year
Alternatives and similar repositories for GenPercept
Users who are interested in GenPercept are comparing it to the repositories listed below.
- Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer" ☆311 Updated 10 months ago
- official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model" ☆164 Updated 4 months ago
- Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE" ☆190 Updated 7 months ago
- [ICLR 2025] 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting ☆259 Updated last year
- [NeurIPS 2024] Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models ☆333 Updated last year
- This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing. ☆49 Updated last year
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors ☆277 Updated 11 months ago
- ☆88 Updated 8 months ago
- These scripts are used to download RealEstate10K dataset. ☆98 Updated last year
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation ☆281 Updated 2 months ago
- ☆278 Updated 3 months ago
- [ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency ☆235 Updated 3 months ago
- [CVPR2025] MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model ☆140 Updated last month
- [NeurIPS 2024] Official code for "Splatter a Video: Video Gaussian Representation for Versatile Processing" ☆156 Updated last year
- Generative Omnimatte (CVPR 2025) ☆162 Updated 8 months ago
- AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers ☆152 Updated 4 months ago
- [NeurIPS 2024] MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing ☆139 Updated last year
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction ☆80 Updated last year
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos ☆232 Updated last year
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024) ☆194 Updated 7 months ago
- ☆41 Updated 2 years ago
- Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking ☆56 Updated last year
- CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control ☆172 Updated last year
- [CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step ☆340 Updated 7 months ago
- Orient Anything, ICML 2025 ☆372 Updated 3 months ago
- Official Repo for the Paper Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control ☆38 Updated last year
- [CVPR 2025] Boosting Generative Novel View Synthesis with Sparse and Unposed Images ☆124 Updated last year
- [CVPR'24] Consistent Novel View Synthesis without 3D Representation ☆167 Updated last year
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control ☆96 Updated 8 months ago
- [CVPR'24 - Rebuttal Score 554] GenN2N: Generative NeRF2NeRF Translation ☆100 Updated last year