aim-uofa / GenPerceptLinks

[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models

☆209

Alternatives and similar repositories for GenPercept

Users that are interested in GenPercept are comparing it to the libraries listed below

Sorting:

ZHU-Zhiyu / NVS_Solver
Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"
☆306Updated 6 months ago
basilevh / gcd
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation
☆270Updated last month
VITA-Group / Diffusion4D
"Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Li…
☆321Updated 9 months ago
Magicboomliu / Accelerator-Simple-Template
This is a simple template using HuggingFace Accelerator for DDP-training/Saving/Loading/Pushing.
☆49Updated last year
jiahao-shao1 / ChronoDepth
ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors
☆274Updated 7 months ago
zqh0253 / 3DitScene
[ICLR 2025] 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
☆252Updated 11 months ago
ZGCTroy / CamI2V
official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"
☆152Updated 3 weeks ago
cyw-3d / SAR3D
Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE"
☆178Updated 4 months ago
MattWallingford / 360-1M
☆79Updated 4 months ago
snap-research / ac3d
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
☆135Updated last month
ewrfcas / MVInpainter
[NeurIPS 2024] MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing
☆129Updated 11 months ago
dreamscene4d / dreamscene4d
[NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
☆230Updated last year
TQTQliu / Free4D
[ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
☆219Updated 6 months ago
cvlab-columbia / pix2gestalt
Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)
☆185Updated 3 months ago
SunYangtian / Splatter_A_Video
[NeurIPS 2024] Official code for "Splatter a Video: Video Gaussian Representation for Versatile Processing"
☆151Updated 9 months ago
ewrfcas / MVGenMaster
[CVPR2025] MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model
☆117Updated 4 months ago
gen-omnimatte / gen-omnimatte-public
Generative Omnimatte (CVPR 2025)
☆139Updated 4 months ago
cashiwamochi / RealEstate10K_Downloader
These scripts are used to download RealEstate10K dataset.
☆93Updated last year
lutao2021 / BrightDreamer
BrightDreamer: Generic 3D Gaussian Generative Framework for Fast Text-to-3D Synthesis
☆89Updated 11 months ago
YorkUCVIL / Photoconsistent-NVS
☆38Updated 2 years ago
ywyue / FiT3D
[ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning
☆300Updated last month
xizaoqu / TrajectoryAttention
[ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control
☆94Updated 5 months ago
XDimLab / Prometheus
[CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation
☆128Updated 3 months ago
ChrisDong-THU / GaussianToken
Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting
☆96Updated 6 months ago
hrz2000 / CustomNeRF
[CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
☆41Updated last year
shinying / dmp
[CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction
☆80Updated last year
wkbian / GS-DiT
Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking
☆56Updated 9 months ago
Nut-World / NutWorld
Seeing World Dynamics in a Nutshell
☆109Updated 7 months ago
autonomousvision / LaRa
[ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS model
☆313Updated last year
Chrixtar / latentsplat
[ECCV 2024] Implementation of latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction
☆227Updated 6 months ago