xtudbxk / GPSTokenLinks
The official code for paper "GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation"
☆36Updated last week
Alternatives and similar repositories for GPSToken
Users that are interested in GPSToken are comparing it to the libraries listed below
Sorting:
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆18Updated 9 months ago
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆51Updated 6 months ago
- Sora Generates Videos with Stunning Geometrical Consistency☆51Updated last year
- VideoDirector [CVPR 2025]☆28Updated 6 months ago
- ☆39Updated last year
- [ECCV2024] ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation☆53Updated 6 months ago
- ☆52Updated 2 months ago
- ☆33Updated 11 months ago
- open-sourced video dataset with dynamic scenes and camera movements annotation☆75Updated 5 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆133Updated 2 months ago
- ☆19Updated 6 months ago
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆69Updated 3 months ago
- Official implementation of LaVin-DiT☆43Updated 8 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆96Updated 6 months ago
- Official Repo for the Paper Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control☆34Updated 9 months ago
- ☆26Updated 5 months ago
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025)☆56Updated 2 months ago
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆79Updated last year
- ☆27Updated 3 months ago
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control☆94Updated 4 months ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆73Updated 2 months ago
- Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling☆41Updated this week
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models☆79Updated 3 weeks ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆48Updated 2 months ago
- VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆42Updated this week
- [ICCV 2025] Pytorch implementation of "VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Pr…☆36Updated 2 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Updated last month
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆23Updated 2 months ago
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset☆85Updated last week
- Official implementation of "EG4D: Explicit Generation of 4D Object without Score Distillation" (ICLR 2025)☆32Updated 7 months ago