xtudbxk / GPSTokenLinks
The official code for paper "GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation"
☆39Updated last month
Alternatives and similar repositories for GPSToken
Users that are interested in GPSToken are comparing it to the libraries listed below
Sorting:
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆18Updated 10 months ago
- OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation☆16Updated 10 months ago
- [ECCV2024] ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation☆53Updated 7 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆97Updated 6 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆143Updated last week
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆51Updated 7 months ago
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025)☆57Updated 3 months ago
- ☆39Updated last year
- VideoDirector [CVPR 2025]☆31Updated 7 months ago
- Project page of "GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors"☆22Updated last year
- Sora Generates Videos with Stunning Geometrical Consistency☆51Updated last year
- ☆33Updated last year
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction' (ICCV 2025)☆75Updated 3 months ago
- open-sourced video dataset with dynamic scenes and camera movements annotation☆76Updated 6 months ago
- Official implementation of LaVin-DiT☆46Updated 9 months ago
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models☆84Updated last month
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆73Updated 3 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆145Updated last week
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆41Updated 2 weeks ago
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control☆95Updated 5 months ago
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆25Updated 3 months ago
- [ICCV 2025] Pytorch implementation of "VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Pr…☆41Updated 3 months ago
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023)☆65Updated 2 years ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Updated 2 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆45Updated last year
- The codes of our paper "ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion"☆14Updated 4 months ago
- ☆26Updated 6 months ago
- ☆19Updated 6 months ago
- ☆11Updated 3 months ago
- Image Neural Field Diffusion Models, CVPR 2024 (Highlight)☆73Updated 11 months ago