xtudbxk / GPSToken
The official code for paper "GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation"
☆48Updated 3 months ago
Alternatives and similar repositories for GPSToken
Users who are interested in GPSToken are comparing it to the repositories listed below
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆42Updated 10 months ago
- This is the official implementation for ControlVAR.☆125Updated last year
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆18Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆127Updated last year
- Frequency Autoregressive Image Generation with Continuous Tokens☆94Updated 7 months ago
- [IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation☆16Updated last month
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆169Updated last month
- [CVMJ 2025] Neural Video Fields Editing☆80Updated 6 months ago
- ☆12Updated 6 months ago
- [CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation)☆106Updated last month
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model☆71Updated 9 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆46Updated last year
- UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture☆74Updated this week
- [ECCV2024] "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆58Updated last year
- [ICCV2025] Generate one 2K image on single 24GB 3090 GPU!☆83Updated 4 months ago
- [NeurIPS'25] InstructRestore: Region-Customized Image Restoration with Human Instructions☆47Updated 2 months ago
- Image Neural Field Diffusion Models, CVPR 2024 (Highlight)☆78Updated last year
- ☆37Updated last month
- Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆159Updated 3 months ago
- (SRA) No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves☆107Updated 5 months ago
- [ICML 2025] Official code for the paper 'DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space'☆75Updated 5 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated last year
- EditAR: Unified Conditional Generation with Autoregressive Models (CVPR 2025)☆36Updated 7 months ago
- Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?☆190Updated last month
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆88Updated 9 months ago
- Official PyTorch code for our ICCV25 paper "Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution"☆79Updated 5 months ago
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆190Updated 8 months ago
- Code of the paper "FreePCA:Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…☆28Updated 4 months ago
- Official implementation of LaVin-DiT☆53Updated 11 months ago
- [CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation☆61Updated 6 months ago