elyxlz / givt-pytorch
A partial implementation of the Generative Infinite-Vocabulary Transformer (GIVT) from Google DeepMind, in PyTorch.
☆18 · Updated last year
Alternatives and similar repositories for givt-pytorch:
Users who are interested in givt-pytorch are comparing it to the repositories listed below.
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an… ☆102 · Updated 3 months ago
- ☆88 · Updated last week
- [ICLR2025] Halton Scheduler for Masked Generative Image Transformer ☆207 · Updated last month
- ☆121 · Updated 9 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image… ☆57 · Updated last year
- A Pytorch Implementation of Finite Scalar Quantization ☆118 · Updated last year
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer" ☆302 · Updated 3 months ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024 ☆107 · Updated 5 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders ☆107 · Updated 3 months ago
- The official implementation of "[MASK] is All You Need" ☆115 · Updated last month
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ☆166 · Updated last month
- High-performance Image Tokenizers for VAR and AR ☆235 · Updated 2 weeks ago
- [ICCV 2023] Online Clustered Codebook ☆164 · Updated 6 months ago
- ☆176 · Updated last month
- Towards training VQ-VAE models robustly! ☆65 · Updated 3 months ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision ☆72 · Updated 9 months ago
- ☆155 · Updated 3 months ago
- ☆78 · Updated 5 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers" ☆110 · Updated 2 months ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆60 · Updated 11 months ago
- Open source implementation of "Vision Transformers Need Registers" ☆171 · Updated this week
- ☆127 · Updated last year
- The official github repo for "Test-Time Training with Masked Autoencoders" ☆81 · Updated last year
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook ☆90 · Updated 9 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna… ☆171 · Updated last year
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens" ☆61 · Updated last month
- Official repository of paper "Subobject-level Image Tokenization" ☆69 · Updated last week
- The official PyTorch implementation of Fast Diffusion Model ☆95 · Updated last year
- [ICCV 2023 Oral] Official Implementation of "Denoising Diffusion Autoencoders are Unified Self-supervised Learners" ☆164 · Updated last year
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers" ☆194 · Updated 6 months ago