lucidrains / titok-pytorch
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
☆170Updated 9 months ago
Alternatives and similar repositories for titok-pytorch:
Users that are interested in titok-pytorch are comparing it to the libraries listed below
- Implementation of a multimodal diffusion transformer in Pytorch☆101Updated 9 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆110Updated 2 months ago
- Implementation of the proposed MaskBit from Bytedance AI☆75Updated 5 months ago
- ☆156Updated 3 months ago
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch☆338Updated 3 months ago
- ☆70Updated 4 months ago
- DDT: Decoupled Diffusion Transformer☆97Updated this week
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"☆134Updated 3 months ago
- Train VAE like a boss☆274Updated 5 months ago
- Official PyTorch implementation of TokenSet.☆114Updated 3 weeks ago
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…☆161Updated last month
- ☆52Updated 2 weeks ago
- Gaussian Mixture Flow Matching Models (GMFlow)☆73Updated last week
- ☆84Updated last year
- Code for paper "Principal Components" Enable A New Language of Images☆34Updated 3 weeks ago
- ☆122Updated 9 months ago
- Scaling Vision Pre-Training to 4K Resolution☆110Updated 3 weeks ago
- Implementation of a framework for Genie2 in Pytorch☆145Updated 3 months ago
- A Video Tokenizer Evaluation Dataset☆111Updated 3 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆73Updated 3 months ago
- ☆91Updated 2 weeks ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆55Updated 11 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆74Updated 4 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆65Updated 5 months ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆108Updated 5 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆146Updated 3 weeks ago
- Official PyTorch implementation of FlowMo.☆31Updated last week
- Scalable Diffusion Models with State Space Backbone☆152Updated last year
- TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/TokenBridge☆89Updated last week
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length☆98Updated last week