VachanVY / Transfusion.torch
PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
☆17Updated 3 months ago
Alternatives and similar repositories for Transfusion.torch:
Users that are interested in Transfusion.torch are comparing it to the libraries listed below
- ☆43Updated 4 months ago
- ☆67Updated 3 months ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows☆50Updated 3 weeks ago
- ☆133Updated last month
- The official implementation of "[MASK] is All You Need"☆104Updated last month
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆74Updated 7 months ago
- Implementation of the proposed MaskBit from Bytedance AI☆71Updated 2 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆59Updated 3 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆81Updated 3 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆119Updated last week
- A Pytorch Implementation of Finite Scalar Quantization☆104Updated last year
- The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/☆108Updated 3 weeks ago
- Implementation of a multimodal diffusion transformer in Pytorch☆99Updated 7 months ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆92Updated 3 months ago
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆117Updated last week
- Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆113Updated this week
- Transformer-Mamba Diffusion Models☆95Updated 7 months ago
- ☆112Updated 7 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆45Updated 5 months ago
- [NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?☆37Updated 7 months ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated last week
- The collection of awesome papers on alignment of diffusion models.☆84Updated last week
- XQ-GAN🚀: An Open-source Image Tokenization Framework for Autoregressive Generation☆182Updated last week
- This is the official implementation for ControlVAR.☆91Updated last month
- Scalable Diffusion Models with State Space Backbone☆150Updated 10 months ago
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need☆165Updated last month
- ☆23Updated 2 weeks ago
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation"☆34Updated 10 months ago
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆30Updated 2 months ago
- Liquid: Language Models are Scalable Multi-modal Generators☆61Updated last month