lucidrains / RQ-Transformer
Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"
☆99 Updated 2 years ago
Alternatives and similar repositories for RQ-Transformer:
Users who are interested in RQ-Transformer are comparing it to the libraries listed below:
- My attempts at applying SoundStream design on learned tokenization of text and then applying hierarchical attention to text generation ☆83 Updated 4 months ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch ☆64 Updated 2 years ago
- An implementation of simple diffusion in PyTorch (and JAX) ☆35 Updated 2 years ago
- JAX implementation of ViT-VQGAN ☆80 Updated 2 years ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024) ☆41 Updated 3 months ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI ☆84 Updated 3 years ago
- Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023 ☆79 Updated this week
- ☆51 Updated last year
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models" ☆55 Updated last year
- A convolution-free, transformer-only version of the CycleGAN framework ☆33 Updated 3 years ago
- Implementation of a multimodal diffusion transformer in Pytorch ☆100 Updated 7 months ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models" ☆28 Updated 11 months ago
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024 ☆95 Updated 4 months ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching" ☆36 Updated 10 months ago
- Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer) ☆29 Updated last year
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23) ☆26 Updated last year
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch) ☆14 Updated 2 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch ☆88 Updated 2 years ago
- Jax/Flax implementation of Variational-DiffWave. ☆40 Updated 2 years ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, don't use it for Adam ☆73 Updated 6 months ago
- Implementation of a Light Recurrent Unit in Pytorch ☆48 Updated 4 months ago
- Unofficial implementation of Neural Analysis and Synthesis ☆7 Updated 3 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022 ☆111 Updated 2 years ago
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740) ☆157 Updated last year
- Implementation of Discrete Key / Value Bottleneck, in Pytorch ☆87 Updated last year
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch ☆102 Updated 2 months ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models' ☆16 Updated 6 months ago
- ☆35 Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch ☆89 Updated 3 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆97 Updated last year