lucidrains / RQ-TransformerLinks
Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"
☆120Updated 3 years ago
Alternatives and similar repositories for RQ-Transformer
Users that are interested in RQ-Transformer are comparing it to the libraries listed below
Sorting:
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆89Updated 11 months ago
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 2 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆91Updated 3 years ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆65Updated 3 years ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆125Updated 10 months ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆94Updated 3 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Updated last year
- Official PyTorch implementation for Maximum Likelihood Training of Implicit Nonlinear Diffusion Model (INDM) in NeurIPS 2022.☆40Updated last year
- JAX implementation ViT-VQGAN☆82Updated 3 years ago
- Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023☆88Updated 7 months ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆110Updated last month
- Implementation of a multimodal diffusion transformer in Pytorch☆105Updated last year
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆191Updated 3 years ago
- ☆41Updated last year
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 3 years ago
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)☆27Updated last year
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"☆57Updated 2 years ago
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…☆204Updated last year
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆44Updated last year
- Implementation of a Light Recurrent Unit in Pytorch☆49Updated last year
- ☆141Updated last year
- PyTorch implementation of slicing adversarial network (SAN)☆97Updated last year
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆58Updated 10 months ago
- [Neurips 2021]Diffusion Normalizing Flow (DiffFlow)☆118Updated 2 years ago
- A convolution-free, transformer-only version of the CycleGAN framework☆33Updated 3 years ago
- [ICCV 2023] Online Clustered Codebook☆177Updated last year
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 2 years ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆92Updated 2 years ago
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802☆95Updated 2 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆116Updated 2 years ago