lucidrains / RQ-TransformerLinks
Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"
☆124Updated 3 years ago
Alternatives and similar repositories for RQ-Transformer
Users that are interested in RQ-Transformer are comparing it to the libraries listed below
Sorting:
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆90Updated last year
- An implementation of simple diffusion in PyTorch (and JAX)☆34Updated 3 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Updated last year
- JAX implementation ViT-VQGAN☆82Updated 3 years ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆134Updated 3 months ago
- Implementation of a multimodal diffusion transformer in Pytorch☆107Updated last year
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆66Updated 3 years ago
- Official PyTorch implementation for Maximum Likelihood Training of Implicit Nonlinear Diffusion Model (INDM) in NeurIPS 2022.☆40Updated last year
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)☆28Updated 2 years ago
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"☆58Updated 2 years ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆66Updated last year
- Implementation of NWT, audio-to-video generation, in Pytorch☆92Updated 3 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆88Updated last year
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆57Updated last year
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…☆207Updated last year
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆98Updated 4 years ago
- Implementation of a Light Recurrent Unit in Pytorch☆49Updated last year
- ☆144Updated last year
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆118Updated 3 years ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆126Updated 5 months ago
- Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"☆98Updated last year
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 3 years ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆94Updated 2 years ago
- ☆44Updated 2 years ago
- ☆53Updated 2 years ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆51Updated last year
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30Updated 3 years ago
- Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)☆28Updated last year
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Updated 2 years ago
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802☆97Updated 2 years ago