lucidrains / RQ-Transformer
Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"
☆106Updated 3 years ago
Alternatives and similar repositories for RQ-Transformer:
Users that are interested in RQ-Transformer are comparing it to the libraries listed below
- JAX implementation ViT-VQGAN☆83Updated 2 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆86Updated 6 months ago
- Implementation of a multimodal diffusion transformer in Pytorch☆101Updated 10 months ago
- Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)☆29Updated last year
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆103Updated 5 months ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 3 years ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆32Updated last year
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆58Updated last week
- ☆45Updated last year
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆18Updated 9 months ago
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 2 years ago
- Implementation of a Light Recurrent Unit in Pytorch☆46Updated 7 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆55Updated 11 months ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆49Updated 6 months ago
- ☆51Updated last year
- ☆38Updated last year
- Implementation of NWT, audio-to-video generation, in Pytorch☆90Updated 3 years ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆87Updated 3 years ago
- [ICLR 2023]DEIS: Fast Sampling of Diffusion Models with Exponential Integrator☆157Updated 2 years ago
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆82Updated 2 months ago
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"☆56Updated 2 years ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆36Updated last year
- Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"☆94Updated last year
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆81Updated 10 months ago
- ☆37Updated last year
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 2 years ago
- PyTorch Implementation of V-objective Diffusion Probabilistic Models with Classifier-free Guidance☆32Updated last year
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)☆26Updated last year
- A convolution-free, transformer-only version of the CycleGAN framework☆33Updated 3 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆115Updated 2 years ago