lucidrains / RQ-TransformerLinks
Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"
☆122Updated 3 years ago
Alternatives and similar repositories for RQ-Transformer
Users that are interested in RQ-Transformer are comparing it to the libraries listed below
Sorting:
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆88Updated last year
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 2 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Updated last year
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆132Updated last week
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆65Updated 3 years ago
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"☆58Updated 2 years ago
- Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023☆88Updated 8 months ago
- Implementation of a multimodal diffusion transformer in Pytorch☆106Updated last year
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…☆204Updated last year
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆95Updated 3 years ago
- JAX implementation ViT-VQGAN☆82Updated 3 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆92Updated 3 years ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆59Updated 11 months ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 3 years ago
- ☆142Updated last year
- Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"☆94Updated last year
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆46Updated last year
- Official PyTorch implementation for Maximum Likelihood Training of Implicit Nonlinear Diffusion Model (INDM) in NeurIPS 2022.☆40Updated last year
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)☆27Updated last year
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆110Updated 2 months ago
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆192Updated 3 years ago
- ☆42Updated last year
- Implementation of a Light Recurrent Unit in Pytorch☆49Updated last year
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆91Updated 2 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆116Updated 2 years ago
- Official PyTorch implementation for FastDPM, a fast sampling algorithm for diffusion probabilistic models☆83Updated 4 years ago
- [ICCV 2023] Online Clustered Codebook☆176Updated last year
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆86Updated last year
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆217Updated 2 years ago
- Locally Hierarchical Auto-Regressive Modeling for Image Generation (HQ-Transformer)☆29Updated last year