lucidrains / RQ-Transformer

Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"

☆112

Alternatives and similar repositories for RQ-Transformer

Users who are interested in RQ-Transformer are comparing it to the libraries listed below.

lucidrains / rvq-vae-gpt
My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation
☆88 Updated 9 months ago
patil-suraj / simple-diffusion
An implementation of simple diffusion in PyTorch (and JAX)
☆35 Updated 2 years ago
lucidrains / adam-atan2-pytorch
Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch
☆112 Updated 8 months ago
lucidrains / hourglass-transformer-pytorch
Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI
☆91 Updated 3 years ago
lucidrains / retrieval-augmented-ddpm
Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch
☆65 Updated 3 years ago
byeonghu-na / INDM
Official PyTorch implementation for Maximum Likelihood Training of Implicit Nonlinear Diffusion Model (INDM) in NeurIPS 2022.
☆40 Updated last year
patil-suraj / vit-vqgan
JAX implementation ViT-VQGAN
☆83 Updated 2 years ago
lucidrains / NWT-pytorch
Implementation of NWT, audio-to-video generation, in Pytorch
☆91 Updated 3 years ago
taehong-moon / ee-diffusion
Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'
☆19 Updated last year
rosinality / nansy-pytorch
Unofficial implementation of Neural Analysis and Synthesis
☆7 Updated 3 years ago
xrenaa / Retriever
[ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"
☆54 Updated 2 years ago
google-research / diffstride
TF/Keras code for DiffStride, a pooling layer with learnable strides.
☆124 Updated 3 years ago
lucidrains / gradnorm-pytorch
A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch
☆100 Updated last year
lucidrains / light-recurrent-unit-pytorch
Implementation of a Light Recurrent Unit in Pytorch
☆48 Updated 10 months ago
kylehkhsu / latent_quantization
☆41 Updated last year
giannisdaras / cdm
[NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"
☆57 Updated 2 years ago
tmabraham / Trans-CycleGAN
A convolution-free, transformer-only version of the CycleGAN framework
☆33 Updated 3 years ago
lucidrains / perceiver-ar-pytorch
Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch
☆89 Updated 2 years ago
dongzhuoyao / uspace
An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"
☆42 Updated last year
sangyun884 / fast-ode
Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023
☆85 Updated 5 months ago
jadehaus / preference-flow-matching
Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)
☆57 Updated 8 months ago
Jack000 / DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
☆89 Updated 3 years ago
lucidrains / multimodal-dit-pytorch
Implementation of a multimodal diffusion transformer in Pytorch
☆102 Updated last year
zhifengkong / FastDPM_pytorch
Official PyTorch implementation for FastDPM, a fast sampling algorithm for diffusion probabilistic models
☆83 Updated 4 years ago
lucidrains / recurrent-interface-network-pytorch
Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…
☆205 Updated last year
apple / ml-agm
☆44 Updated last year
atosystem / SpeechCLIP
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
☆115 Updated 2 years ago
mmathew23 / improved_edm
Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"
☆95 Updated last year
lucidrains / insertion-deletion-ddpm
Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models
☆30 Updated 3 years ago
kakaobrain / magvlt
The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)
☆26 Updated last year