facebookresearch / Qinco
Residual Quantization with Implicit Neural Codebooks
☆49 Updated 4 months ago
Related projects
Alternatives and complementary repositories for Qinco
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆52 Updated last month
- Beyond Straight-Through ☆90 Updated last year
- CUDA implementation of autoregressive linear attention, with all the latest research findings ☆43 Updated last year
- ☆118 Updated 8 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation ☆82 Updated last month
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation) ☆33 Updated last year
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023) ☆120 Updated last year
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch ☆77 Updated 10 months ago
- Speech2Vec Reality Check ☆77 Updated last year
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024) ☆24 Updated 5 months ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching" ☆27 Updated 7 months ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization" ☆95 Updated 2 years ago
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802 ☆88 Updated last year
- ☆51 Updated 5 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto ☆53 Updated 6 months ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch ☆94 Updated this week
- Reference implementation of DecDTW in PyTorch (ICLR 2023) ☆20 Updated last year
- Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding" ☆110 Updated 8 months ago
- ☆29 Updated 2 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆95 Updated last year
- Here we will test various linear attention designs. ☆56 Updated 6 months ago
- Keras implementation of Finite Scalar Quantization ☆64 Updated last year
- Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind ☆112 Updated 3 months ago
- ☆22 Updated last month
- Official code for "Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching" (ICML 2022) ☆53 Updated 2 years ago
- HGRN2: Gated Linear RNNs with State Expansion ☆49 Updated 3 months ago
- JAX bindings for Flash Attention v2 ☆80 Updated 4 months ago
- Randomized Positional Encodings Boost Length Generalization of Transformers ☆78 Updated 8 months ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction" ☆57 Updated last year
- TensorFlow implementation of "Finite Scalar Quantization: VQ-VAE Made Simple" (ICLR 2024) ☆14 Updated 11 months ago