LukasHedegaard / continual-transformers
Official Pytorch Implementation for "Continual Transformers: Redundancy-Free Attention for Online Inference" [ICLR 2023]
☆28Updated last year
Alternatives and similar repositories for continual-transformers:
Users that are interested in continual-transformers are comparing it to the libraries listed below
- A Python library for Continual Inference Networks in PyTorch☆49Updated last year
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".☆44Updated 3 years ago
- Implementation of Memformer, a Memory-augmented Transformer, in Pytorch☆107Updated 4 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆50Updated 2 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Updated 6 years ago
- A variant of Transformer-XL where the memory is updated not with a queue, but with attention☆47Updated 4 years ago
- Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"☆14Updated 5 years ago
- Code for the paper PermuteFormer☆42Updated 3 years ago
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch☆51Updated 3 years ago
- Implementation of Multistream Transformers in Pytorch☆53Updated 3 years ago
- ☆22Updated 4 years ago
- A Pytorch Implementations for Various Vector Quantization Methods☆27Updated 3 years ago
- ☆72Updated 3 years ago
- Code for the C2KD paper (ICASSP 2023)☆19Updated last year
- Reference implementation of DecDTW in PyTorch (ICLR 2023)☆20Updated last year
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".☆54Updated last year
- VIsually-Pivoted Audio and(N) Text☆22Updated 2 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated last year
- ☆31Updated 3 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19Updated this week
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated last month
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)☆60Updated 2 years ago
- Implementations of various linear RNN layers using pytorch and triton☆49Updated last year
- ☆28Updated 6 months ago
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch☆57Updated 3 years ago
- Representation learning for NLP @ JSALT19☆38Updated 4 years ago
- A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering☆40Updated 4 years ago
- DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023)☆30Updated last year
- Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"☆16Updated 11 months ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆23Updated last year