LukasHedegaard / continual-transformers
Official Pytorch Implementation for "Continual Transformers: Redundancy-Free Attention for Online Inference" [ICLR 2023]
☆28 Updated last year
Alternatives and similar repositories for continual-transformers
Users who are interested in continual-transformers are comparing it to the repositories listed below.
- A Python library for Continual Inference Networks in PyTorch ☆50 Updated 2 months ago
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention". ☆44 Updated 3 years ago
- Implementation of Memformer, a Memory-augmented Transformer, in Pytorch ☆117 Updated 4 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling ☆19 Updated last week
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi… ☆50 Updated 3 years ago
- Implementation of dynamic temporal pooling (DTP) for time series classification ☆40 Updated 3 years ago
- Implementation of Multistream Transformers in Pytorch ☆54 Updated 3 years ago
- Discriminative Prototypes learned by Dynamic Time Warping (DTW) for Time Series Classification (TSC) ☆32 Updated 4 years ago
- VIsually-Pivoted Audio and(N) Text ☆22 Updated 3 years ago
- ☆73 Updated 4 years ago
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch ☆52 Updated 4 years ago
- Pytorch implementation of Performer from the paper "Rethinking Attention with Performers". ☆25 Updated 4 years ago
- Layerwise Batch Entropy Regularization ☆22 Updated 2 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135 ☆19 Updated 2 years ago
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch ☆57 Updated 4 years ago
- Official implementation of OSSGAN [CVPR 2022] ☆21 Updated 3 years ago
- DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023) ☆30 Updated 2 years ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys" ☆26 Updated 2 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity ☆24 Updated last year
- A Pytorch Implementations for Various Vector Quantization Methods ☆28 Updated 3 years ago
- Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders" ☆14 Updated 6 years ago
- Reference implementation of DecDTW in PyTorch (ICLR 2023) ☆21 Updated last year
- Implementations of various linear RNN layers using pytorch and triton ☆51 Updated last year
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian… ☆24 Updated 2 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION ☆16 Updated 6 years ago
- A 1D implementation of a deformable convolutional layer in PyTorch with a few tricks. ☆41 Updated last year
- A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in y… ☆45 Updated 2 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach ☆20 Updated 3 years ago
- This repository contains the code for our ECCV 2022 paper "Temporal and cross-modal attention for audio-visual zero-shot learning" ☆24 Updated 2 years ago
- A variant of Transformer-XL where the memory is updated not with a queue, but with attention ☆48 Updated 4 years ago