LukasHedegaard / continual-transformersLinks
Official Pytorch Implementation for "Continual Transformers: Redundancy-Free Attention for Online Inference" [ICLR 2023]
☆28Updated 2 years ago
Alternatives and similar repositories for continual-transformers
Users that are interested in continual-transformers are comparing it to the libraries listed below
Sorting:
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".☆44Updated 4 years ago
- A Python library for Continual Inference Networks in PyTorch☆53Updated 8 months ago
- ☆72Updated 4 years ago
- Implementation of Multistream Transformers in Pytorch☆54Updated 4 years ago
- Implementation of Memformer, a Memory-augmented Transformer, in Pytorch☆124Updated 5 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆53Updated 3 years ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆120Updated 4 years ago
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch☆54Updated 4 years ago
- VIsually-Pivoted Audio and(N) Text☆22Updated 3 years ago
- Code for Discriminative Sounding Objects Localization (NeurIPS 2020)☆59Updated 3 years ago
- A minimal pytorch package implementing a gradient reversal layer.☆158Updated last year
- Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch☆74Updated 4 years ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 3 years ago
- ☆31Updated 4 years ago
- A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.☆131Updated 11 months ago
- Relative Positional Encoding for Transformers with Linear Complexity☆65Updated 3 years ago
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".☆57Updated 2 years ago
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset☆58Updated 4 years ago
- Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.o…☆46Updated 4 years ago
- Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch☆42Updated 2 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆228Updated 3 years ago
- ☆69Updated 3 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- Pytorch Implementation for "Preserving Linear Separability in Continual Learning by Backward Feature Projection" (CVPR 2023)☆18Updated 2 years ago
- A variant of Transformer-XL where the memory is updated not with a queue, but with attention☆49Updated 5 years ago
- A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.☆49Updated 2 months ago
- Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.☆73Updated 4 years ago
- [TMLR 2022] High-Modality Multimodal Transformer☆117Updated last year
- This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and …☆40Updated 2 years ago
- ☆52Updated 2 years ago