idiap / hypermixing
PyTorch implementation of HyperMixing, a linear-time token-mixing technique used in the HyperMixer architecture
☆25Updated 2 years ago
Alternatives and similar repositories for hypermixing
Users that are interested in hypermixing are comparing it to the libraries listed below
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆56Updated last week
- Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)☆79Updated last year
- an implementation of FAdam (Fisher Adam) in PyTorch☆50Updated 4 months ago
- Randomized Positional Encodings Boost Length Generalization of Transformers☆82Updated last year
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆65Updated last year
- Implementations of various linear RNN layers using pytorch and triton☆54Updated 2 years ago
- Griffin MQA + Hawk Linear RNN Hybrid☆89Updated last year
- ☆72Updated 4 years ago
- End-to-End Speech Processing Toolkit☆15Updated 9 months ago
- Torch implementation of Soft-DTW, supports CUDA.☆45Updated 2 years ago
- Implementation of the proposed minGRU in Pytorch☆306Updated 7 months ago
- ☆26Updated last year
- A State-Space Model with Rational Transfer Function Representation.☆82Updated last year
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆38Updated last year
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆132Updated 2 weeks ago
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model☆127Updated last year
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆146Updated 2 years ago
- Conformer RNN-Transducer☆14Updated 3 years ago
- The project for speech translation☆12Updated 2 years ago
- ☆27Updated last year
- Pytorch implementation of conformer with training script for end-to-end speech recognition on the LibriSpeech dataset.☆26Updated last year
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆127Updated 2 years ago
- ConMamba for Automatic Speech Recognition☆88Updated last year
- Sequence Modeling with Structured State Spaces☆66Updated 3 years ago
- [ICLR 2023] Official implementation of Transnormer in our ICLR 2023 paper - Toeplitz Neural Network for Sequence Modeling☆80Updated last year
- ☆98Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Updated 3 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆90Updated last year
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated 5 months ago
- Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"☆67Updated 3 months ago