Unofficial PyTorch implementation of Fastformer, based on the paper "Fastformer: Additive Attention Can Be All You Need".
☆131 · Sep 6, 2021 · Updated 4 years ago
Alternatives and similar repositories for Fastformer-PyTorch
Users that are interested in Fastformer-PyTorch are comparing it to the libraries listed below.
- A PyTorch & Keras implementation and demo of Fastformer. ☆192 · Sep 22, 2022 · Updated 3 years ago
- Implementation of Fast Transformer in PyTorch ☆176 · Aug 26, 2021 · Updated 4 years ago
- ☆13 · Aug 13, 2020 · Updated 5 years ago
- FastFormers - highly efficient transformer models for NLU ☆706 · Mar 21, 2025 · Updated last year
- Code for experiments done for EMNLP2020. ☆11 · Dec 8, 2022 · Updated 3 years ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables ☆22 · May 18, 2025 · Updated last year
- Trains Transformer model variants. Data isn't shuffled between batches. ☆147 · Oct 5, 2022 · Updated 3 years ago
- transformers go brrr... ☆148 · Feb 15, 2022 · Updated 4 years ago
- TensorFlow Implementation of "Theory and Experiments on Vector Quantized Autoencoders" ☆15 · Feb 27, 2019 · Updated 7 years ago
- ACL 2021: HiTransformer ☆13 · May 29, 2021 · Updated 5 years ago
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC) ☆30 · Dec 31, 2024 · Updated last year
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self… ☆206 · Aug 17, 2022 · Updated 3 years ago
- Tagger for explicit cause-and-effect relationships in text ☆11 · Jan 8, 2020 · Updated 6 years ago
- On Generating Extended Summaries of Long Documents ☆78 · Jan 26, 2021 · Updated 5 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations" ☆15 · Dec 22, 2022 · Updated 3 years ago
- ☆11 · Jun 21, 2022 · Updated 4 years ago
- CGAT: Channel-aware Graph Attention Networks ☆20 · Mar 24, 2023 · Updated 3 years ago
- WaveGlow vocoder with VQVAE ☆61 · Jun 18, 2019 · Updated 7 years ago
- Implementation of Perceiver, General Perception with Iterative Attention, in PyTorch