Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."
☆132Sep 6, 2021Updated 4 years ago
Alternatives and similar repositories for Fastformer-PyTorch
Users that are interested in Fastformer-PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Fast Transformer in Pytorch☆176Aug 26, 2021Updated 4 years ago
- ☆13Aug 13, 2020Updated 5 years ago
- FastFormers - highly efficient transformer models for NLU☆709Mar 21, 2025Updated last year
- Code for experiments done for EMNLP2020.☆11Dec 8, 2022Updated 3 years ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆22May 18, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆19Oct 10, 2020Updated 5 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆13Apr 15, 2025Updated 11 months ago
- FairSeq repo with Apollo optimizer☆113Dec 20, 2023Updated 2 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- Trains Transformer model variants. Data isn't shuffled between batches.☆143Oct 5, 2022Updated 3 years ago
- An implementation of Additive Attention☆148Feb 15, 2022Updated 4 years ago
- Another attempt at a long-context / efficient transformer by me☆38Apr 11, 2022Updated 4 years ago
- An example using Jupyter-React and Jupyter-React-JS in a Jupyter Notebook☆16Oct 11, 2016Updated 9 years ago
- Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"☆15Feb 27, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ACL 2021: HiTransformer☆13May 29, 2021Updated 4 years ago
- ☆28Oct 6, 2020Updated 5 years ago
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Dec 31, 2024Updated last year
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Nov 29, 2021Updated 4 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆206Aug 17, 2022Updated 3 years ago
- An open-source AutoML Library based on PyTorch☆308Apr 6, 2026Updated last week
- On Generating Extended Summaries of Long Documents☆78Jan 26, 2021Updated 5 years ago
- ☆11Jun 21, 2022Updated 3 years ago
- WaveGlow vocoder with VQVAE☆61Jun 18, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆1,200Aug 22, 2023Updated 2 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Dec 19, 2018Updated 7 years ago
- ☆75Jul 2, 2021Updated 4 years ago
- ☆17Sep 22, 2020Updated 5 years ago
- GPT-J 6B inference on TensorRT with INT-8 precision☆11Apr 5, 2023Updated 3 years ago
- ☆31Jan 16, 2021Updated 5 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Some demos using Nvidia RAPIDS for Cheminformatics☆13Aug 17, 2020Updated 5 years ago
- Pytorch implementation of Compressive Transformers, from Deepmind☆164Oct 4, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- SummVis is an interactive visualization tool for text summarization.☆254Jun 17, 2022Updated 3 years ago
- ☆20Jan 31, 2021Updated 5 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆63Apr 30, 2024Updated last year
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆199Dec 2, 2022Updated 3 years ago
- ☆12Nov 19, 2024Updated last year
- An intelligent, flexible grammar of machine learning.☆82Jul 29, 2021Updated 4 years ago
- Generating Training Data Made Easy☆43Jul 3, 2020Updated 5 years ago