Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
☆372Sep 26, 2023Updated 2 years ago
Alternatives and similar repositories for FLASH-pytorch
Users that are interested in FLASH-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FLASHQuad_pytorch☆68Apr 1, 2022Updated 4 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆220Feb 13, 2023Updated 3 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆97Feb 24, 2023Updated 3 years ago
- Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"☆393Jul 18, 2023Updated 2 years ago
- ☆107Mar 9, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆207Aug 26, 2023Updated 2 years ago
- Official Pytorch Implementation for "Continual Transformers: Redundancy-Free Attention for Online Inference" [ICLR 2023]☆28Oct 16, 2023Updated 2 years ago
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆32Jun 19, 2022Updated 4 years ago
- Official implementation of 'Transformer-VQ: Linear-Time Transformers via Vector Quantization'☆199Dec 4, 2023Updated 2 years ago
- Implementation of ETSformer, state of the art time-series Transformer, in Pytorch☆154Aug 26, 2023Updated 2 years ago
- GAU-alpha-pytorch☆20May 11, 2022Updated 4 years ago
- Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk☆47Jul 16, 2023Updated 2 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012