lucidrains/fast-transformer-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucidrains/fast-transformer-pytorch)

lucidrains / fast-transformer-pytorch

Implementation of Fast Transformer in Pytorch

☆176

Alternatives and similar repositories for fast-transformer-pytorch

Users that are interested in fast-transformer-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / remixer-pytorch
View on GitHub
Implementation of the Remixer Block from the Remixer paper, in Pytorch
☆36Sep 27, 2021Updated 4 years ago
wuch15 / Fastformer
View on GitHub
A pytorch &keras implementation and demo of Fastformer.
☆192Sep 22, 2022Updated 3 years ago
lucidrains / multistream-transformers
View on GitHub
Implementation of Multistream Transformers in Pytorch
☆54Jul 31, 2021Updated 4 years ago
lucidrains / long-short-transformer
View on GitHub
Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch
☆120Aug 4, 2021Updated 4 years ago
lucidrains / panoptic-transformer
View on GitHub
Another attempt at a long-context / efficient transformer by me
☆38Apr 11, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
antofuller / configaformers
View on GitHub
A python library for highly configurable transformers - easing model architecture search and experimentation.
☆48Nov 30, 2021Updated 4 years ago
wilile26811249 / Fastformer-PyTorch
View on GitHub
Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."
☆131Sep 6, 2021Updated 4 years ago
lucidrains / HTM-pytorch
View on GitHub
Implementation of Hierarchical Transformer Memory (HTM) for Pytorch
☆74Sep 15, 2021Updated 4 years ago
lucidrains / token-shift-gpt
View on GitHub
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49Jan 27, 2022Updated 4 years ago
lucidrains / einops-exts
View on GitHub
Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️
☆57Jan 5, 2023Updated 3 years ago
lucidrains / lie-transformer-pytorch
View on GitHub
Implementation of Lie Transformer, Equivariant Self-Attention, in Pytorch
☆98Feb 19, 2021Updated 5 years ago
lucidrains / local-attention-flax
View on GitHub
Local Attention - Flax module for Jax
☆22May 26, 2021Updated 5 years ago
lucidrains / hourglass-transformer-pytorch
View on GitHub
Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI
☆99Dec 31, 2021Updated 4 years ago
lucidrains / rela-transformer
View on GitHub
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Apr 6, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lucidrains / cross-transformers-pytorch
View on GitHub
Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch
☆54Mar 30, 2021Updated 5 years ago
lucidrains / g-mlp-gpt
View on GitHub
GPT, but made only out of MLPs
☆89May 25, 2021Updated 5 years ago
lucidrains / compressive-transformer-pytorch
View on GitHub
Pytorch implementation of Compressive Transformers, from Deepmind
☆165Oct 4, 2021Updated 4 years ago
lucidrains / memformer
View on GitHub
Implementation of Memformer, a Memory-augmented Transformer, in Pytorch
☆126Nov 13, 2020Updated 5 years ago
lucidrains / adjacent-attention-network
View on GitHub
Graph neural network message passing reframed as a Transformer with local attention
☆70Dec 24, 2022Updated 3 years ago
lucidrains / g-mlp-pytorch
View on GitHub
Implementation of gMLP, an all-MLP replacement for Transformers, in Pytorch
☆431Aug 14, 2021Updated 4 years ago
lucidrains / mlp-gpt-jax
View on GitHub
A GPT, made only of MLPs, in Jax
☆59Jun 23, 2021Updated 5 years ago
lucidrains / contrastive-learner
View on GitHub
A simple to use pytorch wrapper for contrastive self-supervised learning on any neural network
☆153Mar 12, 2021Updated 5 years ago
lucidrains / tranception-pytorch
View on GitHub
Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction
☆32Jun 19, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lucidrains / coco-lm-pytorch
View on GitHub
Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch
☆46Mar 3, 2021Updated 5 years ago
lucidrains / triton-transformer
View on GitHub
Implementation of a Transformer, but completely in Triton
☆279Apr 5, 2022Updated 4 years ago
lucidrains / h-transformer-1d
View on GitHub
Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning
☆167Feb 12, 2024Updated 2 years ago
lucidrains / charformer-pytorch
View on GitHub
Implementation of the GBST block from the Charformer paper, in Pytorch
☆118Jul 15, 2021Updated 5 years ago
lucidrains / tableformer-pytorch
View on GitHub
Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch
☆39Mar 29, 2022Updated 4 years ago
lucidrains / gateloop-transformer
View on GitHub
Implementation of GateLoop Transformer in Pytorch and Jax
☆93Jun 18, 2024Updated 2 years ago
lucidrains / halonet-pytorch
View on GitHub
Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones
☆199Mar 24, 2021Updated 5 years ago
lucidrains / ponder-transformer
View on GitHub
Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper
☆84Oct 30, 2021Updated 4 years ago
lucidrains / product-key-memory
View on GitHub
Standalone Product Key Memory module in Pytorch - for augmenting Transformer models
☆87Nov 1, 2025Updated 8 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
lucidrains / insertion-deletion-ddpm
View on GitHub
Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models
☆30May 31, 2022Updated 4 years ago
lucidrains / feedback-transformer-pytorch
View on GitHub
Implementation of Feedback Transformer in Pytorch
☆108Mar 2, 2021Updated 5 years ago
lucidrains / perceiver-pytorch
View on GitHub
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
☆1,217Jun 8, 2026Updated last month
lucidrains / gated-state-spaces-pytorch
View on GitHub
Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch
☆101Feb 25, 2023Updated 3 years ago
lucidrains / linear-attention-transformer
View on GitHub
Transformer based on a variant of attention that is linear complexity in respect to sequence length
☆842May 5, 2024Updated 2 years ago
lucidrains / invariant-point-attention
View on GitHub
Implementation of Invariant Point Attention, used for coordinate refinement in the structure module of Alphafold2, as a standalone Pytorc…
☆171Nov 25, 2022Updated 3 years ago
lucidrains / learning-to-expire-pytorch
View on GitHub
An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain
☆34Oct 30, 2020Updated 5 years ago