lucidrains / linformerLinks

Implementation of Linformer for Pytorch

☆300

Alternatives and similar repositories for linformer

Users that are interested in linformer are comparing it to the libraries listed below

Sorting:

lucidrains / local-attention
An implementation of local windowed attention for language modeling
☆483Updated 3 months ago
NVIDIA / transformer-ls
Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).
☆228Updated 3 years ago
lucidrains / linear-attention-transformer
Transformer based on a variant of attention that is linear complexity in respect to sequence length
☆801Updated last year
rishikksh20 / FNet-pytorch
Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms
☆259Updated 4 years ago
lucidrains / FLASH-pytorch
Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
☆369Updated 2 years ago
tatp22 / linformer-pytorch
My take on a practical implementation of Linformer for Pytorch.
☆421Updated 3 years ago
lucidrains / memory-efficient-attention-pytorch
Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"
☆383Updated 2 years ago
erksch / fnet-pytorch
Unofficial PyTorch implementation of Google's FNet: Mixing Tokens with Fourier Transforms. With checkpoints.
☆77Updated 3 years ago
ctlllll / SGConv
☆164Updated 2 years ago
lucidrains / routing-transformer
Fully featured implementation of Routing Transformer
☆296Updated 3 years ago
facebookresearch / mega
Sequence modeling with Mega.
☆300Updated 2 years ago
lucidrains / memformer
Implementation of Memformer, a Memory-augmented Transformer, in Pytorch
☆123Updated 4 years ago
rish-16 / aft-pytorch
Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.
☆243Updated 3 years ago
lucidrains / flash-cosine-sim-attention
Implementation of fused cosine similarity attention in the same style as Flash Attention
☆217Updated 2 years ago
cmsflash / efficient-attention
An implementation of the efficient attention module.
☆321Updated 4 years ago
OpenNLPLab / cosFormer
[ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention
☆196Updated 2 years ago
lucidrains / block-recurrent-transformer-pytorch
Implementation of Block Recurrent Transformer - Pytorch
☆221Updated last year
lucidrains / compressive-transformer-pytorch
Pytorch implementation of Compressive Transformers, from Deepmind
☆162Updated 4 years ago
lucidrains / Mega-pytorch
Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
☆206Updated 2 years ago
lucidrains / rotary-embedding-torch
Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch
☆769Updated 3 months ago
lucidrains / nystrom-attention
Implementation of Nyström Self-attention, from the paper Nyströmformer
☆141Updated 7 months ago
bzhangGo / rmsnorm
Root Mean Square Layer Normalization
☆256Updated 2 years ago
lucidrains / h-transformer-1d
Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning
☆165Updated last year
lucidrains / st-moe-pytorch
Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch
☆363Updated last year
DeMoriarty / fast_pytorch_kmeans
This is a pytorch implementation of k-means clustering algorithm
☆328Updated 7 months ago
lucidrains / sinkhorn-transformer
Sinkhorn Transformer - Practical implementation of Sparse Sinkhorn Attention
☆268Updated 4 years ago
lucidrains / axial-attention
Implementation of Axial attention - attending to multi-dimensional data efficiently
☆387Updated 4 years ago
lucidrains / g-mlp-pytorch
Implementation of gMLP, an all-MLP replacement for Transformers, in Pytorch
☆430Updated 4 years ago
lucidrains / mlp-mixer-pytorch
An All-MLP solution for Vision, from Google AI
☆1,050Updated 3 months ago
DeMoriarty / TorchPQ
Approximate nearest neighbor search with product quantization on GPU in pytorch and cuda
☆228Updated last year