aliutkus / speLinks

Relative Positional Encoding for Transformers with Linear Complexity

☆64

Alternatives and similar repositories for spe

Users that are interested in spe are comparing it to the libraries listed below

Sorting:

revsic / jax-variational-diffwave
Jax/Flax implementation of Variational-DiffWave.
☆40Updated 3 years ago
alex-matton / causal-transformer-decoder
☆73Updated 4 years ago
gcambara / cape
Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch
☆41Updated 2 years ago
lucidrains / insertion-deletion-ddpm
Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models
☆30Updated 3 years ago
GoGoDuck912 / pytorch-vector-quantization
A Pytorch Implementations for Various Vector Quantization Methods
☆30Updated 3 years ago
lucidrains / gated-state-spaces-pytorch
Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch
☆101Updated 2 years ago
ag1988 / dss
Sequence Modeling with Structured State Spaces
☆65Updated 2 years ago
teddykoker / performer
Simply Numpy implementation of the FAVOR+ attention mechanism, https://teddykoker.com/2020/11/performers/
☆38Updated 4 years ago
tk-rusch / LEM
Official code for Long Expressive Memory (ICLR 2022, Spotlight)
☆70Updated 3 years ago
L0SG / NanoFlow
PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)
☆66Updated 4 years ago
lucidrains / axial-positional-embedding
Axial Positional Embedding for Pytorch
☆83Updated 4 months ago
yaohungt / TransformerDissection
[EMNLP'19] Summary for Transformer Understanding
☆53Updated 5 years ago
lucidrains / long-short-transformer
Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch
☆119Updated 3 years ago
cpcp1998 / PermuteFormer
Code for the paper PermuteFormer
☆42Updated 3 years ago
lucidrains / perceiver-ar-pytorch
Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch
☆88Updated 2 years ago
ctlllll / SGConv
☆163Updated 2 years ago
NingMiao / InteL-VAEs
Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.
☆19Updated 4 years ago
chrischute / flowplusplus
Implementation of Flow++ in PyTorch
☆40Updated 5 years ago
dwromero / wavelet_networks
Code repository of the paper "Wavelet Networks: Scale-Translation Equivariant Learning From Raw Time-Series, TMLR" https://arxiv.org/abs…
☆82Updated last year
yoyolicoris / variational-diffwave
☆31Updated 2 years ago
lucidrains / kalman-filtering-attention
Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"
☆58Updated last year
f90 / Seq-U-Net
Official implementation of the Seq-U-Net for efficient sequence modelling
☆79Updated 11 months ago
distsup / DistSup
Representation learning for NLP @ JSALT19
☆39Updated 4 years ago
lsj2408 / URPE
[NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)
☆34Updated last year
Rick-McCoy / Reformer-pytorch
Implements Reformer: The Efficient Transformer in pytorch.
☆86Updated 5 years ago
liu-ziyin / NeurIPS_2020_Snake
☆31Updated 3 years ago
lucidrains / memory-transformer-xl
A variant of Transformer-XL where the memory is updated not with a queue, but with attention
☆49Updated 4 years ago
cfoster0 / CLAP
Contrastive Language-Audio Pretraining
☆87Updated 3 years ago
Newbeeer / Anytime-Auto-Regressive-Model
Code for ICLR 2021 Paper, "Anytime Sampling for Autoregressive Models via Ordered Autoencoding"
☆26Updated 2 years ago
lucidrains / memformer
Implementation of Memformer, a Memory-augmented Transformer, in Pytorch
☆119Updated 4 years ago