LUMIA-Group / FourierTransformer

The official PyTorch implementation of the paper "Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator" (ACL 2023 Findings)

☆38

Alternatives and similar repositories for FourierTransformer

Users who are interested in FourierTransformer are comparing it to the repositories listed below.

AmeenAli / HiddenMambaAttn
Official PyTorch Implementation of "The Hidden Attention of Mamba Models"
☆224 · Updated last year
Adamdad / rational_kat_cu
☆67 · Updated 5 months ago
PKU-ML / non_neg
Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning
☆45 · Updated last year
WailordHe / DenseSSM
A repository for DenseSSMs
☆87 · Updated last year
kyegomez / Griffin
Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
☆55 · Updated 3 months ago
Hprairie / Bi-Mamba2
A Triton Kernel for incorporating Bi-Directionality in Mamba2
☆71 · Updated 6 months ago
nanowell / Differential-Transformer-PyTorch
PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model …
☆71 · Updated 8 months ago
lucidrains / agent-attention-pytorch
Implementation of Agent Attention in Pytorch
☆90 · Updated last year
assafbk / DeciMamba
DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)
☆28 · Updated 3 months ago
OpenNLPLab / HGRN
[NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…
☆66 · Updated last year
transformer-vq / transformer_vq
☆195 · Updated last year
MambaMixer / M2
☆47 · Updated last year
pengzhangzhi / Awesome-Mamba
Awesome list of papers that extend Mamba to various applications.
☆134 · Updated last month
Itamarzimm / UnifiedImplicitAttnRepr
[ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation
☆43 · Updated 4 months ago
OpenNLPLab / Tnn
[ICLR 2023] Official implementation of Transnormer in our ICLR 2023 paper - Toeplitz Neural Network for Sequence Modeling
☆79 · Updated last year
krafton-ai / mambaformer-icl
MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248
☆55 · Updated last year
OpenNLPLab / HGRN2
HGRN2: Gated Linear RNNs with State Expansion
☆55 · Updated 10 months ago
dongzhuoyao / flowseq
An official pytorch implementation of EACL2024 short paper "Flow Matching for Conditional Text Generation in a Few Sampling Steps"
☆18 · Updated last year
berlino / gated_linear_attention
☆105 · Updated last year
goombalab / hydra
Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"
☆140 · Updated 5 months ago
badripatro / mamba360
State Space Models
☆68 · Updated last year
TsinghuaC3I / SoRA
[EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models
☆79 · Updated last year
LCM-Lab / Bridge_Gap_Diffusion
☆34 · Updated last year
justinlovelace / latent-diffusion-for-language
☆136 · Updated last year
bwconrad / soft-moe
PyTorch implementation of "From Sparse to Soft Mixtures of Experts"
☆58 · Updated last year
igul222 / plaid
☆103 · Updated 2 years ago
yikangshen / MoA
Mixture of Attention Heads
☆47 · Updated 2 years ago
lsj2408 / URPE
[NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)
☆34 · Updated last year
chuanyang-Zheng / DAPE
This is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"
☆38 · Updated 9 months ago
FarnoushRJ / MambaLRP
[NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models".
☆41 · Updated 8 months ago