bojone / rnn

Some RNN implementations

☆51

Alternatives and similar repositories for rnn

Users that are interested in rnn are comparing it to the libraries listed below

dashstander / block-recurrent-transformer
PyTorch implementation of "Block Recurrent Transformers" (Hutchins & Schlag et al., 2022)
☆84 Updated 3 years ago
ZhuiyiTechnology / GAU-alpha
Transformer model based on the Gated Attention Unit (preview release)
☆98 Updated 2 years ago
wuch15 / Fastformer
A PyTorch & Keras implementation and demo of Fastformer.
☆190 Updated 3 years ago
bojone / univae
A single-model, multi-scale VAE based on the Transformer
☆57 Updated 4 years ago
BlinkDL / minGPT-tuned
A *tuned* minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆118 Updated 4 years ago
OpenNLPLab / HGRN
[NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…
☆65 Updated last year
taolei87 / sru
Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)
☆35 Updated 4 years ago
Doraemonzzz / hgru-pytorch
☆27 Updated last year
JunnYu / FLASHQuad_pytorch
FLASHQuad_pytorch
☆68 Updated 3 years ago
libeineu / ODE-Transformer
This is a code repository for the ACL 2022 paper "ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generati…
☆35 Updated 3 years ago
DRSY / EMO
[ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691)
☆126 Updated last year
lucidrains / FLASH-pytorch
Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
☆369 Updated 2 years ago
microsoft / EfficientLongSequenceModeling
☆51 Updated 2 years ago
bojone / analytical-classification
Analytical solutions for logistic regression and single-layer softmax
☆12 Updated 4 years ago
easysam / Autoformer
Implementation of the paper "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting", https://arxi…
☆19 Updated 4 years ago
bojone / shuffle
Shuffling files of several hundred GB in Python
☆33 Updated 4 years ago
OpenNLPLab / Transnormer
[EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer
☆63 Updated 2 years ago
thu-coai / TaiLr
[ICLR 2023] Tailoring Language Generation Models under Total Variation Distance
☆21 Updated 2 years ago
bojone / tiger
A Tight-fisted Optimizer
☆50 Updated 2 years ago
yaohungt / TransformerDissection
[EMNLP'19] Summary for Transformer Understanding
☆53 Updated 5 years ago
Noahs-ARK / RFA
☆33 Updated 4 years ago
FreedomIntelligence / complex-order
☆84 Updated 5 years ago
OpenNLPLab / Tnn
[ICLR 2023] Official implementation of TNN in our ICLR 2023 paper - Toeplitz Neural Network for Sequence Modeling
☆80 Updated last year
CyberZHG / torch-multi-head-attention
Multi-head attention in PyTorch
☆154 Updated 6 years ago
lancopku / Explicit-Sparse-Transformer
Code for Explicit Sparse Transformer
☆61 Updated 2 years ago
1140310118 / tdlm
Implements several positional encoding schemes used in Transformers
☆44 Updated 4 years ago
rishikksh20 / rectified-linear-attention
Sparse Attention with Linear Units
☆19 Updated 4 years ago
kyegomez / Griffin
Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
☆56 Updated this week
Mryangkaitong / big_dataloader
☆37 Updated 3 years ago
aliutkus / spe
Relative Positional Encoding for Transformers with Linear Complexity
☆65 Updated 3 years ago