Gladys-Zhao/mRNN-mLSTM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Gladys-Zhao/mRNN-mLSTM)

Gladys-Zhao / mRNN-mLSTM

Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?

☆17

Alternatives and similar repositories for mRNN-mLSTM

Users that are interested in mRNN-mLSTM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sustcsonglin / disco-pointer
View on GitHub
Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …
☆14Aug 25, 2023Updated 2 years ago
Noahs-ARK / PaLM
View on GitHub
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10Jan 7, 2020Updated 6 years ago
bdusell / stack-attention
View on GitHub
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18Mar 15, 2024Updated 2 years ago
rishikksh20 / rectified-linear-attention
View on GitHub
Sparse Attention with Linear Units
☆20Apr 21, 2021Updated 5 years ago
rycolab / aflt-f2023
View on GitHub
Advanced Formal Language Theory (263-5352-00L; Frühjahr 2023)
☆10Feb 21, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
KurochkinAlexey / AntisymmetricRNN
View on GitHub
Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"
☆15Aug 2, 2019Updated 6 years ago
yikangshen / megablocks
View on GitHub
☆20May 30, 2024Updated 2 years ago
JRC1995 / Continuous-RvNN
View on GitHub
Official Repository for "Modeling Hierarchical Structures with Continuous Recursive Neural Networks" (ICML 2021)
☆12Aug 18, 2021Updated 4 years ago
WentaoZhan1998 / geospaNN
View on GitHub
☆20Jun 8, 2026Updated last month
proger / nanokitchen
View on GitHub
Parallel Associative Scan for Language Models
☆18Jan 8, 2024Updated 2 years ago
cyk1337 / Highway-Transformer
View on GitHub
[ACL‘20] Highway Transformer: A Gated Transformer.
☆33Dec 5, 2021Updated 4 years ago
maximzubkov / fft-scan
View on GitHub
Efficient PScan implementation in PyTorch
☆17Jan 2, 2024Updated 2 years ago
johanwind / wind_rwkv
View on GitHub
☆27Feb 26, 2026Updated 5 months ago
MangoKiller / SimOAR_OAR
View on GitHub
☆11Nov 8, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Doraemonzzz / hgru-pytorch
View on GitHub
☆29Jul 9, 2024Updated 2 years ago
dangxingyu / rnn-icrag
View on GitHub
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27Apr 17, 2024Updated 2 years ago
Aditya239233 / GNNExplainer
View on GitHub
Code for running experiments and benchmarking on GNNExplainer: Generating Explanations for Graph Neural Networks
☆15May 8, 2021Updated 5 years ago
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
X-rayLaser / multi-directional-mdrnn
View on GitHub
Custom Keras layers for implementing multi-dimensional recurrent neural networks (MDRNNs) described in Alex Graves's paper https://arxiv.…
☆10Apr 27, 2020Updated 6 years ago
Doraemonzzz / xmixers
View on GitHub
Xmixers: A collection of SOTA efficient token/channel mixers
☆29Sep 4, 2025Updated 10 months ago
zwd2016 / multivariate-time-series-prediction
View on GitHub
This code is the implementation of this paper (Multistage attention network for multivariate time series prediction)
☆23Apr 24, 2020Updated 6 years ago
foxlf823 / DilatedRnn
View on GitHub
A PyTorch implement of Dilated RNN
☆11Dec 31, 2017Updated 8 years ago
ermongroup / fast_feedforward_computation
View on GitHub
Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021
☆30Sep 25, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
00ffcc / chunkRWKV6
View on GitHub
continous batching and parallel acceleration for RWKV6
☆23Jun 28, 2024Updated 2 years ago
zhu-minjun / SafetyLock
View on GitHub
Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!
☆11Oct 16, 2024Updated last year
sandialabs / quinn
View on GitHub
Quantification of Uncertainties in Neural Networks
☆11Feb 25, 2026Updated 5 months ago
renll / SeqBoat
View on GitHub
[NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling
☆40Dec 2, 2023Updated 2 years ago
expz / annotated-hyena
View on GitHub
An annotated implementation of the Hyena Hierarchy paper
☆34May 28, 2023Updated 3 years ago
131250208 / InfExtraction
View on GitHub
☆24Oct 26, 2022Updated 3 years ago
PredictiveIntelligenceLab / UQDeepONet
View on GitHub
☆11Mar 18, 2023Updated 3 years ago
NGMLGroup / Koopman-TGNN-Interpretability
View on GitHub
Repository for the paper "Interpreting Temporal Graph Neural Networks with Koopman Theory"
☆12Apr 7, 2026Updated 3 months ago
subho406 / agalite
View on GitHub
AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)
☆24Oct 15, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
RachelCmy / den2vel
View on GitHub
The tensorflow implementation of the paper, "Learning Meaningful Controls for Fluids" (SIGGRAPH 2021 https://rachelcmy.github.io/den2vel/…
☆26Sep 13, 2023Updated 2 years ago
gerdm / rebayes-mini
View on GitHub
Minimalist version of probml/rebayes
☆10Apr 9, 2026Updated 3 months ago
lucidrains / gateloop-transformer
View on GitHub
Implementation of GateLoop Transformer in Pytorch and Jax
☆93Jun 18, 2024Updated 2 years ago
lsj2408 / URPE
View on GitHub
[NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)
☆35Aug 6, 2023Updated 2 years ago
assafbk / DeciMamba
View on GitHub
DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)
☆32Apr 9, 2025Updated last year
leanderloew / ES-RNN-Pytorch
View on GitHub
This is a work in progress Pytorch implementation of the recently proposed ES-RNN by Slawek Smyl, winner of the M4 competition
☆12Apr 9, 2019Updated 7 years ago
nanowell / Q-Sparse-LLM
View on GitHub
My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
☆37Aug 14, 2024Updated last year