bojone / rnn
一些RNN的实现
☆49Updated last year
Alternatives and similar repositories for rnn:
Users that are interested in rnn are comparing it to the libraries listed below
- ☆28Updated 6 months ago
- A Tight-fisted Optimizer☆47Updated last year
- Pytorch implementation of "Block Recurrent Transformers" (Hutchins & Schlag et al., 2022)☆83Updated 2 years ago
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆62Updated 9 months ago
- FLASHQuad_pytorch☆66Updated 2 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆97Updated last year
- 逻辑回归和单层softmax的解析解☆12Updated 3 years ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆51Updated this week
- A pytorch &keras implementation and demo of Fastformer.☆188Updated 2 years ago
- code for Explicit Sparse Transformer☆58Updated last year
- ☆64Updated 5 months ago
- Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?☆17Updated 4 years ago
- ☆14Updated last year
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search☆10Updated last year
- Python下shuffle几百G文件☆33Updated 3 years ago
- 基于Transformer的单模型、多尺度的VAE模型☆55Updated 3 years ago
- [EMNLP'19] Summary for Transformer Understanding☆53Updated 5 years ago
- This is a code repository for the ACL 2022 paper "ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generati…☆29Updated 2 years ago
- Learning to Encode Position for Transformer with Continuous Dynamical Model☆59Updated 4 years ago
- Implementation of the paper "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting", https://arxi…☆18Updated 3 years ago
- ICLR2023 - Tailoring Language Generation Models under Total Variation Distance☆21Updated last year
- Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)☆31Updated 3 years ago
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"☆70Updated last year
- Implementations of various linear RNN layers using pytorch and triton☆49Updated last year
- ☆14Updated last month
- RoFormer升级版☆151Updated 2 years ago
- ☆32Updated 3 years ago
- ☆18Updated last year
- ☆50Updated 2 years ago
- A Tight-fisted Optimizer (Tiger), implemented in PyTorch.☆11Updated 7 months ago