sliced-rnn
☆469Nov 24, 2018Updated 7 years ago
Alternatives and similar repositories for srnn
Users that are interested in srnn are comparing it to the libraries listed below
Sorting:
- Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction☆498May 8, 2021Updated 4 years ago
- Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)☆2,111Jan 4, 2022Updated 4 years ago
- The Tensorflow code for this ACL 2018 paper: "Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mech…☆288Dec 17, 2022Updated 3 years ago
- Implemented transformer NN block for Machine translation, text classfication, Natural language inference as well as Machine reading compr…☆11Mar 1, 2026Updated 3 weeks ago
- The Natural Language Decathlon: A Multitask Challenge for NLP☆2,339May 1, 2025Updated 10 months ago
- [ICLR'19] Trellis Networks for Sequence Modeling☆473Aug 20, 2019Updated 6 years ago
- Reversible Recurrent Neural Network Pytorch Implementation☆21Dec 6, 2017Updated 8 years ago
- An optimizer that trains as fast as Adam and as good as SGD.☆2,909Jul 23, 2023Updated 2 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,611Aug 12, 2020Updated 5 years ago
- some attention implements☆1,452Nov 20, 2019Updated 6 years ago
- PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM☆1,264Feb 12, 2022Updated 4 years ago
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character …☆1,896Jun 30, 2022Updated 3 years ago
- A pytorch implementation of FFTNet.☆37Aug 31, 2018Updated 7 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,176May 28, 2023Updated 2 years ago
- Single Headed Attention RNN - "Stop thinking with your head"☆1,181Nov 27, 2021Updated 4 years ago
- Training RNNs as Fast as CNNs (Simple Recurrent Unit)☆32Sep 27, 2017Updated 8 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,613Mar 4, 2023Updated 3 years ago
- Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks (ICLR 2018)☆124Mar 24, 2023Updated 2 years ago
- Tensorflow implementation for DilatedRNN☆354Oct 24, 2017Updated 8 years ago
- Convenient hyperparameter optimization☆14Apr 30, 2024Updated last year
- Tensorflow implementation of "Language Modeling with Gated Convolutional Networks"☆273Jan 16, 2017Updated 9 years ago
- Facilitating the design, comparison and sharing of deep text matching models.☆3,855Aug 2, 2024Updated last year
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆268Oct 24, 2019Updated 6 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit☆3,730Sep 17, 2021Updated 4 years ago
- Code for the Eager Translation Model from the paper You May Not Need Attention☆294Dec 17, 2018Updated 7 years ago
- TensorFlow implementation of Independently Recurrent Neural Networks☆513Aug 19, 2021Updated 4 years ago
- An open-source NLP research library, built on PyTorch.☆11,893Nov 22, 2022Updated 3 years ago
- Code for the article "Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification" (EMNLP 2018)☆155Jan 8, 2019Updated 7 years ago
- A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆205Nov 6, 2018Updated 7 years ago
- Unsupervised neural machine translation; weight sharing; GAN☆94Sep 26, 2018Updated 7 years ago
- all kinds of text classification models and more with deep learning☆7,950Sep 28, 2023Updated 2 years ago
- A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need☆714Sep 24, 2021Updated 4 years ago
- TensorFlow implementation [ICLR 18] "Learning Approximate Inference Networks for Structured Prediction"☆30Jun 10, 2018Updated 7 years ago
- An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch.☆246Dec 27, 2018Updated 7 years ago
- Basic wavenet and fftnet vocoder model.☆19Feb 7, 2022Updated 4 years ago
- A TensorFlow Implementation of the Transformer: Attention Is All You Need☆4,459May 21, 2023Updated 2 years ago
- Code for the paper☆11May 24, 2024Updated last year
- Nested LSTM Cell☆252Mar 25, 2018Updated 7 years ago
- Code and model for the paper "Improving Language Understanding by Generative Pre-Training"☆2,283Jan 25, 2019Updated 7 years ago