Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch
☆62Dec 31, 2018Updated 7 years ago
Alternatives and similar repositories for Character-Level-Language-Modeling-with-Deeper-Self-Attention-pytorch
Users that are interested in Character-Level-Language-Modeling-with-Deeper-Self-Attention-pytorch are comparing it to the libraries listed below
Sorting:
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- ☆15Aug 22, 2016Updated 9 years ago
- ☆12Nov 11, 2019Updated 6 years ago
- ☆21Nov 16, 2018Updated 7 years ago
- Pytorch Implemetation for our NAACL2019 Paper "Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling" http…☆63Apr 1, 2020Updated 5 years ago
- [NeurIPS 2021] Open Rule Induction☆20May 22, 2022Updated 3 years ago
- ☆15Dec 5, 2019Updated 6 years ago
- Finding Generalizable Evidence by Learning to Convince Q&A Models☆25Jan 5, 2023Updated 3 years ago
- ☆21Oct 23, 2024Updated last year
- Adaptive embedding and softmax☆17Jan 22, 2022Updated 4 years ago
- Seq2seqAttGeneration, an basic implementation of text generation that using seq2seq attention model to generate poem series. this project…☆18Jan 11, 2021Updated 5 years ago
- rich posterior approximations and anomaly detection☆20Mar 15, 2019Updated 6 years ago
- A Probability Reasoning and Semantic Embedding-based Knowledge Graph Alignment System☆25Sep 2, 2022Updated 3 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Jun 1, 2018Updated 7 years ago
- Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model"☆190May 23, 2025Updated 9 months ago
- RNN language model using Tensorflow☆19Jun 29, 2016Updated 9 years ago
- ☆12Mar 21, 2024Updated last year
- a pure-Python PATRICIA trie implementation.☆30Dec 14, 2014Updated 11 years ago
- Subword Language Model for Query Auto-Completion☆67Sep 5, 2019Updated 6 years ago
- Code for SIGDial 2019 Best Paper: Structured Fusion Networks for Dialog https://arxiv.org/abs/1907.10016☆30Aug 19, 2019Updated 6 years ago
- Natural Language Generation by Hierarchical Decoding with Linguistic Patterns (NAACL-HLT 2018), Investigating Linguistic Pattern Ordering…☆32Sep 23, 2018Updated 7 years ago
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Jun 22, 2022Updated 3 years ago
- ☆32Apr 29, 2020Updated 5 years ago
- (AAAI'20) The source code for the paper "Controlling the Amount of Verbatim Copying in Abstractive Summarization".☆32Oct 14, 2020Updated 5 years ago
- A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)☆28May 21, 2021Updated 4 years ago
- TransOMCS is a commonsense knowledge resource transferred from ASER. It is in the format of OMCS but two orders of magnitude larger.☆70Aug 25, 2020Updated 5 years ago
- A public wiki for the deep learning reading group at UC Berkeley☆27Aug 20, 2016Updated 9 years ago
- A TensorFlow implementation of NRTR, a No-Recurrence Seq2Seq Model for Scene Text Recognition☆31Sep 1, 2019Updated 6 years ago
- Information Extraction Dataset Zoo.☆30Apr 9, 2022Updated 3 years ago
- This is the Javascript Code, it helps you to find you visited your Facebook Profile.☆12Sep 13, 2018Updated 7 years ago
- Template and steps to build your personal blog using Jekyll and Minimal Mistake☆10Feb 24, 2020Updated 6 years ago
- AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training☆129Aug 4, 2021Updated 4 years ago
- An implementation of BERT using PyTorch's TransformerEncoder☆32Dec 15, 2019Updated 6 years ago
- [CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog☆34Feb 1, 2021Updated 5 years ago
- chinese coreference resolution, implementation of paper 1606.01323v2 (stanford)☆28Sep 18, 2018Updated 7 years ago
- Cycle-consistent Conditional Adversarial Transfer Networks, ACM MM 2019☆33Sep 18, 2019Updated 6 years ago
- [NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Ya…☆142Dec 30, 2021Updated 4 years ago
- ☆35Apr 8, 2019Updated 6 years ago
- 📃 Set the correct (tab) titles for your arXiv papers containing tabs.☆46Dec 10, 2025Updated 2 months ago