nadavbh12 / Character-Level-Language-Modeling-with-Deeper-Self-Attention-pytorchView external linksLinks
Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch
☆62Dec 31, 2018Updated 7 years ago
Alternatives and similar repositories for Character-Level-Language-Modeling-with-Deeper-Self-Attention-pytorch
Users that are interested in Character-Level-Language-Modeling-with-Deeper-Self-Attention-pytorch are comparing it to the libraries listed below
Sorting:
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- ☆15Aug 22, 2016Updated 9 years ago
- ☆12Nov 11, 2019Updated 6 years ago
- ☆21Nov 16, 2018Updated 7 years ago
- Pytorch Implemetation for our NAACL2019 Paper "Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling" http…☆63Apr 1, 2020Updated 5 years ago
- ☆15Dec 5, 2019Updated 6 years ago
- PyTorch implementation of the Feed-Forward Attention Mechanism.☆18Jul 17, 2018Updated 7 years ago
- [NeurIPS 2021] Open Rule Induction☆19May 22, 2022Updated 3 years ago
- ☆21Oct 23, 2024Updated last year
- Finding Generalizable Evidence by Learning to Convince Q&A Models☆25Jan 5, 2023Updated 3 years ago
- Adaptive embedding and softmax☆17Jan 22, 2022Updated 4 years ago
- rich posterior approximations and anomaly detection☆20Mar 15, 2019Updated 6 years ago
- ☆21Jul 24, 2019Updated 6 years ago
- A Probability Reasoning and Semantic Embedding-based Knowledge Graph Alignment System☆25Sep 2, 2022Updated 3 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Jun 1, 2018Updated 7 years ago
- Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model"☆190May 23, 2025Updated 8 months ago
- ☆12Mar 21, 2024Updated last year
- Code repo for "Transformer on a Diet" paper☆31Jun 22, 2020Updated 5 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆57Jan 1, 2021Updated 5 years ago
- Subword Language Model for Query Auto-Completion☆67Sep 5, 2019Updated 6 years ago
- Abstractive Multi-Document Summarisation, generating Wikipedia lead sections for specific domains. Exploiting target summaries content st…☆29Sep 29, 2022Updated 3 years ago
- Code for SIGDial 2019 Best Paper: Structured Fusion Networks for Dialog https://arxiv.org/abs/1907.10016☆30Aug 19, 2019Updated 6 years ago
- Markdown to Telegram MarkdownV2 Converter☆13Jul 15, 2024Updated last year
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Jun 22, 2022Updated 3 years ago
- (AAAI'20) The source code for the paper "Controlling the Amount of Verbatim Copying in Abstractive Summarization".☆32Oct 14, 2020Updated 5 years ago
- A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)☆28May 21, 2021Updated 4 years ago
- TransOMCS is a commonsense knowledge resource transferred from ASER. It is in the format of OMCS but two orders of magnitude larger.☆69Aug 25, 2020Updated 5 years ago
- All NLP experiments described in ArXiv paper 1904.02682☆33Jun 24, 2019Updated 6 years ago
- A public wiki for the deep learning reading group at UC Berkeley☆27Aug 20, 2016Updated 9 years ago
- Reformer, the efficient Transformer, in Pytorch☆2,193Jun 21, 2023Updated 2 years ago
- A TensorFlow implementation of NRTR, a No-Recurrence Seq2Seq Model for Scene Text Recognition☆31Sep 1, 2019Updated 6 years ago
- Information Extraction Dataset Zoo.☆30Apr 9, 2022Updated 3 years ago
- Zheng Zhao's doctoral dissertation from Aalto University☆35Oct 10, 2022Updated 3 years ago
- This is the Javascript Code, it helps you to find you visited your Facebook Profile.☆12Sep 13, 2018Updated 7 years ago
- Template and steps to build your personal blog using Jekyll and Minimal Mistake☆10Feb 24, 2020Updated 5 years ago
- Deep Adaptive Image Clustering Paper Implementation☆31Dec 29, 2018Updated 7 years ago
- An implementation of BERT using PyTorch's TransformerEncoder☆32Dec 15, 2019Updated 6 years ago
- Pre-training of Language Models for Language Understanding☆83Aug 24, 2019Updated 6 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆78Jul 16, 2022Updated 3 years ago