domerin0 / opensubtitles-parser
download, extract, parse and tokenize the opensubtitles dataset with this script
☆45Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for opensubtitles-parser
- A repository linking to publicly available dialog datasets. Feel free to send pull requests.☆66Updated 2 years ago
- DSTC6: End-to-End Conversation Modeling Track☆57Updated 6 years ago
- Easy to use scripts for evaluating word vectors on a variety of tasks.☆120Updated 3 years ago
- Decomposable Attention Model for Sentence Pair Classification (from https://arxiv.org/abs/1606.01933)☆96Updated 7 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆64Updated last year
- An updated version of the Parser-v1 repo, used for Stanford's submission in the CoNLL17 shared task.☆47Updated 6 years ago
- Resources for the OpenNMT hackathon☆51Updated 5 years ago
- ☆85Updated 7 years ago
- Large scale sentential paraphrases collection and annotation☆47Updated last year
- Dynamic evaluation for pytorch language models, now includes hyperparameter tuning☆105Updated 6 years ago
- ☆48Updated 6 years ago
- ☆48Updated 7 years ago
- eXtensible Neural Machine Translation☆185Updated 4 years ago
- ☆55Updated 9 years ago
- AskUbuntu Question Dataset☆68Updated 8 years ago
- LSTM Language Model with Subword Units Input Representations☆43Updated 3 years ago
- NLP tools on Lasagne☆61Updated 7 years ago
- Tool for comparison and evaluation of machine translation.☆56Updated 2 years ago
- Deep Character-Level Neural Machine Translation☆72Updated 7 years ago
- Author implementation of "Learning Recurrent Span Representations for Extractive Question Answering" (Lee et al. 2016)☆33Updated 7 years ago
- Implementation of "Controlling Output Length in Neural Encoder-Decoders"☆42Updated 6 years ago
- scripts and configuration files for Edinburgh neural MT submission to WMT 16 shared translation task☆138Updated 4 years ago
- ☆54Updated 7 years ago
- ☆17Updated 7 years ago
- Neural Coref Models☆107Updated 6 years ago
- Hierarchical Encoder Decoder for Dialog Modelling☆96Updated 6 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆123Updated 7 years ago
- Assessing syntactic abilities of BERT☆150Updated 5 years ago
- Code for upcoming TACL paper w/ Graham Neubig, "Neural Lattice Language Models".☆48Updated 6 years ago
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆69Updated 5 years ago