domerin0 / opensubtitles-parserLinks
download, extract, parse and tokenize the opensubtitles dataset with this script
☆44Updated 7 years ago
Alternatives and similar repositories for opensubtitles-parser
Users that are interested in opensubtitles-parser are comparing it to the libraries listed below
Sorting:
- An updated version of the Parser-v1 repo, used for Stanford's submission in the CoNLL17 shared task.☆47Updated 6 years ago
- ☆88Updated 8 years ago
- ☆54Updated 9 years ago
- Dynamic evaluation for pytorch language models, now includes hyperparameter tuning☆104Updated 7 years ago
- Reproduction instructions for "Rapid Adaptation of Neural Machine Translation to New Languages"☆39Updated 6 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Updated 2 years ago
- scripts and configuration files for Edinburgh neural MT submission to WMT 16 shared translation task☆138Updated 4 years ago
- ☆56Updated 6 years ago
- ☆18Updated 7 years ago
- ☆47Updated 8 years ago
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆73Updated 6 years ago
- eXtensible Neural Machine Translation☆186Updated 5 years ago
- Resources for the OpenNMT hackathon☆51Updated 6 years ago
- Implementation of "Controlling Output Length in Neural Encoder-Decoders"☆42Updated 7 years ago
- Decomposable Attention Model for Sentence Pair Classification (from https://arxiv.org/abs/1606.01933)☆95Updated 8 years ago
- Easy to use scripts for evaluating word vectors on a variety of tasks.☆118Updated 4 years ago
- Non-autoregressive Neural Machine Translation (not a full version)☆71Updated 2 years ago
- Large scale sentential paraphrases collection and annotation☆46Updated 2 years ago
- SemEval-2018 Task 12: The Argument Reasoning Comprehension Task☆31Updated 7 years ago
- Deep Character-Level Neural Machine Translation☆71Updated 8 years ago
- Code for the collection and analysis of the MTNT dataset☆55Updated 6 years ago
- DSTC6: End-to-End Conversation Modeling Track☆56Updated 7 years ago
- TheanoLM is a recurrent neural network language modeling tool implemented using Theano☆81Updated 11 months ago
- ☆134Updated 7 years ago
- Neural Coref Models☆107Updated 6 years ago
- modlm: A toolkit for mixture of distributions language models☆27Updated 7 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆123Updated 8 years ago
- An extremely simple Python wrapper for the SRI Language Modeling toolkit☆70Updated 10 years ago
- Graph-based Dependency Parser☆46Updated 9 years ago
- Tool for comparison and evaluation of machine translation.☆56Updated 3 years ago