domerin0 / opensubtitles-parser
Download, extract, parse, and tokenize the OpenSubtitles dataset with this script
☆44Updated 7 years ago
Alternatives and similar repositories for opensubtitles-parser
Users that are interested in opensubtitles-parser are comparing it to the libraries listed below
- scripts and configuration files for Edinburgh neural MT submission to WMT 16 shared translation task☆138Updated 4 years ago
- Recurrent neural network grammars☆190Updated 7 years ago
- ☆55Updated 10 years ago
- eXtensible Neural Machine Translation☆187Updated 5 years ago
- A repository linking to publicly available dialog datasets. Feel free to send pull requests.☆69Updated 3 years ago
- ☆88Updated 8 years ago
- Resources for the OpenNMT hackathon☆51Updated 6 years ago
- Tool for comparison and evaluation of machine translation.☆56Updated 3 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆125Updated 8 years ago
- Easy to use scripts for evaluating word vectors on a variety of tasks.☆119Updated 4 years ago
- ☆18Updated 7 years ago
- ☆56Updated 7 years ago
- Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better …☆205Updated 2 years ago
- DRESS simplification model (EMNLP 2017) described in http://aclweb.org/anthology/D/D17/D17-1062.pdf☆155Updated 3 years ago
- Attention-based NMT with Coverage, Context Gate, and Reconstruction☆95Updated 4 years ago
- ☆181Updated 7 years ago
- Neural machine translation soft alignment visualisations for web and command line☆72Updated 4 years ago
- An updated version of the Parser-v1 repo, used for Stanford's submission in the CoNLL17 shared task.☆47Updated 7 years ago
- Dynamic evaluation for pytorch language models, now includes hyperparameter tuning☆104Updated 7 years ago
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆73Updated 6 years ago
- A series of scripts to download and parse the OpenSubtitles corpus.☆86Updated 9 years ago
- A list of Neural MT implementations☆363Updated 3 years ago
- Attention-based NMT with a coverage mechanism to indicate whether a source word is translated or not☆111Updated 5 years ago
- TheanoLM is a recurrent neural network language modeling tool implemented using Theano☆81Updated last year
- SemCor and Masc documents annotated with NOAD word senses.☆184Updated 5 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Updated 2 years ago
- Dynamic data selection for neural machine translation☆20Updated 7 years ago
- LSTM Language Model with Subword Units Input Representations☆42Updated 4 years ago
- Corpus preprocessing☆99Updated last year
- Neural network sequence labeling model☆250Updated 6 years ago