groverjeenu / Bilingual-Word-Embeddings-with-Bucketed-CNN-for-Parallel-Sentence-Extraction
Code for our paper in ACL 2017
☆14Updated 7 years ago
Alternatives and similar repositories for Bilingual-Word-Embeddings-with-Bucketed-CNN-for-Parallel-Sentence-Extraction:
Users that are interested in Bilingual-Word-Embeddings-with-Bucketed-CNN-for-Parallel-Sentence-Extraction are comparing it to the libraries listed below
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.☆33Updated 6 years ago
- ☆42Updated 6 years ago
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆69Updated 5 years ago
- Decoding platform for machine translation research☆54Updated 5 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆72Updated 9 years ago
- ☆33Updated 3 years ago
- Sume is an implementation of the concept-based ILP model for summarization.☆38Updated 6 years ago
- Tool for comparison and evaluation of machine translation.☆56Updated 2 years ago
- TER-plus Machine Translation metric.☆31Updated 2 years ago
- A statistical machine translation (SMT)-based grammatical error correction system that makes use of neural network joint models (NNJM) an…☆25Updated 6 years ago
- LSTM Language Model with Subword Units Input Representations☆42Updated 3 years ago
- Large scale sentential paraphrases collection and annotation☆46Updated 2 years ago
- Efficient Markov Chain word alignment☆53Updated 3 years ago
- Named Entity Disambiguation for Noisy Text☆66Updated 7 years ago
- ☆34Updated 8 years ago
- Java code from the 2008 EMNLP paper "Bayesian Unsupervised Topic Segmentation" by Eisenstein and Barzilay☆36Updated 9 years ago
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- Source code for the paper "Morphological Inflection Generation with Hard Monotonic Attention"☆38Updated 6 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 3 years ago
- Multilingual hierarchical attention networks toolkit☆77Updated 5 years ago
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University☆34Updated 5 years ago
- takahe is a multi-sentence compression module☆54Updated 3 years ago
- ☆14Updated 8 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆76Updated last year
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation☆63Updated 6 years ago
- ☆43Updated 9 years ago
- Efficient Low-Memory Aligner☆140Updated 2 weeks ago
- Event Time Extraction with a Decision Tree of Neural Classifiers☆18Updated 5 years ago
- NMT for chinese-english using tensor2tensor☆47Updated 7 years ago
- In this project, we use skip-gram model to embed Wikipedia Concepts and Entities. The English version of Wikipedia contains more than fiv…☆56Updated 7 years ago