windsuzu / Joint-Semantic-Phonetic-Embedding
We use phonetics as a feature to create a joint semantic-phonetic embedding and improve the neural machine translation between Chinese and Japanese. π₯³
β11Updated 3 years ago
Alternatives and similar repositories for Joint-Semantic-Phonetic-Embedding:
Users that are interested in Joint-Semantic-Phonetic-Embedding are comparing it to the libraries listed below
- The baseline model code for WMT 2021 Triangular MTβ13Updated 3 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASETβ59Updated 2 years ago
- machine translation and quality estimationβ34Updated 6 years ago
- β42Updated 6 years ago
- MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.β149Updated 2 years ago
- Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)β65Updated 5 years ago
- β119Updated 4 years ago
- A PyTorch implementation of "Reaching Human-level Performance in Automatic Grammatical Error Correction: An Empirical Study"β50Updated 6 years ago
- Terminology Datasetβ23Updated 4 years ago
- β32Updated 2 years ago
- Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)β68Updated 5 years ago
- XenC: open-source data selection tool for NLPβ60Updated 8 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformerβ39Updated 4 years ago
- A grammatical error correction reading list maintained by Beijing Language and Culture University Natural Language Processing Groupβ24Updated 4 years ago
- Models, system configurations and outputs of our winning GEC systems in the BEA 2019 shared task described in R. Grundkiewicz, M. Junczysβ¦β50Updated 5 years ago
- Code for cross-sentence grammatical error correction using multilayer convolutional seq2seq models (ACL 2019)β50Updated 4 years ago
- Re-rank n-best lists using additional features.β28Updated 6 years ago
- NMT for chinese-english using tensor2tensorβ47Updated 7 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scalβ¦β81Updated 3 years ago
- Cynical data selectionβ20Updated 4 years ago
- Explicit Sentence Compression for Neural Machine Translationβ10Updated 4 years ago
- Framework for neural-based Quality Estimationβ42Updated 4 years ago
- Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018β121Updated 4 years ago
- Prior Knowledge Integration for Neural Machine Translation using Posterior Regularizationβ11Updated 6 years ago
- β14Updated 9 years ago
- Scripts to preprocess training and test data and to run fast_align and gizaβ108Updated 3 years ago
- My implementation of LASER architecture in Fairseqβ12Updated 4 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and Englishβ40Updated 2 years ago
- Code and data for the NAACL 2019 paper "Improving Cross-Domain Chinese Word Segmentation with Word Embeddings"β10Updated 5 years ago
- Neural quality estimation toolkit for grammatical error correction and other language generation applications.β49Updated 5 years ago