windsuzu / Joint-Semantic-Phonetic-Embedding
We use phonetics as a feature to create a joint semantic-phonetic embedding and improve the neural machine translation between Chinese and Japanese. 🥳
☆11Updated 3 years ago
Alternatives and similar repositories for Joint-Semantic-Phonetic-Embedding:
Users that are interested in Joint-Semantic-Phonetic-Embedding are comparing it to the libraries listed below
- The baseline model code for WMT 2021 Triangular MT☆13Updated 3 years ago
- ☆42Updated 6 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆46Updated 6 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆59Updated 2 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 3 years ago
- Models, system configurations and outputs of our winning GEC systems in the BEA 2019 shared task described in R. Grundkiewicz, M. Junczys…☆50Updated 5 years ago
- Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)☆65Updated 5 years ago
- Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)☆68Updated 5 years ago
- Explicit Sentence Compression for Neural Machine Translation☆10Updated 4 years ago
- Data Augmentation for Neural Machine Translation☆31Updated 7 years ago
- ☆31Updated 2 years ago
- CamRest676 is an English data set, I translate it into Chinese for training nlu.☆12Updated 7 years ago
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆69Updated 5 years ago
- A grammatical error correction reading list maintained by Beijing Language and Culture University Natural Language Processing Group☆24Updated 4 years ago
- Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018☆121Updated 4 years ago
- Thin wrapper for the AllenNLP's implementation of supervised open information extraction☆17Updated 5 years ago
- The implementation of "Does Multi-Encoder Help? A Case Study on Context-AwareNeural Machine Translation"☆40Updated 4 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆39Updated 4 years ago
- Use bert to predict punctuation on IWSLT2012 and The People's Daily 2014☆65Updated 4 years ago
- ☆30Updated 6 years ago
- Fine tuning bert for text generation☆37Updated 5 years ago
- Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset"☆22Updated 2 years ago
- MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.☆149Updated 2 years ago
- A PyTorch implementation of "Reaching Human-level Performance in Automatic Grammatical Error Correction: An Empirical Study"☆50Updated 6 years ago
- The repository for the paper: Rethinking Document-level Neural Machine Translation☆25Updated 2 years ago
- Neural quality estimation toolkit for grammatical error correction and other language generation applications.☆49Updated 5 years ago
- XenC: open-source data selection tool for NLP☆60Updated 8 years ago
- Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437☆24Updated 4 years ago
- Efficient Markov Chain word alignment☆53Updated 3 years ago
- Scripts to preprocess training and test data and to run fast_align and giza☆108Updated 3 years ago