zhangxiangnick / wordvec-aligned-en-zh
Aligned bilingual word vectors for English and Chinese
☆11Updated 6 years ago
Related projects:
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 2 years ago
- Repository for Tibetan corpora☆21Updated last year
- Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018☆119Updated 4 years ago
- 😎 Curated list of Tibetan NLP projects☆33Updated 4 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…☆62Updated 4 years ago
- ✒️ དག་བྱེད། Dakje, improving your spelling and readability☆11Updated 2 years ago
- Deep Learning systems for training and testing disfluency detection and related tasks on speech data.☆57Updated 5 years ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data.☆49Updated 5 years ago
- Neural machine translation soft alignment visualisations for web and command line☆72Updated 3 years ago
- A collection of basic python modules for spoken natural language processing☆56Updated 4 years ago
- We use phonetics as a feature to create a joint semantic-phonetic embedding and improve the neural machine translation between Chinese an…☆11Updated 3 years ago
- Punctuation restoration in ASR text☆33Updated 5 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆74Updated last year
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.☆33Updated 6 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆78Updated last year
- Translation Error Rate (TER)☆43Updated 6 years ago
- Neural quality estimation toolkit for grammatical error correction and other language generation applications.☆49Updated 5 years ago
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…☆24Updated last year
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆40Updated 9 months ago
- Use BERT to predict punctuation on IWSLT2012 and The People's Daily 2014☆65Updated 4 years ago
- RNNs for Text Normalization☆37Updated 6 years ago
- Covering grammars for English and Russian text normalization☆60Updated 5 years ago
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆17Updated last year
- Automatic Essay Scoring☆34Updated 4 years ago
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆35Updated last year
- Linguistically analyzed Classical Tibetan texts☆23Updated 3 years ago
- Corpus preprocessing☆95Updated 6 months ago