UniversalDependencies / UD_Chinese-HK
Spoken mandarin Chinese from Hong Kong.
☆12Updated 4 months ago
Alternatives and similar repositories for UD_Chinese-HK:
Users that are interested in UD_Chinese-HK are comparing it to the libraries listed below
- Spoken Cantonese from Hong Kong.☆29Updated 4 months ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 2 years ago
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…☆24Updated 2 years ago
- ☆30Updated 6 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆59Updated 3 years ago
- Source code for the paper "Morphological Inflection Generation with Hard Monotonic Attention"☆37Updated 6 years ago
- Deep Learning systems for training and testing disfluency detection and related tasks on speech data.☆58Updated 6 years ago
- ☆49Updated 3 years ago
- ☆92Updated 4 months ago
- RNNs for Text Normalization☆38Updated 7 years ago
- Efficient Markov Chain word alignment☆53Updated 3 years ago
- Python library for n-gram models in ARPA format☆40Updated 2 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 3 years ago
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆59Updated 4 years ago
- ☆42Updated 6 years ago
- Cynical data selection☆20Updated 4 years ago
- ☆18Updated 7 years ago
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.☆33Updated 6 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Updated 7 years ago
- Language modeling scripts based on TensorFlow☆58Updated 5 years ago
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆72Updated 5 years ago
- The Berkeley Word Aligner☆22Updated 8 years ago
- YiSi: A Semantic Machine Translation Evaluation Metric for Evaluating Languages with Different Levels of Available Resources☆25Updated 5 years ago
- XenC: open-source data selection tool for NLP☆63Updated 8 years ago
- A collection of basic python modules for spoken natural language processing☆56Updated 5 years ago
- A simple n-gram language model.☆10Updated 6 years ago
- Python binding for SRI Language Modeling Toolkit implemented in Cython☆29Updated 3 years ago
- Multilingual grapheme-to-phoneme conversion☆20Updated 7 years ago
- ☆16Updated 7 years ago
- Scorer for grammatical error correction systems.☆14Updated 9 years ago