UniversalDependencies / UD_Cantonese-HKLinks
Spoken Cantonese from Hong Kong.
☆29Updated 3 weeks ago
Alternatives and similar repositories for UD_Cantonese-HK
Users that are interested in UD_Cantonese-HK are comparing it to the libraries listed below
Sorting:
- Spoken mandarin Chinese from Hong Kong.☆12Updated 3 weeks ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 3 years ago
- Transformers for Cantonese☆57Updated 4 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆37Updated 4 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆62Updated 3 years ago
- 粵文語料篩選器 Cantonese text filter☆40Updated 2 months ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 6 years ago
- XenC: open-source data selection tool for NLP☆64Updated 9 years ago
- Links to data used in Sproat & Jaitly (https://arxiv.org/abs/1611.00068) experiments.☆76Updated 3 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Updated 4 years ago
- RNNs for Text Normalization☆39Updated 7 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆92Updated 3 years ago
- ☆49Updated 3 years ago
- Chinese Wordnet v.2☆22Updated 8 years ago
- Efficient Markov Chain word alignment☆53Updated 3 years ago
- ☆42Updated 6 years ago
- Unsupervised spoken sentence embeddings☆14Updated 2 years ago
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…☆24Updated 2 years ago
- Grammarly Corpus of Discourse Coherence and accompanying code for discourse coherence models☆18Updated 5 years ago
- Punctuation restoration in ASR text☆33Updated 5 years ago
- Corpus preprocessing☆97Updated last year
- The Fisher and CALLHOME Spanish–English Speech Translation Corpus☆40Updated 3 years ago
- Code for cross-sentence grammatical error correction using multilayer convolutional seq2seq models (ACL 2019)☆50Updated 5 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…☆61Updated 5 years ago
- Corpus of Annotations for Misspelings☆26Updated last year
- ☆22Updated 5 years ago
- ☆31Updated 7 years ago
- A collection of basic python modules for spoken natural language processing☆56Updated 5 years ago
- Extraction de LExique par Variation d'Entropie - Lexicon extraction based on the variation of entropy☆14Updated 4 years ago