lmorgadodacosta / CantoneseWN
The Cantonese Wordnet
☆14Updated last year
Alternatives and similar repositories for CantoneseWN:
Users that are interested in CantoneseWN are comparing it to the libraries listed below
- A frequency lexicon for Hong Kong Cantonese☆21Updated 4 years ago
- rime-cantonese 上游詞表倉庫☆27Updated 7 months ago
- fastText vectors created from Hong Kong data.☆21Updated 4 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆15Updated 2 years ago
- Global ASP - African Storybook Project for the World☆14Updated 4 months ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆35Updated 4 years ago
- 粵文語料篩選器 Cantonese text filter☆38Updated last month
- BERT Tokenizer with vocabulary tailored for Cantonese☆20Updated 2 years ago
- Cantonese segmentation tool 粵語分詞工具☆29Updated 4 years ago
- 《香港二十世紀中期粵語語料庫》打包器☆16Updated 8 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 2 years ago
- cantonese-mandarin unsupervised neural translation for sw project☆26Updated last year
- Spoken Cantonese from Hong Kong.☆29Updated 4 months ago
- ☆17Updated last year
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆87Updated 3 years ago
- 粵語對話語料☆24Updated last year
- Identification and conversion functions for Chinese text processing☆59Updated 4 months ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆20Updated last year
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆17Updated 5 years ago
- An English-to-Cantonese machine translation model☆49Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 4 years ago
- Extract and align grammar patterns from English sentences.☆54Updated 2 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Updated 5 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- Microsoft Speech Language Translation (MSLT) Corpus☆19Updated 7 years ago
- Ideographic Description Sequence Checker Tools☆20Updated 7 years ago
- Transformers for Cantonese☆56Updated 4 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆56Updated last year
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 5 years ago
- explores Chinese language models with sub-character level visual information☆16Updated 6 years ago