lmorgadodacosta / CantoneseWN
The Cantonese Wordnet
☆14Updated last year
Alternatives and similar repositories for CantoneseWN:
Users that are interested in CantoneseWN are comparing it to the libraries listed below
- A frequency lexicon for Hong Kong Cantonese☆21Updated 4 years ago
- 粵文語料篩選器 Cantonese text filter☆38Updated 3 weeks ago
- Spoken Cantonese from Hong Kong.☆29Updated 5 months ago
- fastText vectors created from Hong Kong data.☆21Updated 4 years ago
- rime-cantonese 上游詞表倉庫☆27Updated 7 months ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆36Updated 4 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆35Updated 2 months ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 2 years ago
- 《香港二十世紀中期粵語語料庫》打包器☆16Updated 9 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Updated 4 years ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Updated 2 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆15Updated 2 years ago
- Global ASP - African Storybook Project for the World☆14Updated 2 weeks ago
- The zhong [|] Chinese grammars☆14Updated 3 years ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Updated 3 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆21Updated 2 years ago
- Extraction de LExique par Variation d'Entropie - Lexicon extraction based on the variation of entropy☆14Updated 4 years ago
- Chinese Wordnet v.2☆22Updated 8 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- Multilingual grapheme-to-phoneme conversion☆20Updated 7 years ago
- Bilingual sengence aligner☆27Updated last year
- Transformers for Cantonese☆56Updated 4 years ago
- Cantonese Romanization Converter☆16Updated 4 years ago
- universal syllabification algorithms☆44Updated 2 years ago
- English web corpus with 4M tokens and several annotation types☆26Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 4 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆20Updated last year
- American English Pronunciation Dictionary☆34Updated 7 years ago
- 粵語拼音轉換表☆33Updated last week