CanCLID / canto-filter
粵文語料篩選器 Cantonese text filter
☆41 · Updated 6 months ago
Alternatives and similar repositories for canto-filter
Users who are interested in canto-filter are comparing it to the repositories listed below.
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP ☆93 · Updated 4 years ago
- An English-to-Cantonese machine translation model ☆52 · Updated 6 months ago
- A frequency lexicon for Hong Kong Cantonese ☆23 · Updated 5 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue). ☆71 · Updated last year
- Transformers for Cantonese ☆57 · Updated 5 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese ☆38 · Updated 4 years ago
- A list and compilation of corpora for Taiwanese, indigenous languages, and Hakka ☆44 · Updated 5 years ago
- Data files for the Dictionary of Frequently-Used Taiwan Minnan (臺灣閩南語常用詞辭典) ☆80 · Updated 2 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese ☆23 · Updated 2 years ago
- Upstream word-list repository for rime-cantonese ☆30 · Updated last year
- Spoken Cantonese from Hong Kong. ☆30 · Updated last month
- Cantonese Linguistics and NLP ☆392 · Updated last year
- Cantonese segmentation tool 粵語分詞工具 ☆30 · Updated 5 years ago
- Annotation website for 台灣媠聲 ☆14 · Updated 3 weeks ago
- cantonese-mandarin unsupervised neural translation for sw project ☆27 · Updated 2 years ago
- Taiwan Language Service (臺灣言語服務) ☆46 · Updated 6 years ago
- ☆10 · Updated 3 years ago
- A toolset for computation and comparison of Chinese dialects ☆41 · Updated 2 weeks ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition) ☆21 · Updated 2 years ago
- fastText vectors created from Hong Kong data. ☆22 · Updated 5 years ago
- Tools for processing open Cantonese dictionary data provided by words.hk ☆22 · Updated 8 months ago
- one script for xls-r/xlsr/whisper fine-tuning ☆42 · Updated 2 years ago
- Conversion table for Minnan (Southern Min) romanization systems ☆21 · Updated 9 years ago
- Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models" ☆17 · Updated 2 years ago
- 電腦用漢字粵語拼音表 / Cantonese Pronunciation List of the Characters for Computers ☆58 · Updated last year
- The Cantonese Wordnet ☆14 · Updated last year
- Cantonese dialogue corpus ☆29 · Updated 2 years ago
- Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus ☆12 · Updated 3 years ago
- ☆92 · Updated last year
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project ☆15 · Updated 2 years ago