zachary822 / chinese-converterLinks
Converts between traditional and simplified Chinese
☆31Updated 8 months ago
Alternatives and similar repositories for chinese-converter
Users that are interested in chinese-converter are comparing it to the libraries listed below
Sorting:
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆37Updated 3 weeks ago
- Python module that identifies Chinese text as being Simplified or Traditional☆93Updated 6 months ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆61Updated last year
- Cantonese segmentation tool 粵語分詞工具☆30Updated 4 years ago
- pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation☆61Updated last week
- Multilingual sentence alignment using sentence embeddings☆118Updated 7 months ago
- 中文繁体和简体字符对照表☆45Updated 4 months ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆21Updated 2 years ago
- Identification and conversion functions for Chinese text processing☆59Updated 6 months ago
- An English-to-Cantonese machine translation model☆51Updated 2 months ago
- 古文现代文翻译平行语料库☆105Updated 3 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆63Updated last month
- 使用 pinyin-data 和 phrase-pinyin-data 中的拼音数据文件覆盖 pypinyin 中的内置拼音数据☆59Updated 4 months ago
- super fast cpp implementation of longest common subsequence/substring☆68Updated last year
- repo for Tibetan corpora☆21Updated 2 years ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆69Updated 2 years ago
- Constants used in Chinese text processing☆371Updated 5 months ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆52Updated last year
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 3 years ago
- Hanzi Converter for Traditional and Simplified Chinese☆188Updated 5 years ago
- convert epub file to txt☆88Updated 5 years ago
- 渊 - A project for Classical Chinese☆104Updated 3 years ago
- English loanwords in Japanese☆17Updated 7 months ago
- Converting Chinese number string <=> int/float/str☆19Updated last month
- 《现代汉语大词典》字词头☆26Updated 4 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆92Updated 3 years ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆91Updated this week
- 粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool☆69Updated 8 months ago
- 汉字数据集,包括汉字的相关信息,例如笔画数、部首、拼音、英文释义/同义词等。☆118Updated 4 years ago
- 近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言☆162Updated 3 months ago