znwang25 / fuzzychinese
A small package to fuzzy match chinese words
☆87Updated 2 years ago
Alternatives and similar repositories for fuzzychinese:
Users that are interested in fuzzychinese are comparing it to the libraries listed below
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆37Updated last year
- 粤语分词工具☆46Updated 6 years ago
- 中文分词软件基准测试 | Chinese tokenizer benchmark☆24Updated 6 years ago
- This is a pre-trained LSTM model. This model can help you to segment unpunctuated historical Chinese texts. 這是基於 LSTM 的預訓練模型。此模型可幫助您為漢語古文…☆26Updated 3 years ago
- Chinese Subjective Dectection based on subjective knowlegebase, 中文主观性计算。基于中文主观性知识库的句子主观性评定方法。☆57Updated last year
- 公司、企业名称模糊匹配,基于词频的公司名主体提取,基于编辑距离的匹配度☆41Updated 4 years ago
- Dictionary for Cantonese word segmentation☆35Updated 10 months ago
- Cantonese segmentation tool 粵語分詞工具☆30Updated 4 years ago
- 近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言☆156Updated last month
- Pre-trained ELECTRA from Hong Kong data☆28Updated 4 years ago
- company name parser, extract company name brand. 中文公司名称分词工具,支持公司名称中的地名,品牌名(主词),行业词,公司名后缀提取。☆90Updated 2 years ago
- 人民日报(1946-2003)☆134Updated 6 years ago
- pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation☆58Updated 7 months ago
- 维基百科中文语料整理☆296Updated 7 years ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆137Updated 4 years ago
- ☆124Updated 4 years ago
- 李傲龍的博客