yichen0831 / opencc-python
OpenCC made with Python
☆542 Updated last year
Alternatives and similar repositories for opencc-python:
Users interested in opencc-python are comparing it to the libraries listed below.
- Simple conversion and localization between simplified and traditional Chinese using tables from MediaWiki. ☆529 Updated 9 months ago
- Constants used in Chinese text processing ☆365 Updated last month
- Hanzi Converter for Traditional and Simplified Chinese ☆183 Updated 4 years ago
- Chinese question-answer corpus from the PTT Gossiping board ☆236 Updated 3 months ago
- Some meaningless nscripter tools. ☆677 Updated 4 years ago
- Tutorial on training Chinese word vectors ☆517 Updated 2 years ago
- Uses the C API and SWIG to speed up jieba; an efficient Chinese word segmentation library ☆631 Updated 3 years ago
- CKIP Transformers ☆710 Updated last year
- A 30000+ Chinese MRC dataset - Delta Reading Comprehension Dataset ☆306 Updated 4 years ago
- 📦 Fast conversion between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more) ☆693 Updated 3 weeks ago
- Chinese word segmentation algorithm that requires no corpus ☆498 Updated 4 years ago
- PTT comment generator ☆221 Updated 2 months ago
- WeChat official account corpus ☆574 Updated 6 years ago
- A Chinese sentiment dataset that may be useful for sentiment analysis. ☆230 Updated 8 years ago
- A Python wrapper around the NLPIR/ICTCLAS Chinese segmentation software. ☆573 Updated last month
- Cantonese Linguistics and NLP ☆364 Updated 7 months ago
- Chinese character decomposition dictionary ☆748 Updated 2 years ago
- Comprehensive Chinese dictionary (中華大辭典) ☆114 Updated last year
- Pinyin data for Chinese words ☆462 Updated this week
- Classical Chinese corpus ☆280 Updated 2 years ago
- Chinese stopwords collection ☆133 Updated 4 years ago
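- Similar characters: compares the pronunciation and shape of 6,700 commonly used Chinese characters and outputs lists of similar-sounding and similar-looking characters. ☆446 Updated 9 months ago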
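- Faster and more effective Chinese new-word discovery ☆512 Updated 10 months ago
- MONPA (罔拍) is a multi-task model that provides Traditional Chinese word segmentation, part-of-speech tagging, and named entity recognition ☆245 Updated 2 years ago
- dgk_lost_conv: Chinese conversation corpus ☆1,090 Updated 3 years ago
- Python 3 implementation of new-word discovery using mutual information and left/right entropy ☆588 Updated 5 years ago
- xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text-to-pinyin conversion, text summarization, radical lookup, sentence representation, and text similarity computation ☆1,262 Updated 2 years ago
- Python 3 version of Time-NLP: conversion of Chinese time expressions ☆515 Updated 2 years ago
- A word similarity method that combines the extended Tongyici Cilin (同义词词林扩展版) with HowNet, offering broader vocabulary coverage and more accurate results. ☆725 Updated 2 years ago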
- CKIP CoreNLP Toolkits ☆118 Updated last year