yichen0831 / opencc-python
OpenCC made with Python
☆561 Updated last year
Alternatives and similar repositories for opencc-python
Users who are interested in opencc-python compare it to the libraries listed below.
- Simple conversion and localization between simplified and traditional Chinese using tables from MediaWiki. ☆548 Updated last year
- Constants used in Chinese text processing ☆377 Updated 9 months ago
- Chinese question-and-answer corpus from the PTT Gossiping board ☆245 Updated 11 months ago
- Hanzi Converter for Traditional and Simplified Chinese ☆190 Updated 5 years ago
- CKIP Transformers ☆749 Updated 2 years ago
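- Uses the C API and SWIG to speed up jieba, an efficient Chinese word segmentation library ☆638 Updated 4 years ago
- Chinese character decomposition dictionary ☆796 Updated 2 years ago
- 📦 Fast conversion between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more) ☆739 Updated 9 months ago
- Classical Chinese corpus ☆292 Updated 3 years ago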
- A CWN Python binding with graph structure ☆33 Updated 2 years ago
- Cantonese Linguistics and NLP ☆392 Updated last year
- A 30000+ Chinese MRC dataset - Delta Reading Comprehension Dataset ☆314 Updated 5 years ago
- Tutorial on training Chinese word vectors ☆517 Updated 2 years ago
- PTT push-comment generator ☆222 Updated 11 months ago
- CKIP CoreNLP Toolkits ☆125 Updated 2 years ago
- Some meaningless nscripter tools. ☆678 Updated 5 years ago
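- Hanzi decomposition library that breaks Chinese characters down into radicals, for use as glyph features of characters in machine learning | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components… ☆390 Updated 11 months ago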
- A tool for ancient Chinese segmentation. ☆54 Updated 6 years ago
- Chinese stopwords collection ☆138 Updated 5 years ago
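- Traditional and Simplified Chinese lexicon dictionary files ☆107 Updated last year
- Python client for the Academia Sinica Chinese word segmentation system ☆21 Updated 9 years ago
- 中華大辭典 (Great Chinese Dictionary) ☆121 Updated last year
- Training Chinese word vectors with Word2vec. Word2vec was created by a team of researchers led by Tomas Mikolov at Google. ☆59 Updated 2 years ago
- MONPA (罔拍) is a multi-task model that provides Traditional Chinese word segmentation, part-of-speech tagging, and named entity recognition ☆247 Updated 7 months ago
- Pinyin-to-Hanzi conversion, a pinyin input method engine, pin yin -> 拼音 ☆625 Updated 4 months ago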
- CKIP Neural Chinese Word Segmentation, POS Tagging, and NER ☆1,672 Updated 2 months ago
- Python module that identifies Chinese text as being Simplified or Traditional ☆100 Updated 10 months ago
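- Pinyin data for words and phrases ☆500 Updated 2 months ago
- Data files for the Ministry of Education's Revised Mandarin Chinese Dictionary; please report suggestions or bugs at moedict-process ☆145 Updated 2 years ago
- Compares the pronunciation and shape of 6,700 commonly used Chinese characters and outputs lists of similar-sounding and similar-looking characters. # Similar characters ☆466 Updated last year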