Synkied / hanzipy
Hanzipy is a Chinese character and NLP module for Python. It is written primarily to give Chinese language learners a framework for exploring Chinese.
☆25Updated 2 months ago
Alternatives and similar repositories for hanzipy
Users who are interested in hanzipy compare it to the libraries listed below.
- Unicode-only CJKV IDS data☆13Updated last year
- Han character library for CJKV languages☆163Updated 4 years ago
- Ideographic Description Sequence Checker Tools☆25Updated 8 years ago
- Sentence aligner☆118Updated 4 years ago
- Multilingual sentence alignment using sentence embeddings☆127Updated 11 months ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆93Updated last week
- OpusFilter - Parallel corpus processing toolkit☆110Updated last month
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆202Updated last year
- ☆78Updated 2 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo …☆108Updated last week
- A frequency lexicon for Hong Kong Cantonese☆23Updated 5 years ago
- 粵文語料篩選器 Cantonese text filter☆41Updated 7 months ago
- A modern, interlingual wordnet interface for Python☆267Updated last month
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆52Updated 2 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆51Updated 2 years ago
- IDS data for CJK Unified Ideographs☆462Updated 2 years ago
- This packages up data for the Open Multilingual Wordnet☆55Updated 5 months ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆49Updated 5 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆33Updated last month
- Spoken Cantonese from Hong Kong.☆30Updated last month
- Machine-Translation-based sentence alignment tool for parallel text☆313Updated 4 years ago
- Workshop on Language Technologies for Historical and Ancient Language… (https://circse.github.io/LT4HALA/)☆34Updated last year
- ☆29Updated last month
- Improved Sentence Alignment in Linear Time and Space☆184Updated 2 years ago
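- CCL 2023: Construction and application of a corpus of interchangeable characters (通假字) in Classical Chinese: interchangeable-character resource database☆18Updated 2 years ago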
- HSK 3.0 Vocabulary Lists (words and characters)☆89Updated last year
- A web application that interfaces two GEC systems. [web instance is down]☆32Updated last year
- The World Atlas of Language Structures☆67Updated last year
- 《国际中文教育中文水平等级标准》 查询系统 Query System of Chinese Proficiency Grading Standards for International Chinese Language Education, New HSK Levels …☆38Updated this week
- Find Chinese sentences based on your known vocabulary and other rules☆64Updated last year