Synkied / hanzipy
Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a framework for Chinese language learners to explore Chinese.
☆21Updated last year
Alternatives and similar repositories for hanzipy:
Users that are interested in hanzipy are comparing it to the libraries listed below
- CCL 2023 古汉语通假字语料库的构建及应用研究:通假字资源库☆15Updated last year
- ☆28Updated last week
- <u><a href="https://circse.github.io/LT4HALA/" style="color: white">Workshop on Language Technologies for Historical and Ancient Language…☆34Updated 11 months ago
- This packages up data for the Open Multilingual Wordnet☆48Updated last week
- 《国际中文教育中文水平等级标准》 查询系统 Query System of Chinese Proficiency Grading Standards for International Chinese Language Education, New HSK Levels …☆29Updated last year
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆15Updated last month
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆65Updated 5 months ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆50Updated 3 weeks ago
- Linguistically analyzed Classical Tibetan texts☆26Updated 3 years ago
- uncover old chinese textual parallels based on sound☆13Updated 6 months ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆13Updated last year
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆46Updated 4 years ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆12Updated last year
- Han character library for CJKV languages☆157Updated 4 years ago
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆169Updated last year
- Ideographic Description Sequence Checker Tools☆20Updated 7 years ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆24Updated 3 years ago
- The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…☆19Updated 3 years ago
- Scripts to work with Chinese language data☆12Updated 3 years ago
- Chinese (zh-cnm) opendata audio files for 8,596 hsk words and 1,707 syllabs.☆45Updated 4 years ago
- A modern, interlingual wordnet interface for Python☆244Updated this week
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆44Updated 2 years ago
- Find Chinese sentences based on your known vocabulary and other rules☆61Updated last year
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆89Updated last week
- Chinese (Simplified/Traditional) and Japanese Kanji handwriting input method. Convolutional neural network (CNN) using Tensorflow/Keras u…☆14Updated 6 months ago
- Spoken Cantonese from Hong Kong.☆29Updated 5 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆99Updated this week
- 🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python☆67Updated last month
- máobĭ (毛笔) is an Anki add-on to create cards with writing quizzes for Hanzi (Chinese characters)☆53Updated 6 months ago
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆37Updated 6 months ago