Synkied / hanzipyLinks
Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a framework for Chinese language learners to explore Chinese.
☆27Updated 5 months ago
Alternatives and similar repositories for hanzipy
Users that are interested in hanzipy are comparing it to the libraries listed below
Sorting:
- A modern, interlingual wordnet interface for Python☆278Updated last week
- This packages up data for the Open Multilingual Wordnet☆59Updated last week
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆43Updated last year
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆54Updated 4 months ago
- Han character library for CJKV languages☆165Updated 4 years ago
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated last year
- Multilingual sentence alignment using sentence embeddings☆138Updated last year
- Unicode-only CJKV IDS data☆13Updated last year
- Sentence aligner☆123Updated 4 years ago
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆225Updated last month
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆94Updated last week
- uncover old chinese textual parallels based on sound☆15Updated last month
- Linguistically analyzed Classical Tibetan texts☆28Updated 4 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆107Updated 2 months ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆56Updated 2 months ago
- CCL 2023 古汉语通假字语料库的构建及应用研究:通假字资源库☆20Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆34Updated 6 months ago
- A list of vocabulary lists☆22Updated 5 years ago
- IDS data for CJK Unified Ideographs☆478Updated 2 years ago
- The World Atlas of Language Structures☆73Updated last year
- 粵文語料篩選器 Cantonese text filter☆41Updated 9 months ago
- 《国际中文教育中文水平等级标准》 查询系统 Query System of Chinese Proficiency Grading Standards for International Chinese Language Education, New HSK Levels …☆40Updated 2 months ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆69Updated 2 months ago
- ☆34Updated 2 years ago
- Spoken Cantonese from Hong Kong.☆30Updated 2 months ago
- OpusFilter - Parallel corpus processing toolkit☆115Updated this week
- Raw text of 申報☆27Updated 4 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆69Updated 3 months ago
- Collaborative data curation for Glottolog☆183Updated 3 weeks ago
- Chinese (Simplified/Traditional) and Japanese Kanji handwriting input method. Convolutional neural network (CNN) using Tensorflow/Keras u…☆14Updated last year