Synkied / hanzipyLinks
Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a framework for Chinese language learners to explore Chinese.
☆23Updated last month
Alternatives and similar repositories for hanzipy
Users that are interested in hanzipy are comparing it to the libraries listed below
Sorting:
- A modern, interlingual wordnet interface for Python☆259Updated 2 weeks ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆52Updated 3 weeks ago
- Han character library for CJKV languages☆162Updated 4 years ago
- This packages up data for the Open Multilingual Wordnet☆53Updated 3 months ago
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆38Updated 11 months ago
- OpusFilter - Parallel corpus processing toolkit☆109Updated last month
- Multilingual sentence alignment using sentence embeddings☆123Updated 10 months ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆92Updated last week
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆107Updated last week
- Unicode-only CJKV IDS data☆12Updated last year
- Sentence aligner☆117Updated 4 years ago
- Spoken Cantonese from Hong Kong.☆30Updated last week
- ☆28Updated this week
- CCL 2023 古汉语通假字语料库的构建及应用研究:通假字资源库☆18Updated 2 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- The World Atlas of Language Structures☆64Updated 11 months ago
- ☆75Updated last month
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆64Updated 10 months ago
- 粵文語料篩選器 Cantonese text filter☆41Updated 5 months ago
- <u><a href="https://circse.github.io/LT4HALA/" style="color: white">Workshop on Language Technologies for Historical and Ancient Language…☆33Updated last year
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆17Updated this week
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆190Updated last year
- ☆31Updated last year
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated last year
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆13Updated 2 months ago
- Linguistically analyzed Classical Tibetan texts☆26Updated 4 years ago
- Gather modern English word frequencies from all enwiki articles.☆222Updated last year
- 🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python☆68Updated 6 months ago
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- Chinese (Simplified/Traditional) and Japanese Kanji handwriting input method. Convolutional neural network (CNN) using Tensorflow/Keras u…☆14Updated 10 months ago