Synkied / hanzipyLinks
Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a framework for Chinese language learners to explore Chinese.
☆21Updated this week
Alternatives and similar repositories for hanzipy
Users that are interested in hanzipy are comparing it to the libraries listed below
Sorting:
- Han character library for CJKV languages☆159Updated 4 years ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆52Updated 3 months ago
- Multilingual sentence alignment using sentence embeddings☆120Updated 8 months ago
- ☆28Updated last year
- A modern, interlingual wordnet interface for Python☆254Updated last week
- This packages up data for the Open Multilingual Wordnet☆50Updated last month
- Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"☆16Updated 2 years ago
- OpusFilter - Parallel corpus processing toolkit☆105Updated 2 weeks ago
- A frequency lexicon for Hong Kong Cantonese☆22Updated 4 years ago
- TUFS Asian Language Parallel Corpus☆50Updated 2 years ago
- 粵文語料篩選器 Cantonese text filter☆40Updated 3 months ago
- A list of vocabulary lists☆21Updated 5 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated 2 weeks ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆92Updated this week
- Sentence aligner☆115Updated 4 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆32Updated 3 weeks ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆103Updated last month
- MAMMOTH: MAssively Multilingual Modular Open Translation @ Helsinki☆23Updated 2 weeks ago
- <u><a href="https://circse.github.io/LT4HALA/" style="color: white">Workshop on Language Technologies for Historical and Ancient Language…☆34Updated last year
- ☆74Updated 3 months ago
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆38Updated 8 months ago
- These are lists for a variety of languages containing words that are distinctive to each language.☆38Updated 3 years ago
- Improved Sentence Alignment in Linear Time and Space☆175Updated 2 years ago
- ☆28Updated last month
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆15Updated 3 months ago
- A accurate multilingual word aligner based on LaBSE☆21Updated last year
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated last year
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- Correction of spaces with character-based neural language models.☆13Updated 2 years ago