Synkied / hanzipyLinks
Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a framework for Chinese language learners to explore Chinese.
☆23Updated 2 weeks ago
Alternatives and similar repositories for hanzipy
Users that are interested in hanzipy are comparing it to the libraries listed below
Sorting:
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆52Updated 2 weeks ago
- A modern, interlingual wordnet interface for Python☆257Updated last month
- This packages up data for the Open Multilingual Wordnet☆52Updated 2 months ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆92Updated this week
- Sentence aligner☆116Updated 4 years ago
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆38Updated 10 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆106Updated this week
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated 2 months ago
- Multilingual sentence alignment using sentence embeddings☆122Updated 9 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- ☆30Updated last year
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆24Updated 8 years ago
- OpusFilter - Parallel corpus processing toolkit☆109Updated 3 weeks ago
- A list of vocabulary lists☆22Updated 5 years ago
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated last year
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆13Updated last month
- ☆28Updated 3 months ago
- several algorithms for converting dependency structures into constituency structures.☆10Updated 3 years ago
- ☆74Updated last week
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- Chinese (Simplified/Traditional) and Japanese Kanji handwriting input method. Convolutional neural network (CNN) using Tensorflow/Keras u…☆14Updated 9 months ago
- Gather modern English word frequencies from all enwiki articles.☆222Updated last year
- Unicode-only CJKV IDS data☆12Updated last year
- uncover old chinese textual parallels based on sound☆14Updated 9 months ago
- Open Language Profiles — English profile datasets from CEFR-J☆145Updated 5 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆15Updated 5 months ago
- Export UNIHAN's database to csv, json or yaml☆59Updated this week
- TUFS Asian Language Parallel Corpus☆51Updated 2 years ago
- <u><a href="https://circse.github.io/LT4HALA/" style="color: white">Workshop on Language Technologies for Historical and Ancient Language…☆33Updated last year
- These are lists for a variety of languages containing words that are distinctive to each language.☆38Updated 3 years ago