Synkied / hanzipy
Hanzipy is a Chinese character and NLP module for Chinese language processing in Python. It is written primarily to give Chinese language learners a framework for exploring Chinese.
☆27Updated 6 months ago
Alternatives and similar repositories for hanzipy
Users who are interested in hanzipy are comparing it to the libraries listed below.
- Multilingual sentence alignment using sentence embeddings☆139Updated last year
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆93Updated this week
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆43Updated last year
- Han character library for CJKV languages☆165Updated 4 years ago
- Sentence aligner☆124Updated 4 years ago
- This packages up data for the Open Multilingual Wordnet☆60Updated last week
- ☆29Updated 3 months ago
- OpusFilter - Parallel corpus processing toolkit☆115Updated this week
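- CCL 2023, research on the construction and application of a corpus of phonetic loan characters (通假字) in Classical Chinese: phonetic loan character resource library☆20Updated 2 years ago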
- Unicode-only CJKV IDS data☆13Updated last year
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆12Updated last year
- ☆34Updated 2 years ago
- Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"☆21Updated 2 years ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆54Updated 3 weeks ago
- ☆81Updated 2 weeks ago
- A list of vocabulary lists☆22Updated 5 years ago
- Chinese dictionaries (中文词典 / 中文詞典): Chinese and Chinese-English.☆230Updated last month
- A modern, interlingual wordnet interface for Python☆282Updated last week
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆69Updated 4 months ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆26Updated 8 years ago
- Improved Sentence Alignment in Linear Time and Space☆188Updated 2 years ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆56Updated 2 months ago
- Uncover Old Chinese textual parallels based on sound☆15Updated last week
- Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)☆35Updated last week
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆54Updated 2 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆107Updated 2 months ago
- Linguistically analyzed Classical Tibetan texts☆28Updated 4 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆14Updated last month
- ☆19Updated 4 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆31Updated 5 years ago