Synkied / hanzipy
Hanzipy is a Python module for Chinese character analysis and Chinese-language NLP. It is written primarily as a framework that helps learners of Chinese explore the language.
☆27 Updated 4 months ago
Alternatives and similar repositories for hanzipy
Users who are interested in hanzipy are comparing it to the libraries listed below.
- Han character library for CJKV languages ☆164 Updated 4 years ago
- Multilingual sentence alignment using sentence embeddings ☆131 Updated last year
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆108 Updated 2 weeks ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages ☆52 Updated 3 months ago
- A modern, interlingual wordnet interface for Python ☆276 Updated this week
- Sentence aligner ☆121 Updated 4 years ago
- Uncover old Chinese textual parallels based on sound ☆15 Updated 2 weeks ago
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries. ☆214 Updated last year
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars ☆41 Updated last year
- Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) ☆34 Updated last week
- ☆29 Updated last month
- Ideographic Description Sequence Checker Tools ☆25 Updated 8 years ago
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary ☆94 Updated this week
- ☆49 Updated last year
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese ☆67 Updated 2 months ago
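- CCL 2023, "Construction and Application of a Corpus of Phonetic Loan Characters (通假字) in Ancient Chinese": loan character resource repository ☆19 Updated 2 years ago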
- Spoken Cantonese from Hong Kong. ☆30 Updated last month
- This packages up data for the Open Multilingual Wordnet ☆58 Updated 6 months ago
- OpusFilter - Parallel corpus processing toolkit ☆113 Updated this week
- Chinese Notes: A digital library for classical and historic Chinese texts with built in dictionary and reader ☆26 Updated last week
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http… ☆24 Updated 8 years ago
- Wiktra - Python tool wrapping the Wiktionary transliteration modules for 514 languages and 102 different scripts (orthographies) ☆31 Updated 5 months ago
- A list of vocabulary lists ☆22 Updated 5 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese ☆14 Updated 5 months ago
- ☆32 Updated 2 years ago
- IDS data for CJK Unified Ideographs ☆471 Updated 2 years ago
- ☆19 Updated 4 years ago
- Open Language Profiles: English profile datasets from CEFR-J ☆155 Updated 5 years ago
- 粵文語料篩選器 Cantonese text filter ☆41 Updated 8 months ago
- 🇨🇳 Open source Chinese HSK vocabulary list with example sentences ☆45 Updated 6 years ago