Synkied / hanzipy
Hanzipy is a Python module for Chinese character analysis and Chinese-language NLP. It is written primarily to give Chinese language learners a framework for exploring the language.
☆16Updated 10 months ago
Related projects
Alternatives and complementary repositories for hanzipy
- Code for the paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"☆16Updated last year
- Tokenizer, POS-tagger, and dependency parser with BERT/RoBERTa/DeBERTa models for Japanese and other languages☆48Updated last month
- Chinese (Simplified/Traditional) and Japanese Kanji handwriting input method. Convolutional neural network (CNN) using Tensorflow/Keras u…☆13Updated this week
- Tokenizer, POS-tagger, and dependency parser for Classical Chinese☆13Updated last year
- Linguistically analyzed Classical Tibetan texts☆24Updated 3 years ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆39Updated 4 years ago
- 《国际中文教育中文水平等级标准》 查询系统 Query System of Chinese Proficiency Grading Standards for International Chinese Language Education, New HSK Levels …☆25Updated 7 months ago
- Repo for Tibetan corpora☆21Updated last year
- ☆28Updated 2 weeks ago
- máobĭ (毛笔) is an Anki add-on to create cards with writing quizzes for Hanzi (Chinese characters)☆51Updated last week
- Practice Chinese language grammar☆16Updated 3 years ago
- 🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python☆58Updated 2 months ago
- The Chinese-German dictionary HanDeDict, which was available on the CHDW website until August 2015.☆21Updated this week
- Dictionary of Korean word and IPA pairs crawled from Wiktionary (Korean edition)☆18Updated last year
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆44Updated 7 months ago
- cc-kedict: Creative Commons Korean-English Dictionary☆41Updated 3 years ago
- 🈵 Collected resources to learn/study Manchu (Manchurian language). An introduction to the Manchu language (满语滿族満州語入門).☆11Updated last year
- A frequency lexicon for Hong Kong Cantonese☆20Updated 4 years ago
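- CCL 2023, "Construction and Application of a Corpus of Phonetic Loan Characters (通假字) in Classical Chinese": loan character resource database☆10Updated last year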
- Multilingual sentence alignment using sentence embeddings☆97Updated this week
- Tokenizer, POS-tagger, and dependency parser for Classical Chinese☆62Updated last week
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆82Updated last week
- 🦜 NLP for Tibetan, in Python.☆32Updated last year
- Tokenizer, POS-tagger, and dependency parser for Classical Chinese☆16Updated 2 years ago
- ✒️ དག་བྱེད། Dakje, improving your spelling and readability☆11Updated 2 years ago
- Tokenizer, POS-tagger, and dependency parser for Classical Chinese☆11Updated 5 months ago
- Wiktra - Python tool of Wiktionary transliteration modules for 514 languages and their 102 different scripts (orthographies)☆27Updated 3 years ago
- 粵文語料篩選器 Cantonese text filter☆33Updated 2 months ago
- Chinese (zh-cnm) opendata audio files for 8,596 HSK words and 1,707 syllables.☆44Updated 3 years ago
- Data files for the Dictionary of Frequently-Used Taiwan Minnan (臺灣閩南語常用詞辭典)☆76Updated last year