Synkied / hanzipy
Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a framework for Chinese language learners to explore Chinese.
☆19Updated last year
Alternatives and similar repositories for hanzipy:
Users that are interested in hanzipy are comparing it to the libraries listed below
- ☆28Updated 4 months ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆44Updated 4 years ago
- Multilingual sentence alignment using sentence embeddings☆110Updated 4 months ago
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆36Updated 4 months ago
- Han character library for CJKV languages☆155Updated 4 years ago
- Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"☆16Updated last year
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆89Updated this week
- 🈵 Collected resources to learn/study Manchu (Manchurian Language). 满语滿族満州語入門。☆13Updated last year
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated 11 months ago
- Raw text of 申報☆25Updated 3 years ago
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆162Updated 10 months ago
- <u><a href="https://circse.github.io/LT4HALA/" style="color: white">Workshop on Language Technologies for Historical and Ancient Language…☆33Updated 9 months ago
- ☆25Updated last year
- Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and …☆9Updated 4 years ago
- 《国际中文教育中文水平等级标准》 查询系统 Query System of Chinese Proficiency Grading Standards for International Chinese Language Education, New HSK Levels …☆28Updated 11 months ago
- Spoken Cantonese from Hong Kong.☆29Updated 4 months ago
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆51Updated 2 months ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆13Updated last year
- A list of vocabulary lists☆21Updated 4 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆66Updated 3 months ago
- uncover old chinese textual parallels based on sound☆13Updated 4 months ago
- ☆19Updated 3 years ago
- CCL 2023 古汉语通假字语料库的构建及应用研究:通假字资源库☆14Updated last year
- Chinese (Simplified/Traditional) and Japanese Kanji handwriting input method. Convolutional neural network (CNN) using Tensorflow/Keras u…☆14Updated 4 months ago
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆22Updated 3 years ago
- Sentence aligner☆110Updated 3 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆33Updated this week
- OpusFilter - Parallel corpus processing toolkit☆104Updated 2 weeks ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆65Updated 2 years ago
- A corpus of short answers written by learners of English and graded with CEFR levels☆10Updated 3 years ago