purobaburi / manchu-resources
🈵 Collected resources to learn/study Manchu (Manchurian Language). 满语滿族満州語入門。
☆12Updated last year
Alternatives and similar repositories for manchu-resources:
Users that are interested in manchu-resources are comparing it to the libraries listed below
- A Manchu dictionary website☆11Updated last month
- A simple dictionary in Manchu, Chinese and English.☆11Updated 9 years ago
- Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a frame…☆18Updated last year
- A OCR System for the Manchu Script☆33Updated last year
- https://arxiv.org/pdf/2402.18025☆29Updated 3 weeks ago
- CCL 2023 古汉语通假字语料库的构建及应用研究:通假字资源库☆12Updated last year
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆65Updated 3 months ago
- ☆28Updated 3 months ago
- 粵文語料篩選器 Cantonese text filter☆38Updated last week
- Linguistically analyzed Classical Tibetan texts☆26Updated 3 years ago
- A Python library to add reconstructed pronunciations of Middle Chinese on Chinese texts☆9Updated last year
- Adaptive Machine Translation with Large Language Models☆30Updated last month
- <u><a href="https://circse.github.io/LT4HALA/" style="color: white">Workshop on Language Technologies for Historical and Ancient Language…☆33Updated 8 months ago
- A frequency lexicon for Hong Kong Cantonese☆21Updated 4 years ago
- HDIC : Integrated Database of Hanzi Dictionaries in Early Japan☆36Updated this week
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆51Updated last month
- uncover old chinese textual parallels based on sound☆13Updated 3 months ago
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆156Updated 10 months ago
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆35Updated 4 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆40Updated last year
- repo for Tibetan corpora☆21Updated last year
- The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…☆18Updated 3 years ago
- Yet another IDS (Ideographic Description Sequences) lists with MIT license☆105Updated 5 months ago
- Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"☆16Updated last year
- Multilingual sentence alignment using sentence embeddings☆108Updated 3 months ago
- 🏷 བོད་ཏོ ག [pʰøtɔk̚] Tibetan word tokenizer in Python☆61Updated 3 weeks ago
- Tools for processing open Cantonese dictionary data provided words.hk☆18Updated 3 weeks ago
- A list of vocabulary lists☆21Updated 4 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆86Updated 3 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆13Updated last year