dan-qqq / Chinese_dialect_distance
县级行政区方言所属数据及根据语言树算得的方言距离。Linguistic distances between Chinese dialects.
☆30Updated 2 years ago
Alternatives and similar repositories for Chinese_dialect_distance:
Users that are interested in Chinese_dialect_distance are comparing it to the libraries listed below
- A toolset for computation and comparison of Chinese dialects☆36Updated this week
- 粤语分词工具☆46Updated 6 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Updated 4 years ago
- 人民日报爬虫(Python)☆114Updated 2 months ago
- This is a code example repo for the NLP course offered by the Institute of Chinese Information Processing of BNU.☆16Updated this week
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆59Updated last year
- 人民日报(1946-2003)☆134Updated 6 years ago
- Chinese Dialect Database☆17Updated 7 years ago
- QuanSyn: A Python Package for Quantitative Syntax Analysis.☆34Updated 2 weeks ago
- This project aims to curate and provide a comprehensive collection of prompts designed specifically for generative AI models in the conte…☆32Updated 3 weeks ago
- China county-level population data (census)☆82Updated last year
- Workshop on Functional PCA for Phonetics and Prosody☆27Updated this week
- Total Factor Productivity with Undesirable Outputs in Stata☆25Updated 3 months ago
- Shanghainese TTS☆23Updated last year
- Stata连享会推文集锦☆69Updated 2 years ago
- 人民日报(1946-2024)、习近平系列重要讲话数据库、古诗文☆57Updated last month
- 中文文本分析工具、语料、预训练模型相关资源汇总。☆137Updated this week
- Some basic praat scripts.☆205Updated 9 months ago
- Transformers for Cantonese☆56Updated 4 years ago
- Machine Learning for Social Scientists☆61Updated last year
- Trainable algorithm for automatic measurement of voice onset time☆64Updated last year
- VOT manipulation☆18Updated 2 years ago
- AlphaReadabilityChinese is a tool that calculates the readability of Chinese texts, which includes indices at lexical, syntactic, and sem…☆23Updated last year
- Here are some resources for Stata learning.☆23Updated 2 weeks ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Updated 3 years ago
- 使用 pinyin-data 和 phrase-pinyin-data 中的拼音数据文件覆盖 pypinyin 中的内置拼音数据☆57Updated 3 months ago
- 近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言☆156Updated last month
- some basic scripts for linguistics☆11Updated 2 weeks ago
- ☆81Updated last year
- SikuBERT:四库全书的预训练语言模型(四库BERT) Pre-training Model of Siku Quanshu☆130Updated last year