psychbruce / ChineseNames
🀄 Chinese Name Database (1930-2008).
☆147Updated 6 months ago
Alternatives and similar repositories for ChineseNames:
Users that are interested in ChineseNames are comparing it to the libraries listed below
- A tool for ancient Chinese segmentation.☆53Updated 5 years ago
- 人民日报(1946-2003)☆129Updated 6 years ago
- 近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言☆152Updated 2 months ago
- Machine Learning for Social Scientists☆61Updated last year
- 古文现代文翻译平行语料库☆100Updated 3 years ago
- BERT-CCPoem is an BERT-based pre-trained model particularly for Chinese classical poetry☆148Updated 2 years ago
- Raw text of 申報☆23Updated 3 years ago
- GuwenBERT: 古文预训练语言模型(古文BERT) A Pre-trained Language Model for Classical Chinese (Literary Chinese)☆515Updated 3 years ago
- Chinese Dialect Database☆17Updated 7 years ago
- AnchiBERT: A Pre-Trained Model for Ancient Chinese Language Understanding and Generation(古文预训练模型)☆63Updated 3 years ago
- 人民日报(1946-2023)、习近平系列重要讲话数据库、古诗文☆53Updated 4 months ago
- This is a pre-trained LSTM model. This model can help you to segment unpunctuated historical Chinese texts. 這是基於 LSTM 的預訓練模型。此模型可幫助您為漢語古文…☆24Updated 3 years ago
- 汉典网站数据(汉字、拼音、释义等)☆57Updated last year
- 人民日报 爬虫(Python)☆100Updated this week
- 《计算社会科学》课程☆22Updated 6 months ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆15Updated 2 months ago
- Poetry-related datasets developed by THUAIPoet (Jiuge) group.☆218Updated 4 years ago
- 甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon co…☆599Updated 3 years ago
- GuwenModels: 古文自然语言处理模型合集, 收录互联网上的古文相关模型及资源. A collection of Classical Chinese natural language processing models, including Classical Ch…☆166Updated last year
- Ancient Chinese Corpus with Word Sense Annotation☆46Updated 8 months ago
- The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…☆18Updated 3 years ago
- Download, extract and index Weiboscope data☆24Updated 5 years ago
- Automated Essay Scoring Method for Chinese Second Language Writing☆25Updated 2 years ago
- Han character library for CJKV languages☆153Updated 3 years ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Updated 3 years ago
- QuanSyn: A Python Package for Quantitative Syntax Analysis.☆23Updated last month
- 渊 - A project for Classical Chinese☆96Updated 2 years ago
- Interactive grand unified timeline of 30,800 ancient Chinese people / 古人全表☆111Updated 4 years ago
- This project aims to curate and provide a comprehensive collection of prompts designed specifically for generative AI models in the conte…☆31Updated 3 weeks ago
- Official code and data of the ACL 2022 paper "QuoteR: A Benchmark of Quote Recommendation for Writing"☆59Updated 2 years ago