近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言
☆168Mar 4, 2025Updated 11 months ago
Alternatives and similar repositories for Pre-modern_Chinese_corpus_dataset
Users that are interested in Pre-modern_Chinese_corpus_dataset are comparing it to the libraries listed below
Sorting:
- 汉语古典文本资料库☆321Feb 3, 2018Updated 8 years ago
- An evaluation bentchmark for classical Chinese☆18Dec 13, 2023Updated 2 years ago
- 一个面向繁体中文古籍分词的python工具包☆36Jan 3, 2022Updated 4 years ago
- ☆24Aug 24, 2023Updated 2 years ago
- Ancient Chinese Corpus with Word Sense Annotation☆62May 29, 2024Updated last year
- 古代汉语资源☆17Feb 25, 2023Updated 3 years ago
- SikuBERT:四库全书的预训练语言模型(四库BERT) Pre-training Model of Siku Quanshu☆153Jul 30, 2023Updated 2 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆19Updated this week
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆14Dec 30, 2025Updated 2 months ago
- A step-by-step problem set for implementing a high-quality deep dependency parser in Pytorch☆15Aug 12, 2017Updated 8 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆39Nov 13, 2025Updated 3 months ago
- 中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。☆37Dec 3, 2021Updated 4 years ago
- A Benchmark for Classical Chinese Based on a Crowdsourcing System.☆59May 25, 2021Updated 4 years ago
- Annotations and code for the EMNLP 2018 paper 'Weeding out Conventionalized Metaphors: A Corpus of Novel Metaphor Annotations'☆10Feb 20, 2023Updated 3 years ago
- Raw text of 申報☆27Jan 17, 2022Updated 4 years ago
- 中文古诗词语料库☆27Sep 1, 2016Updated 9 years ago
- 医疗语料库。医疗机构名语料库。药品本位码。☆70Mar 27, 2024Updated last year
- Buddhist Studies Authority Databases☆18Nov 8, 2021Updated 4 years ago
- A large corpus of Chinese fixed phrases and idioms scraped from a reputable educational website (30310 instances). 一个大型的中文成语及俗语语料库,内含3031…☆11Oct 29, 2021Updated 4 years ago
- 非常全的文言文(古文)-现代文平行语料☆1,413Apr 21, 2024Updated last year
- An Ellipsis-aware Chinese Dependency Treebank for Web Text☆26May 14, 2018Updated 7 years ago
- Deep Learning For Ultrasound Tongue Imaging☆12Dec 17, 2024Updated last year
- 《管錐編》(Limited Views: Essays on Ideas and Letters)☆14Oct 10, 2017Updated 8 years ago
- 英文文献的《中国图书馆分类法》自动标注小程序☆13Oct 29, 2024Updated last year
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆9,862Feb 6, 2026Updated 3 weeks ago
- 地球上最全的华语现代诗歌语料库,3k+诗人,80K+诗歌,15M+字☆719Sep 12, 2025Updated 5 months ago
- GuwenBERT: 古文预训练语言模型(古文BERT) A Pre-trained Language Model for Classical Chinese (Literary Chinese)☆554Aug 31, 2021Updated 4 years ago
- 殆知阁古代文献☆1,471May 13, 2024Updated last year
- classic Chinese punctuate experiment with keras using daizhige(殆知阁古代文献藏书) dataset☆35Dec 8, 2022Updated 3 years ago
- Code for TALLIP2019 paper "µ-Forcing: Training Variational Recurrent Autoencoders for Text Generation"☆12May 27, 2019Updated 6 years ago
- Using BiLSTM-CRF model for Chinese NER☆15Mar 1, 2018Updated 8 years ago
- The starting point for raising issues for Libero Publisher☆16Mar 25, 2020Updated 5 years ago
- 古漢語常用字典☆13Sep 1, 2016Updated 9 years ago
- An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group☆52Dec 13, 2018Updated 7 years ago
- 古典中文語料庫☆301Jun 11, 2022Updated 3 years ago
- Neural Network Semantic Parser for Almond☆15Apr 11, 2019Updated 6 years ago
- Chinese Characters Visualization & Chinese Text Augmentation.☆17Sep 19, 2022Updated 3 years ago
- CHisIEC An Information Extraction Corpus for Ancient Chinese History☆18Nov 25, 2025Updated 3 months ago
- A large high-quality corpus of Chinese synonyms 一个大型、高质量的中文同义词语料库。☆69Nov 20, 2021Updated 4 years ago