近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言
☆171Mar 4, 2025Updated last year
Alternatives and similar repositories for Pre-modern_Chinese_corpus_dataset
Users that are interested in Pre-modern_Chinese_corpus_dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 汉语古典文本资料库☆330Feb 3, 2018Updated 8 years ago
- 古代汉语资源☆17Feb 25, 2023Updated 3 years ago
- 一个面向繁体中文古籍分词的python工具包☆38Jan 3, 2022Updated 4 years ago
- 甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon co…☆667Nov 2, 2021Updated 4 years ago
- SikuBERT:四库全书的预训练语言模型(四库BERT) Pre-training Model of Siku Quanshu☆161Jul 30, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Ancient Chinese Corpus with Word Sense Annotation☆68May 29, 2024Updated last year
- Annotations and code for the EMNLP 2018 paper 'Weeding out Conventionalized Metaphors: A Corpus of Novel Metaphor Annotations'☆10Feb 20, 2023Updated 3 years ago
- 古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆58Aug 23, 2023Updated 2 years ago
- A step-by-step problem set for implementing a high-quality deep dependency parser in Pytorch☆15Aug 12, 2017Updated 8 years ago
- 中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。☆37Dec 3, 2021Updated 4 years ago
- ☆24Aug 24, 2023Updated 2 years ago
- Raw text of 申報☆27Jan 17, 2022Updated 4 years ago
- A Benchmark for Classical Chinese Based on a Crowdsourcing System.☆60May 25, 2021Updated 4 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆15Dec 30, 2025Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆40Nov 13, 2025Updated 6 months ago
- 英文文献的《中国图书馆分类法》自动标注小程序☆12Oct 29, 2024Updated last year
- ☆23Jun 2, 2019Updated 6 years ago
- 非常全的文言文(古文)-现代文平行语料☆1,439Apr 21, 2024Updated 2 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆19Apr 13, 2026Updated last month
- The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)☆16Nov 12, 2024Updated last year
- 中文古诗词语料库☆28Sep 1, 2016Updated 9 years ago
- 数据字典,汉字字典,汉字库,诗经305首, 人名25万☆24Jul 21, 2022Updated 3 years ago
- 中文 NLP 资源库,语料库,相关的框架,文章收集。☆28May 20, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- classic Chinese punctuate experiment with keras using daizhige(殆知阁古代文献藏书) dataset☆35Dec 8, 2022Updated 3 years ago
- An annotated Chinese metaphor dataset☆23Feb 23, 2024Updated 2 years ago
- 殆知阁古代文献☆1,521May 13, 2024Updated 2 years ago
- GuwenBERT: 古文预训练语言模型(古文BERT) A Pre-trained Language Model for Classical Chinese (Literary Chinese)☆563Aug 31, 2021Updated 4 years ago
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆9,895Feb 6, 2026Updated 3 months ago
- 地球上最全的华语现代诗歌语料库,3k+诗人,80K+诗歌,15M+字☆727Sep 12, 2025Updated 8 months ago
- Neural Network Semantic Parser for Almond☆15Apr 11, 2019Updated 7 years ago
- 中国近现代历史文献选集☆79Oct 28, 2023Updated 2 years ago
- 中文恶意网页检测数据集与检测方法☆21Mar 4, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- a Corpus for Classical Chinese Language Event Extraction☆25Nov 11, 2025Updated 6 months ago
- Applied Traditional-Chinese-Handwriting-Dataset to realize handwriting recognition by CNN model.☆36Oct 5, 2023Updated 2 years ago
- 医疗语料库。医疗机构名语料库。药品本位码。☆70Mar 27, 2024Updated 2 years ago
- 中华经典文献数据集☆21Jun 29, 2023Updated 2 years ago
- explores Chinese language models with sub-character level visual information