NiuTrans/Classical-Modern

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NiuTrans/Classical-Modern)

NiuTrans / Classical-Modern

非常全的文言文（古文）-现代文平行语料

☆1,467

Alternatives and similar repositories for Classical-Modern

Users that are interested in Classical-Modern are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BangBOOM / Classical-Chinese
View on GitHub
古文现代文翻译平行语料库
☆117Jan 12, 2022Updated 4 years ago
jiaeyan / Jiayan
View on GitHub
甲言，专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包，支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon co…
☆678Nov 2, 2021Updated 4 years ago
Ethan-yt / guwen-models
View on GitHub
GuwenModels: 古文自然语言处理模型合集, 收录互联网上的古文相关模型及资源. A collection of Classical Chinese natural language processing models, including Classical Ch…
☆201Dec 11, 2023Updated 2 years ago
Scagin / CCTC
View on GitHub
文言文翻译、古文翻译语料数据集
☆53Oct 14, 2020Updated 5 years ago
Ethan-yt / guwenbert
View on GitHub
GuwenBERT: 古文预训练语言模型（古文BERT） A Pre-trained Language Model for Classical Chinese (Literary Chinese)
☆566Aug 31, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
cloudyskyy / Guwen-UNILM
View on GitHub
本仓库是基于bert4keras实现的古文-现代文翻译模型。具体使用了基于掩码自注意力机制的UNILM(Li al., 2019)预训练模型作为翻译系统的backbone。我们首先使用了普通的中文（现代文）BERT、Roberta权重作为UNILM的初始权重以训练UNILM…
☆54May 3, 2022Updated 4 years ago
Ethan-yt / CCLUE
View on GitHub
古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
☆58Aug 23, 2023Updated 2 years ago
garychowcmu / daizhigev20
View on GitHub
殆知阁古代文献
☆1,596May 13, 2024Updated 2 years ago
raynardj / yuan
View on GitHub
渊 - A project for Classical Chinese
☆110Feb 23, 2022Updated 4 years ago
hsc748NLP / SikuBERT-for-digital-humanities-and-classical-Chinese-information-processing
View on GitHub
SikuBERT：四库全书的预训练语言模型（四库BERT） Pre-training Model of Siku Quanshu
☆168Jul 30, 2023Updated 2 years ago
mahavivo / scripta-sinica
View on GitHub
汉语古典文本资料库
☆350Feb 3, 2018Updated 8 years ago
ttzHome / AnchiBERT
View on GitHub
AnchiBERT: A Pre-Trained Model for Ancient Chinese Language Understanding and Generation(古文预训练模型)
☆75Jul 16, 2021Updated 5 years ago
jizijing / C-CLUE
View on GitHub
A Benchmark for Classical Chinese Based on a Crowdsourcing System.
☆60May 25, 2021Updated 5 years ago
dayihengliu / a2m_chineseNMT
View on GitHub
Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset"
☆27Jul 8, 2022Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
KoichiYasuoka / SuPar-Kanbun
View on GitHub
Tokenizer POS-tagger and Dependency-parser for Classical Chinese
☆20Jun 10, 2026Updated last month
KoichiYasuoka / UD-Kanbun
View on GitHub
Tokenizer POS-tagger and Dependency-parser for Classical Chinese
☆76Jun 10, 2026Updated last month
Xunzi-LLM-of-Chinese-classics / XunziALLM
View on GitHub
☆432Jul 20, 2025Updated last year
Jihuai-wpy / bert-ancient-chinese
View on GitHub
Pretrained BERT for Ancient (Classical) Chinese, with an expanded vocabulary for rare characters.
☆48Feb 20, 2023Updated 3 years ago
baudzhou / WYWEB
View on GitHub
An evaluation bentchmark for classical Chinese
☆20Dec 13, 2023Updated 2 years ago
CIRCSE / LT4HALA
View on GitHub
Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)
☆38May 19, 2026Updated 2 months ago
esbatmop / MNBVC
View on GitHub
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志…
☆4,246Jul 13, 2026Updated last week
SCUT-DLVCLab / TongGu-LLM
View on GitHub
[EMNLP 2024] TongGu, a classical Chinese language model.
☆70Sep 28, 2024Updated last year
Werneror / Poetry
View on GitHub
非常全的古诗词数据，收录了从先秦到现代的共计85万余首古诗词。
☆1,758Aug 8, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
yxcs / poems-db
View on GitHub
比较全的中华古诗古词古文库，包括21万首古诗词，以及注释、赏析等信息，包含10000多名诗人以及诗人的介绍、生平等，同时包含，1600多个词牌介绍，中国70多个朝代解析，和古诗文的近200个分类标签
☆426Sep 11, 2023Updated 2 years ago
UniversalDependencies / UD_Classical_Chinese-Kyoto
View on GitHub
☆31May 6, 2026Updated 2 months ago
hsc748NLP / sikufenci
View on GitHub
一个面向繁体中文古籍分词的python工具包
☆38Jan 3, 2022Updated 4 years ago
iris2hu / ancient_chinese_sense_annotation
View on GitHub
Ancient Chinese Corpus with Word Sense Annotation
☆74May 29, 2024Updated 2 years ago
RUCAIBox / Erya
View on GitHub
☆19Oct 6, 2023Updated 2 years ago
JasonWade001 / chtxt
View on GitHub
中华经典古籍精校、诗词，四书五经、四大名著、诗经、楚辞、全唐诗、全宋词、唐诗三百首、宋詞三百首、二十四史......
☆162Feb 19, 2021Updated 5 years ago
JiangYanting / Pre-modern_Chinese_corpus_dataset
View on GitHub
近代汉语语料库数据集自然语言处理语料库古代汉语古汉语文言文数字人文计算语言
☆173Mar 4, 2025Updated last year
yuting-wei / AC-EVAL
View on GitHub
The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)
☆17Nov 12, 2024Updated last year
brightmart / nlp_chinese_corpus
View on GitHub
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
☆9,906Feb 6, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
chinese-poetry / chinese-poetry
View on GitHub
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人，21050首词。
☆52,880Jun 17, 2026Updated last month
rime-aca / corpus
View on GitHub
古典中文語料庫
☆309Jun 11, 2022Updated 4 years ago
ydli-ai / CSL
View on GitHub
[COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集
☆673Jun 19, 2023Updated 3 years ago
Scagin / Classical2Modern
View on GitHub
This is an interpreter which translate classical Chinese to modern Chinese. 该项目是一个把文言文翻译成现代文的翻译器
☆36May 5, 2023Updated 3 years ago
JianXiao2021 / ancient_text_generation_LLM
View on GitHub
输入现代汉语句子，生成古汉语风格的句子。基于荀子基座大模型，采用“文言文（古文）- 现代文平行语料”中的部分数据进行LoRA微调训练而得。
☆284Aug 27, 2024Updated last year
hsc748NLP / SikuGPT
View on GitHub
☆21Apr 30, 2023Updated 3 years ago
pwxcoo / chinese-xinhua
View on GitHub
中华新华字典数据库。包括歇后语，成语，词语，汉字。
☆11,617Dec 26, 2023Updated 2 years ago