一个面向繁体中文古籍分词的python工具包
☆36Jan 3, 2022Updated 4 years ago
Alternatives and similar repositories for sikufenci
Users that are interested in sikufenci are comparing it to the libraries listed below
Sorting:
- 古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆56Aug 23, 2023Updated 2 years ago
- Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)☆35Mar 2, 2026Updated last week
- <数字人文教程>资源合集☆110May 28, 2024Updated last year
- Hanja Understanding Evaluation Dataset☆15May 2, 2022Updated 3 years ago
- ☆41Feb 20, 2023Updated 3 years ago
- SikuBERT:四库全书的预训练语言模型(四库BERT) Pre-training Model of Siku Quanshu☆153Jul 30, 2023Updated 2 years ago
- An evaluation bentchmark for classical Chinese☆18Dec 13, 2023Updated 2 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆19Feb 28, 2026Updated last week
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆15Dec 30, 2025Updated 2 months ago
- ☆38Dec 24, 2022Updated 3 years ago
- A Benchmark for Classical Chinese Based on a Crowdsourcing System.☆59May 25, 2021Updated 4 years ago
- Evaluation of Natural Language Processing (NLP) tools for the Ancient Chinese language☆44Feb 26, 2026Updated last week
- This repository is intended for people who are interesting in learning/reading Classical Chinese (文言文) but be at a loss what to do to. As…☆13Jul 1, 2021Updated 4 years ago
- ☆29Nov 12, 2025Updated 3 months ago
- CHisIEC An Information Extraction Corpus for Ancient Chinese History☆18Nov 25, 2025Updated 3 months ago
- GuwenModels: 古文自然语言处理模型合集, 收录互联网上的古文相关模型及资源. A collection of Classical Chinese natural language processing models, including Classical Ch…☆194Dec 11, 2023Updated 2 years ago
- Repo for the LREC 2022 paper The Project Dialogism Novel Corpus: A Dataset for Quotation Attribution in Literary Texts.☆14Jul 27, 2022Updated 3 years ago
- RDF -to- text generator, using GANs and reinforcement learning. For Google summer of code 2020.☆14Mar 25, 2023Updated 2 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆70Feb 23, 2026Updated 2 weeks ago
- ☆21Apr 30, 2023Updated 2 years ago
- NDL古典籍OCRのアプリケーション(ソースコードを含む)☆93Oct 14, 2025Updated 4 months ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆19Dec 16, 2021Updated 4 years ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆23Apr 23, 2024Updated last year
- 近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言☆169Mar 4, 2025Updated last year
- 甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon co…☆655Nov 2, 2021Updated 4 years ago
- Digital Humanities Research Software Engineering Summer School 2023. Talks and workshops designed to give an insight into the roles and p…☆25Jul 31, 2023Updated 2 years ago
- Ancient Chinese Corpus with Word Sense Annotation☆62May 29, 2024Updated last year
- An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group☆52Dec 13, 2018Updated 7 years ago
- Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset"☆25Jul 8, 2022Updated 3 years ago
- For review of draft Unihan database changes, removals, and additions by experts.☆74Feb 13, 2026Updated 3 weeks ago
- ☆31Jan 5, 2022Updated 4 years ago
- Materials for the Text to Tech workshop at the Digital Humanities Oxford Summer School☆16Aug 8, 2025Updated 7 months ago
- [文稿整理]日本生活留学指北_哔哩哔哩_bilibili☆12May 10, 2025Updated 9 months ago
- Interactive map for the Rensselaer Polytechnic Institute campus.☆10Jan 7, 2023Updated 3 years ago
- 한양대학교 도시공학과 머신러닝☆10Aug 10, 2023Updated 2 years ago
- 译者编程进阶指南☆14Jan 21, 2024Updated 2 years ago
- ☆70Jul 6, 2020Updated 5 years ago
- A simple OCR preprocessing tool using Python with a GUI.☆33Dec 21, 2022Updated 3 years ago
- Human labeled Chinese jokes and their verification codes in Python☆11Dec 10, 2021Updated 4 years ago