CIRCSE/LT4HALA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CIRCSE/LT4HALA)

CIRCSE / LT4HALA

Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)

☆38

Alternatives and similar repositories for LT4HALA

Users that are interested in LT4HALA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jizijing / C-CLUE
View on GitHub
A Benchmark for Classical Chinese Based on a Crowdsourcing System.
☆60May 25, 2021Updated 5 years ago
KoichiYasuoka / SuPar-Kanbun
View on GitHub
Tokenizer POS-tagger and Dependency-parser for Classical Chinese
☆20Jun 10, 2026Updated last month
hsc748NLP / sikufenci
View on GitHub
一个面向繁体中文古籍分词的python工具包
☆38Jan 3, 2022Updated 4 years ago
Ethan-yt / guwen-models
View on GitHub
GuwenModels: 古文自然语言处理模型合集, 收录互联网上的古文相关模型及资源. A collection of Classical Chinese natural language processing models, including Classical Ch…
☆201Dec 11, 2023Updated 2 years ago
yuting-wei / AC-EVAL
View on GitHub
The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)
☆17Nov 12, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hsc748NLP / SikuBERT-for-digital-humanities-and-classical-Chinese-information-processing
View on GitHub
SikuBERT：四库全书的预训练语言模型（四库BERT） Pre-training Model of Siku Quanshu
☆167Jul 30, 2023Updated 2 years ago
frederick-wang / tongjiazi-resources
View on GitHub
CCL 2023 古汉语通假字语料库的构建及应用研究：通假字资源库
☆29Sep 23, 2023Updated 2 years ago
KoichiYasuoka / UD-Kanbun
View on GitHub
Tokenizer POS-tagger and Dependency-parser for Classical Chinese
☆76Jun 10, 2026Updated last month
Ethan-yt / CCLUE
View on GitHub
古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
☆58Aug 23, 2023Updated 2 years ago
FelixMohr / NLP-with-Python
View on GitHub
Using Conditional Random Fields for segmenting Latin words written in scriptio continua
☆10May 30, 2018Updated 8 years ago
RUCAIBox / Erya
View on GitHub
☆19Oct 6, 2023Updated 2 years ago
beeevita / Classical-Chinese-NER-RE-Dataset
View on GitHub
A dataset used for NLP tasks.
☆10Apr 17, 2021Updated 5 years ago
hsc748NLP / code-for-digital-humanities-tutorial
View on GitHub
<数字人文教程>资源合集
☆119May 28, 2024Updated 2 years ago
iris2hu / ancient_chinese_sense_annotation
View on GitHub
Ancient Chinese Corpus with Word Sense Annotation
☆74May 29, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
KoichiYasuoka / GuwenCOMBO
View on GitHub
Tokenizer POS-tagger and Dependency-parser for Classical Chinese
☆15Dec 30, 2025Updated 6 months ago
kasparvonbeelen / ghi_python
View on GitHub
Programming for Historians
☆17Sep 12, 2022Updated 3 years ago
Lyn4ever29 / GuwenEE
View on GitHub
a Corpus for Classical Chinese Language Event Extraction
☆25Nov 11, 2025Updated 8 months ago
baudzhou / WYWEB
View on GitHub
An evaluation bentchmark for classical Chinese
☆20Dec 13, 2023Updated 2 years ago
ancientml / ml-for-ancient-languages
View on GitHub
Machine Learning for Ancient Languages
☆32Apr 12, 2024Updated 2 years ago
centre-for-humanities-computing / odyCy
View on GitHub
A general-purpose NLP pipeline for Ancient Greek
☆28Mar 26, 2024Updated 2 years ago
chrisdrymon / angel
View on GitHub
An Ancient Greek Morphology Tagger
☆28May 9, 2023Updated 3 years ago
jiaeyan / Jiayan
View on GitHub
甲言，专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包，支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon co…
☆678Nov 2, 2021Updated 4 years ago
nevenjovanovic / camena-neolatinlit
View on GitHub
Archive of the XML files of the Mannheim / Heidelberg CAMENA Neo-Latin project
☆20Oct 10, 2018Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
raynardj / yuan
View on GitHub
渊 - A project for Classical Chinese
☆110Feb 23, 2022Updated 4 years ago
AgBigdataLab / Ancient-Agri-LLM
View on GitHub
🎉 Repo for Ancient-Agri-LLM.古农文大语言模型
☆10Sep 13, 2024Updated last year
TianRuiHe / GuwenLLAMA
View on GitHub
基于ChineseAlpaca微调的，专精与古汉语翻译、古汉语断句的大语言模型
☆21Aug 20, 2023Updated 2 years ago
GoThereGit / EvaHan
View on GitHub
Evaluation of Natural Language Processing (NLP) tools for the Ancient Chinese language
☆48Mar 15, 2026Updated 4 months ago
OpenGreekAndLatin / patrologia_latina-dev
View on GitHub
Machine-corrected versions of selections of the Patrologia Latina.
☆27Apr 2, 2019Updated 7 years ago
hemingkx / WordSeg
View on GitHub
A PyTorch implementation of a BiLSTM \ BERT \ Roberta (+ BiLSTM + CRF) model for Chinese Word Segmentation (中文分词) .
☆217Jul 28, 2022Updated 3 years ago
Paul-scpark / Moral-Emotion
View on GitHub
[ACL'24] Moral Emotion Dataset & Classifier
☆15Jun 26, 2026Updated 3 weeks ago
mahavivo / scripta-sinica
View on GitHub
汉语古典文本资料库
☆350Feb 3, 2018Updated 8 years ago
tangxuemei1995 / CHisIEC
View on GitHub
CHisIEC An Information Extraction Corpus for Ancient Chinese History
☆24Nov 25, 2025Updated 8 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
readsoftware / read
View on GitHub
Research Environment for Ancient Documents
☆46Jan 24, 2026Updated 6 months ago
nyu-mll / msgs
View on GitHub
This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.
☆21Jan 10, 2022Updated 4 years ago
tm4roon / data-augmentation-for-nlp
View on GitHub
An implementation of data augmentation methods for natural language processing tasks.
☆13Jul 25, 2024Updated last year
wilburOne / ACE_ERE_Scripts
View on GitHub
Preprocessing scripts for ACE and ERE datasets
☆15Jul 28, 2020Updated 5 years ago
Nathan-Roll1 / PSST
View on GitHub
Prosodic Speech Segmentation with Transformers
☆28Feb 25, 2024Updated 2 years ago
ssocean / AlphX-Code-For-DAR
View on GitHub
粤港澳大湾区（黄埔）国际算法算例大赛-古籍文档图像识别与分析算法比赛 Alphx队源码
☆46Mar 16, 2023Updated 3 years ago
mromanello / CitationExtractor
View on GitHub
A tool to extract canonical references from text.
☆20Jun 23, 2021Updated 5 years ago