moss-on-stone / shenbao-txtLinks
Raw text of 申報
☆26Updated 3 years ago
Alternatives and similar repositories for shenbao-txt
Users that are interested in shenbao-txt are comparing it to the libraries listed below
Sorting:
- uncover old chinese textual parallels based on sound☆14Updated 9 months ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆13Updated last month
- ☆28Updated 3 months ago
- ☆37Updated 10 months ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆64Updated 9 months ago
- A curated list of digital things related to the field of Chinese studies.☆33Updated 4 years ago
- ☆12Updated 3 years ago
- Tool for performing basic text analysis on the CBETA corpus☆33Updated last year
- This project aims to curate and provide a comprehensive collection of prompts designed specifically for generative AI models in the conte…☆34Updated 5 months ago
- Data and some scripts for historical social network analysis in Chinese Buddhism☆19Updated 2 years ago
- High-performance text aligner for large collections of texts☆52Updated 3 months ago
- MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se…☆27Updated 3 months ago
- Umbrella repository that describes the collections contained in any given release of ELTeC☆13Updated 3 years ago
- A CWN Python binding with graph structure☆33Updated 2 years ago
- QuanSyn: A Python Package for Quantitative Syntax Analysis.☆36Updated 4 months ago
- This is a pre-trained LSTM model. This model can help you to segment unpunctuated historical Chinese texts. 這是基於 LSTM 的預訓練模型。此模型可幫助您為漢語古文…☆27Updated 3 years ago
- Buddhist Studies Authority Databases☆19Updated 3 years ago
- Foreign Relations of the United States - TEI XML source files☆38Updated last week
- <u><a href="https://circse.github.io/LT4HALA/" style="color: white">Workshop on Language Technologies for Historical and Ancient Language…☆33Updated last year
- https://sites.google.com/site/multidimensionaltagger☆36Updated last year
- Chinese character variant converter. 中文异体字转换器。☆19Updated 4 months ago
- A simple toolkit for conducting analyses using corpus methods☆26Updated 3 years ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆127Updated 4 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene☆115Updated this week
- The GitHub repository for the AI for Humanists Project☆18Updated 2 months ago
- ☆21Updated 2 years ago
- Automatic transcription models for Chinese historical documents trained with the kraken OCR engine☆14Updated last year
- CCL 2023 古汉语通假字语料库的构建及应用研究:通假字资源库☆18Updated last year
- 一个面向繁体中文古籍分词的python工具包☆34Updated 3 years ago
- Digital Humanities Across Borders☆49Updated last year