moss-on-stone / shenbao-txt
Raw text of 申報
☆26 Updated 3 years ago
Alternatives and similar repositories for shenbao-txt:
Users who are interested in shenbao-txt are comparing it to the repositories listed below
- ☆12 Updated 2 years ago
- ☆36 Updated 6 months ago
- A curated list of digital things related to the field of Chinese studies. ☆32 Updated 4 years ago
- ☆28 Updated last week
- Tokenizer, POS-tagger and Dependency-parser for Classical Chinese ☆65 Updated 5 months ago
- Tokenizer, POS-tagger and Dependency-parser for Classical Chinese ☆13 Updated last year
- This project aims to curate and provide a comprehensive collection of prompts designed specifically for generative AI models in the conte… ☆32 Updated last month
- Uncover Old Chinese textual parallels based on sound ☆13 Updated 6 months ago
- Tokenizer, POS-tagger and Dependency-parser for Classical Chinese ☆15 Updated last month
- A tool for ancient Chinese segmentation. ☆53 Updated 6 years ago
- Data and some scripts for historical social network analysis in Chinese Buddhism ☆18 Updated 2 years ago
- Workshop on Language Technologies for Historical and Ancient Language… (https://circse.github.io/LT4HALA/) ☆34 Updated 11 months ago
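- CCL 2023: Construction and application of a corpus of tongjia (phonetic loan) characters in ancient Chinese: tongjia character resource database ☆15 Updated last year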
- The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t… ☆19 Updated 3 years ago
- Cantonese segmentation tool ☆30 Updated 4 years ago
- A pre-trained LSTM model that helps segment unpunctuated historical Chinese texts. ☆26 Updated 3 years ago
- https://sites.google.com/site/multidimensionaltagger ☆34 Updated last year
- A Package for Cantonese Tokenisation ☆17 Updated 3 years ago
- Chinese character variant converter. ☆18 Updated 2 weeks ago
- Buddhist Studies Authority Databases ☆19 Updated 3 years ago
- MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se… ☆24 Updated 3 weeks ago
- High-performance text aligner for large collections of texts ☆51 Updated 2 weeks ago
- Neural Language Models for Historical Research ☆25 Updated 6 months ago
- Automatic transcription models for Chinese historical documents trained with the kraken OCR engine ☆13 Updated last year
- A simple toolkit for conducting analyses using corpus methods ☆25 Updated 3 years ago
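- A Python toolkit for word segmentation of ancient texts in Traditional Chinese ☆32 Updated 3 years ago
- A parallel corpus of Classical Chinese texts and their Modern Chinese translations ☆103 Updated 3 years ago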
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko… ☆12 Updated 3 years ago
- Chinese Dialect Database ☆17 Updated 7 years ago
- ☆18 Updated last year