yukiyuqichen / CHAR
Chinese character variant converter. 中文异体字转换器。
☆14Updated 3 weeks ago
Related projects: ⓘ
- This project aims to curate and provide a comprehensive collection of prompts designed specifically for generative AI models in the conte…☆27Updated 8 months ago
- This is a pre-trained LSTM model. This model can help you to segment unpunctuated historical Chinese texts. 這是基於 LSTM 的預訓練模型。此模型可幫助您為漢語古文…☆21Updated 2 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆13Updated 11 months ago
- ☆33Updated last year
- A curated list of digital things related to the field of Chinese studies.☆29Updated 4 years ago
- Automatic transcription models for Chinese historical documents trained with the kraken OCR engine☆9Updated 11 months ago
- ☆27Updated 4 months ago
- Raw text of 申報☆16Updated 2 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆62Updated 6 months ago
- uncover old chinese textual parallels based on sound☆12Updated this week
- ☆12Updated 2 years ago
- GuwenModels: 古文自然语言处理模型合集, 收录互联网上的古文相关模型及资源. A collection of Classical Chinese natural language processing models, including Classical Ch…☆146Updated 9 months ago
- A tool for ancient Chinese segmentation.☆53Updated 5 years ago
- Tool for performing basic text analysis on the CBETA corpus☆30Updated last year
- ☆238Updated last month
- A cute toolkit for OCR with GUI, including image preprocessing and text recognition. Works out of the box. 一只小小的OCR工具箱,包括图像预处理和 文字识别等功能,…☆11Updated 9 months ago
- Ancient Chinese Corpus with Word Sense Annotation☆39Updated 3 months ago
- Buddhist Studies Authority Databases☆19Updated 2 years ago
- 古文现代文翻译平行语料库☆95Updated 2 years ago
- 一个面向繁体中文古籍分词的python工具包☆31Updated 2 years ago
- ☆27Updated last year
- High-performance text aligner for large collections of texts☆43Updated 3 weeks ago
- <u><a href="https://circse.github.io/LT4HALA/" style="color: white">Workshop on Language Technologies for Historical and Ancient Language…☆32Updated 3 months ago
- The latest SQLite version of the China Biographical Database☆92Updated last week
- 基于ChineseAlpaca微调的,专精与古汉语翻译、古汉语断句的大语言模型☆8Updated last year
- SikuBERT:四库全书的预训练语言模型(四库BERT) Pre-training Model of Siku Quanshu☆110Updated last year
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆11Updated 3 months ago
- AnchiBERT: A Pre-Trained Model for Ancient Chinese Language Understanding and Generation(古文预训练模型)☆59Updated 3 years ago
- NDL古典籍OCRのアプリケーション(ソースコードを含む)☆31Updated last month
- ☆14Updated last year