BrikerMan / classic_chinese_punctuate
Classical Chinese punctuation experiment with Keras using the daizhige (殆知阁古代文献藏书) dataset
☆33 Updated last year
Related projects
Alternatives and complementary repositories for classic_chinese_punctuate
- A corpus of Chinese abbreviations, including negative full forms. ☆189 Updated 3 years ago
- Evaluation of Chinese word segmentation tools ☆59 Updated last year
- A tool for ancient Chinese segmentation. ☆53 Updated 5 years ago
- A corpus of book titles, including some film and game titles. ☆66 Updated 7 months ago
- Chinese-related dictionaries and corpora. ☆168 Updated 10 years ago
- ☆91 Updated last week
- An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group ☆47 Updated 5 years ago
- Simple Solution for Multi-Criteria Chinese Word Segmentation ☆300 Updated 4 years ago
- THU Chinese Keyphrase Extraction Toolkit ☆124 Updated 6 years ago
- Education-industry news corpus for automatic summarization ☆195 Updated 6 years ago
- MicroTokenizer: A lightweight, full-featured Chinese tokenizer that helps students understand how tokenizers work, designed for educational and research purposes. Provides a… ☆147 Updated last month
- Tokenizer, POS-tagger, and dependency parser for Classical Chinese ☆62 Updated this week
- Tools for the People's Daily corpus ☆268 Updated last year
- For personal study. Please star or fork the original author's repository. ☆27 Updated 9 years ago
- ChineseAntiword: an antonym lookup interface for Chinese words, based on a dictionary crawled from online resources ☆58 Updated 6 years ago
- This is a pre-trained LSTM model that can help you segment unpunctuated historical Chinese texts… ☆22 Updated 3 years ago
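- Early Modern Chinese corpus dataset. Keywords: natural language processing, corpus, ancient Chinese, classical Chinese, digital humanities, computational linguistics ☆147 Updated last year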
- Source code and corpora for the paper "Iterated Dilated Convolutions for Chinese Word Segmentation" ☆136 Updated 3 years ago
- ☆31 Updated 5 years ago
- Kuakua (praise) corpus, drawn from data of the Douban mutual-praise group ☆75 Updated 5 years ago
- Code for a Chinese error detection module, using n-grams and a Bi-LSTM ☆131 Updated 5 years ago
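- Performance evaluation of the major Chinese word segmenters ☆154 Updated 5 years ago
- Automatic error correction for Chinese text ☆80 Updated 6 years ago
- Annotated Chinese corpus of the People's Daily, January to April 1998 ☆29 Updated 6 years ago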
- ☆173 Updated last year