sharejing / TakinLinks
A Python toolkit for file processing, text cleaning and data splitting. 文件处理,文本清洗和数据划分的python工具包。
☆33Updated 2 years ago
Alternatives and similar repositories for Takin
Users that are interested in Takin are comparing it to the libraries listed below
Sorting:
- 中文机器阅读理解数据集☆104Updated 4 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆92Updated 5 months ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆118Updated 8 months ago
- CCL 2022 汉语学习者文本纠错评测☆141Updated 2 years ago
- 继续预训练中文bert☆31Updated 4 years ago
- 一个基于预训练的句向量生成工具☆138Updated 2 years ago
- 各大文本摘要模型-中文文本可运行的解决方案☆68Updated last year
- ChineseTextualInference project including chinese corpus build and inferecence model, 中文文本推断项目,包括88万文本蕴含中文文本蕴含数据集的翻译与构建,基于深度学习的文本蕴含判定模型构建…☆174Updated 6 years ago
- 文本智能校对大赛(Chinese Text Correction)的baseline☆68Updated 2 years ago
- OpenTextClassification is all you need for text classification! Open text classification for everyone, enjoy your NLP journey! 这可能是目前为止最全…☆207Updated last year
- 基于pytorch的百度UIE命名实体识别。☆56Updated 2 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆115Updated last year
- CCL2022汉语学习者文本纠错评测任务赛道二——CGED-8第一名解决方案☆54Updated 2 years ago
- LERT: A Linguistically-motivated Pre-trained Language Model(语言学信息增强的预训练模型LERT)☆217Updated 3 weeks ago
- 中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)☆432Updated 5 years ago
- benchmark of KgCLUE, with different models and methods☆28Updated 3 years ago
- ☆136Updated 3 years ago
- 基于seq2edit (Gector) 的中文文本纠错。☆29Updated 2 years ago
- code and data for "CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers"☆71Updated 11 months ago
- A framework for cleaning Chinese dialog data☆274Updated 4 years ago
- 基于词汇信息融合的中文NER模型☆169Updated 3 years ago
- 基于模板的文本纠错;Automatically Mining Error Templates for Grammatical Error Correction☆42Updated 3 years ago
- 历届中文句法错误诊断技术评测数据集☆42Updated 3 years ago
- Unilm for Chinese Chitchat Robot.基于Unilm模型的夸夸式闲聊机器人项目。☆158Updated 4 years ago
- 基于bert进行中文文本纠错☆236Updated 2 years ago
- Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021☆238Updated 2 years ago
- 基于SpanBert的中文指代消解,pytorch实现☆99Updated 2 years ago
- We released BERT-wwm, a Chinese pre-training model based on Whole Word Masking technology, and models closely related to this technology.…☆62Updated 2 years ago
- 大规模中文语料☆43Updated 5 years ago
- CCL 2023 汉语学习者文本纠错评测☆28Updated 2 years ago