sharejing / TakinLinks
A Python toolkit for file processing, text cleaning and data splitting. 文件处理,文本清洗和数据划分的python工具包。
☆34Updated 2 years ago
Alternatives and similar repositories for Takin
Users that are interested in Takin are comparing it to the libraries listed below
Sorting:
- 中文机器阅读理解数据集☆105Updated 4 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆115Updated last year
- 继续预训练中文bert☆31Updated 4 years ago
- LERT: A Linguistically-motivated Pre-trained Language Model(语言学信息增强的预训练模型LERT)☆219Updated 3 months ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆93Updated 7 months ago
- 一个基于预训练的句向量生成工具☆137Updated 2 years ago
- 各大文本摘要模型-中文文本可运行的解决方案☆69Updated 2 years ago
- Mimix: A Text Generation Tool and Pretrained Chinese Models☆158Updated 11 months ago
- Unilm for Chinese Chitchat Robot.基于Unilm模型的夸夸式闲聊机器人项目。☆158Updated 4 years ago
- 基于词汇信息融合的中文NER模型☆170Updated 3 years ago
- Collections of resources from Joint Laboratory of HIT and iFLYTEK Research (HFL)☆375Updated 2 years ago
- 基于pytorch的百度UIE命名实体识别。☆56Updated 2 years ago
- OpenTextClassification is all you need for text classification! Open text classification for everyone, enjoy your NLP journey! 这可能是目前为止最全…☆208Updated last year
- 文本智能校对大赛(Chinese Text Correction)的baseline☆68Updated 3 years ago
- 基于向量召回的检索式对话系统解决方案,dense retrieval,FAQ……☆33Updated 3 years ago
- 中文标注工具,支持NER、文本分类、关系标注、对话标注等。☆82Updated last year
- 中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)☆433Updated 5 years ago
- NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model☆76Updated 3 years ago
- 基于bert进行中文文本纠错☆236Updated 2 years ago
- 本人项目进行中搜集的数据集,包含原始数据和经过处理后的数据,项目持续更新。☆116Updated 4 years ago
- ChineseTextualInference project including chinese corpus build and inferecence model, 中文文本推断项目,包括88万文本蕴含中文文本蕴含数据集的翻译与构建,基于 深度学习的文本蕴含判定模型构建…☆175Updated 6 years ago
- Some Cool NLP and CV Repositories and Solutions (收集NLP中常见任务的开源解决方案、数据集、工具、学习资料等)☆162Updated 4 years ago
- 时间抽取、解析、标准化工具☆55Updated 2 years ago
- benchmark of KgCLUE, with different models and methods☆28Updated 3 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆119Updated 10 months ago
- An open-source and powerful Information Extraction toolkit based on GPT (GPT for Information Extraction; GPT4IE for short)。Note: we set a…☆173Updated 2 years ago
- PERT: Pre-training BERT with Permuted Language Model☆365Updated 3 months ago
- KgCLUE: 大规模中文开源知识图谱问答☆452Updated 3 years ago
- experiments of some semantic matching models and comparison of experimental results.☆163Updated 2 years ago
- SimBERT升级版(SimBERTv2)!☆445Updated 3 years ago