sharejing / TakinLinks
A Python toolkit for file processing, text cleaning and data splitting. 文件处理,文本清洗和数据划分的python工具包。
☆34Updated 3 years ago
Alternatives and similar repositories for Takin
Users that are interested in Takin are comparing it to the libraries listed below
Sorting:
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆116Updated last year
- benchmark of KgCLUE, with different models and methods☆28Updated 3 years ago
- 中文机器阅读理解数据集☆108Updated 4 years ago
- 一个基于预训练的句向量生成工具☆138Updated 2 years ago
- 继续预训练中文bert☆31Updated 4 years ago
- BERT微调在机器翻译上的应用,哎哟,效果贼好。☆49Updated 4 years ago
- 各大文本摘要模型-中文文本可运行的解决方案☆69Updated 2 years ago
- 基于向量召回的检索式对话系统解决方案,dense retrieval,FAQ……☆34Updated 4 years ago
- ChineseTextualInference project including chinese corpus build and inferecence model, 中文文本推断项目,包括88万文本蕴含中文文本蕴含数据集的翻译与构建,基于深度学习的文本蕴含判定模型构建…☆175Updated 6 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆95Updated 9 months ago
- 中文、分词、词表、核心词典、事件词表、停用词、敏感词、问答、问答数据、知识图谱、文本语料。☆170Updated 4 years ago
- LERT: A Linguistically-motivated Pre-trained Language Model(语言学信息增强的预训练模型LERT)☆220Updated 4 months ago
- 基于pytorch的百度UIE命名实体识别。☆56Updated 2 years ago
- OpenTextClassification is all you need for text classification! Open text classification for everyone, enjoy your NLP journey! 这可能是目前为止最全…☆208Updated last year
- Mimix: A Text Generation Tool and Pretrained Chinese Models☆157Updated last year
- NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model☆76Updated 3 years ago
- 文本智能校对大赛(Chinese Text Correction)的baseline☆68Updated 3 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆120Updated 11 months ago
- 时间抽取、解析、标准化工具☆55Updated 3 years ago
- 基于bert进行中文文本纠错☆240Updated 2 years ago
- Bert预训练模型fine-tune计算文本相似度☆111Updated 2 years ago
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆55Updated 2 years ago
- ☆136Updated 4 years ago
- 中文标注工具,支持NER、文本分类、关系标注、对话标注等。☆82Updated last year
- CLUEWSC2020: WSC Winograd模式挑战中文版,中文指代消解任务☆78Updated 5 years ago
- 基于seq2edit (Gector) 的中文文本纠错。☆29Updated 3 years ago
- PERT: Pre-training BERT with Permuted Language Model☆366Updated 4 months ago
- 中文文本纠错模型,keras实现☆74Updated 4 years ago
- dialogbot, provide search-based dialogue, task-based dialogue and generative dialogue model. 对话机器人,基于问答型对话、任务型对话、聊天型对话等模型实现,支持网络检索问答,领域知识…☆333Updated last year
- 中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)☆433Updated 5 years ago