sharejing / Takin
A Python toolkit for file processing, text cleaning and data splitting.
☆34Updated 3 years ago
Alternatives and similar repositories for Takin
Users that are interested in Takin are comparing it to the libraries listed below
- Chinese machine reading comprehension datasets☆107Updated 4 years ago
 - 🌈 NERpy: Implementation of Named Entity Recognition using Python. A named entity recognition tool that supports models such as BertSoftmax and BertSpan and works out of the box.☆116Updated last year
 - A pretraining-based sentence embedding generation tool☆137Updated 2 years ago
 - Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆95Updated 8 months ago
 - Zero-shot NLU & NLG based on the mengzi-t5-base-mt pretrained model☆76Updated 3 years ago
 - Major text summarization models: runnable solutions for Chinese text☆69Updated 2 years ago
 - Mimix: A Text Generation Tool and Pretrained Chinese Models☆158Updated last year
 - benchmark of KgCLUE, with different models and methods☆28Updated 3 years ago
 - ChineseTextualInference: a Chinese textual inference project, including the translation and construction of an 880,000-example Chinese textual entailment dataset and a deep-learning-based textual entailment model…☆175Updated 6 years ago
 - ChatGLM2-6B fine-tuning, SFT/LoRA, instruction finetune☆110Updated 2 years ago
 - Parameter-efficient fine-tuning of ChatGLM-6B based on LoRA and P-Tuning v2☆55Updated 2 years ago
 - Hands-on information extraction with llama☆100Updated 2 years ago
 - Chinese natural language inference dataset (a large-scale Chinese natural language inference and semantic similarity calculation dataset)☆433Updated 5 years ago
 - ☆136Updated 4 years ago
 - Chinese text error correction based on BERT☆237Updated 2 years ago
 - A framework for cleaning Chinese dialog data☆274Updated 4 years ago
 - A retrieval-based dialogue system solution built on vector recall: dense retrieval, FAQ…☆33Updated 3 years ago
 - PERT: Pre-training BERT with Permuted Language Model☆366Updated 3 months ago
 - Continued pretraining of Chinese BERT☆31Updated 4 years ago
 - The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model☆119Updated 10 months ago
 - Baseline for the Intelligent Text Proofreading Competition (Chinese Text Correction)☆68Updated 3 years ago
 - Chinese coreference resolution based on SpanBert, implemented in PyTorch☆101Updated 2 years ago
 - Unilm for Chinese Chitchat Robot. A compliment-style chitchat bot project based on the Unilm model.☆158Updated 4 years ago
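 - Chinese, word segmentation, vocabularies, core dictionaries, event vocabularies, stop words, sensitive words, question answering, QA data, knowledge graphs, text corpora.☆171Updated 4 years ago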
 - Experiments with several semantic matching models and a comparison of the experimental results.☆163Updated last week
 - moss chat finetuning☆51Updated last year
 - OpenTextClassification is all you need for text classification! Open text classification for everyone, enjoy your NLP journey! This may be the most comprehensive so far…☆208Updated last year
 - Named entity recognition with Baidu UIE, based on PyTorch.☆56Updated 2 years ago
 - Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021☆239Updated 3 years ago
 - ☆59Updated 4 years ago