selfcs / stop-and-sensitive-words
Stop-word and sensitive-word lexicon
☆17 Updated 5 years ago
Alternatives and similar repositories for stop-and-sensitive-words
Users that are interested in stop-and-sensitive-words are comparing it to the libraries listed below
- Parameter-efficient fine-tuning of ChatGLM-6B based on LoRA and P-Tuning v2 ☆55 Updated 2 years ago
- A survey of large language model training and serving ☆36 Updated 2 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings) ☆95 Updated 8 months ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model ☆119 Updated 10 months ago
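- Ziya-LLaMA-13B is a 13-billion-parameter large-scale pretrained model from IDEA, based on LLaMa, capable of translation, programming, text classification, information extraction, summarization, copywriting, commonsense question answering, and mathematical calculation. The Ziya general-purpose model has completed a three-stage training process: large-scale pretraining, multi-task supervised fine-tuning, and learning from human feedback. This document is mainly for Ziya-… ☆45 Updated 2 years ago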
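- Baseline for the Intelligent Text Proofreading Competition (Chinese Text Correction) ☆68 Updated 3 years ago
- Court judgment document data ☆11 Updated 5 years ago
- Zero-shot learning evaluation benchmark, Chinese version ☆57 Updated 4 years ago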
- moss chat finetuning ☆51 Updated last year
- (NBCE) Naive Bayes-based Context Extension on ChatGLM-6b ☆15 Updated 2 years ago
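- CNKI (China National Knowledge Infrastructure) paper dataset with information on 24,000+ papers. Natural language processing, information management, text classification, text summarization, keyword extraction, research-hotspot analysis, data mining, data analysis ☆53 Updated 7 months ago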
- ☆22 Updated 5 years ago
- WoBERT_pytorch ☆41 Updated 4 years ago
- Large-scale Chinese corpus ☆44 Updated 5 years ago
- A large, high-quality corpus of Chinese synonyms. ☆64 Updated 3 years ago
- "桃李" (Taoli): a large language model for international Chinese education ☆185 Updated last year
- RelExt: A Tool for Relation Extraction from Text. An entity relation extraction tool for text. ☆51 Updated 3 years ago
- Text deduplication ☆76 Updated last year
- ☆22 Updated 3 years ago
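- Chinese text correction based on seq2edit (Gector). ☆29 Updated 2 years ago
- CTC2021: SOTA solution and online demo for the Chinese Text Correction competition ☆73 Updated 2 years ago
- Template-based text correction; Automatically Mining Error Templates for Grammatical Error Correction ☆44 Updated 3 years ago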
- [TALLIP] General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining ☆59 Updated last year
- Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning ☆49 Updated 2 years ago
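- Chinese book dataset / data mining / natural language processing / Chinese Library Classification / library and information science / text classification ☆93 Updated 7 months ago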
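- Uses an LLM plus a sensitive-word lexicon to automatically detect whether text contains sensitive words. ☆132 Updated 2 years ago
- Chinese, word segmentation, vocabularies, core dictionary, event vocabulary, stop words, sensitive words, question answering, QA data, knowledge graphs, text corpora. ☆171 Updated 4 years ago
- Tool for time expression extraction, parsing, and normalization ☆55 Updated 3 years ago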
- 1.4B sLLM for Chinese and English - HammerLLM🔨 ☆44 Updated last year
- CCL2024 Chinese Essay Rhetoric Recognition and Understanding ☆16 Updated last year