selfcs / stop-and-sensitive-wordsLinks
停用词和敏感词库
☆17Updated 4 years ago
Alternatives and similar repositories for stop-and-sensitive-words
Users that are interested in stop-and-sensitive-words are comparing it to the libraries listed below
Sorting:
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆55Updated 2 years ago
- 中国知网论文数据集,24000+篇论文信息。自然语言处理、信息管理、文本分类、文本摘要、关键词抽取、研究热点分析、数据挖掘、数据分析☆53Updated 5 months ago
- 面向金融领域的小样本跨类迁移事件抽取 第三名 方案及代码☆17Updated 4 years ago
- moss chat finetuning☆51Updated last year
- 打造人人都会的NLP,开源不易,记得star哦☆101Updated 2 years ago
- RelExt: A Tool for Relation Extraction from Text. 文本实体关系抽取工具。☆50Updated 3 years ago
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆49Updated this week
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆16Updated 2 years ago
- 词、句拼音转汉字、拼音分割、拼音补全、pygame输入中文☆15Updated 5 years ago
- 百度QA100万数据集☆48Updated last year
- 百度百科爬虫☆34Updated 5 years ago
- use chatGLM to perform text embedding☆45Updated 2 years ago
- ☆22Updated 3 years ago
- 中文文本改写☆20Updated 4 years ago
- 中文、分词、词表、核心词典、事件词表、停用词、敏感词、问答、问答数据、知识图谱、文本语料。☆169Updated 3 years ago
- 实现一种多Lora权值集成切换+Zero-Finetune零微调增强的跨模型技术方案,LLM-Base+LLM-X+Alpaca,初期,LLM-Base为Chatglm6B底座模型,LLM-X是LLAMA增强模型。该方案简易高效,目标是使此类语言模型能够低能耗广泛部署, 并最…☆117Updated 2 years ago
- 时间抽取、解析、标准化工具☆55Updated 2 years ago
- GoGPT中文指令数据集构造☆10Updated last year
- 利用LLM+敏感词库,来自动判别是否涉及敏感词。☆127Updated 2 years ago
- 文本智能校对大赛(Chinese Text Correction)的baseline☆68Updated 2 years ago
- 中文纠错☆93Updated 3 years ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆109Updated 2 years ago
- clueai工具包: 3行代码3分钟,自定义需要的API!☆232Updated 2 years ago
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT☆114Updated 2 years ago
- aigc evals☆10Updated last year
- BLOOM 模型的指令微调☆24Updated 2 years ago
- 百度百科爬虫☆75Updated last year
- NLP预/后处理工具。☆31Updated 5 months ago
- 大规模中文语料☆44Updated 5 years ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year