selfcs / stop-and-sensitive-words
停用词和敏感词库
☆14Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for stop-and-sensitive-words
- aigc evals☆10Updated 11 months ago
- TensorRT☆11Updated 4 years ago
- 公安网备 敏感词过滤词☆13Updated 6 years ago
- 百度QA100万数据集☆49Updated 11 months ago
- huggingface ChineseBert Tokenizer☆15Updated 2 years ago
- GoGPT中文指令数据集构造☆10Updated 9 months ago
- Datafountain-Epidemic government affairs quiz assistant competition. We divided this task into two parts: document retrieval and answer e…☆13Updated 2 years ago
- GLM (General Language Model)☆24Updated 2 years ago
- 基于simhash的文本去重算法☆19Updated 3 years ago
- NLP预/后处理工具。☆29Updated 4 months ago
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆42Updated this week
- ☆20Updated 2 years ago
- 基于Pytorch实现的中文文本分类脚手架,以及常用模型对比。☆18Updated 3 years ago
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆54Updated last year
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆15Updated last year
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Updated 2 years ago
- Large-scale exact string matching tool☆15Updated last week
- 针对保险话术培训场景设计的陪练机器人/培训机器人的demo☆18Updated 3 years ago
- 通用简单工具项目☆14Updated last month
- 【Demo】找寻近义词的三种方法☆26Updated 4 years ago
- 时间抽取、解析、标准化工具☆49Updated 2 years ago
- 大规模中文语料☆38Updated 5 years ago
- RelExt: A Tool for Relation Extraction from Text. 文本实体关系抽取工具。☆48Updated 2 years ago
- 基于Roformer的文本相似度☆12Updated 3 years ago
- 有一个通用实体关系事件抽取的任务,需要使用到UIE模框架,而且需要将起部署 到昇腾310服务器上,因为UIE模型底层使用的是ernie3.0,但是目前paddle官方还不支持ernie3.0模型在昇腾310上部署,所以才有了以下的操作,主要过程是,先试用paddle训练处模型…☆17Updated 2 years ago
- 法研杯犯罪金额提取☆12Updated 2 years ago
- 中文文本改写☆19Updated 4 years ago
- rasa_chinese 的服务 package☆18Updated 3 years ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆14Updated last year