hkmujj / stop_words
公安网备 敏感词过滤词
☆13Updated 6 years ago
Alternatives and similar repositories for stop_words:
Users that are interested in stop_words are comparing it to the libraries listed below
- 综合的敏感词库,可用于违禁词检测☆32Updated last year
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆15Updated last year
- Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。☆69Updated 5 years ago
- Large-scale exact string matching tool☆15Updated 2 months ago
- 用于生成文本纠错模型(如Gector)需要的大量数据。☆14Updated 2 years ago
- ☆37Updated 5 years ago
- 百度百科 500 万数据集☆32Updated last year
- 通过机器学习进行敏感词的识别☆29Updated 6 years ago
- 收录常见业务安全场景中文脏词,如广告引流、辱骂、暴恐、黄赌毒政类。☆57Updated 2 years ago
- 百度QA100万数据集☆47Updated last year
- 中文纠错☆91Updated 2 years ago
- 基于自动生成知识库的智能问答系统☆17Updated 5 years ago
- Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理,可实现 CPU 上毫秒级的 OCR 精准预测,通用场景中英文OCR达到开源SO…☆52Updated 3 weeks ago
- Source code and checkpoints for legal pre-trained language models.☆15Updated 3 years ago
- worddict crawler and transfer for sougpuinput wordict , 搜狗输入法词库抓取与格式转换☆25Updated 6 years ago
- 基于腾讯TexSmart分词SDK的ES分词插件☆14Updated 4 years ago
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多卡、多worker、多客户端调用,开箱即用。☆10Updated 2 years ago
- 菜谱名语料库。☆15Updated 3 years ago
- 有一个通用实体关系事件抽取的任务,需要使用到UIE模框架,而且需要将起部署到昇腾310服务器上,因为UIE模型底层使用的是ernie3.0,但是目前paddle官方还不支持ernie3.0模型在昇腾310上部署,所以才有了以下的操作,主要过程是,先试用paddle训练处模型…☆17Updated 2 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆9Updated 3 years ago
- 停用词和敏感词库☆15Updated 4 years ago
- Llama3开源模型中文版-全方位测评,基于SuperCLUE基准 | Llama3 Chinese Evaluation with SuperCLUE☆16Updated 8 months ago
- 兼容 GPT2、Bloom 等 Pytorch 框架下的语言模型、人工智能标记语言 (AIML) 和任务型对话系统 (Task) 的深度中文智能对话机器人框架☆26Updated last year
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)☆107Updated last year
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆17Updated last year
- Based on the Langchain framework, a retrieval and generative chatbot. 基于langchain实现的检索式和生成式问答☆22Updated this week
- English or Chinses GPT2Dialog model from GPT2-chitchat☆11Updated 4 years ago
- 中文文本改写☆19Updated 4 years ago
- 大语言模 型训练和服务调研☆35Updated last year