hkmujj / stop_words
公安网备 敏感词过滤词
☆13Updated 6 years ago
Alternatives and similar repositories for stop_words:
Users that are interested in stop_words are comparing it to the libraries listed below
- 菜谱名语料库。☆15Updated 3 years ago
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆15Updated last year
- ☆37Updated 5 years ago
- 综合的敏感词库,可用于违禁词检测☆32Updated 2 years ago
- 停用词和敏感词库☆16Updated 4 years ago
- worddict crawler and transfer for sougpuinput wordict , 搜狗输入法词库抓取与格式转换☆25Updated 6 years ago
- Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。☆72Updated 5 years ago
- 中文纠错☆92Updated 3 years ago
- Large-scale exact string matching tool☆15Updated 3 weeks ago
- 通过机器学习进行敏感词的识别☆29Updated 7 years ago
- ☆24Updated 3 years ago
- 百度百科 500 万数据集☆34Updated last year
- English or Chinses GPT2Dialog model from GPT2-chitchat☆12Updated 5 years ago
- 🤖️🐱 一个基于 Rasa 的中文聊天机器人——「锅贴」☆22Updated 3 years ago
- Sentence-Transformers Information Retrieval example on Chinese☆29Updated last year
- 新华字典:成语、谚语、词语☆32Updated 6 years ago
- 医疗语料库。医疗机构名语料库。药品本位码。☆69Updated last year
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多卡、多worker、多客户端调用,开箱即用。☆10Updated 2 years ago
- 兼容 GPT2、Bloom 等 Pytorch 框架下的语言模型、人工智能标记语言 (AIML) 和任务型对话系统 (Task) 的深度中文智能对话机器人框架☆26Updated last year
- 百度QA100万数据集☆47Updated last year
- 对dbpedia和百科采集而来的语料进行清洗,得到合适的三元组☆14Updated 7 years ago
- This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Co…☆62Updated 3 weeks ago
- 基于中文TaCL-BERT的中文命名实体识别及中文分词☆33Updated 3 years ago
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)☆107Updated last year
- 中文、分词、词表、核心词典、事件词表、停用词、敏感词、问答、问答数据、知识图谱、文本语料。☆159Updated 3 years ago
- 过滤词向量的 敏感词--->过滤敏感词☆7Updated 3 years ago
- Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理,可实现 CPU 上毫秒级的 OCR 精准预测,通用场景中英文OCR达到开源SO…☆65Updated 2 months ago
- Let ChatGPT (Large Language Models) Serve As Data Annotator and Zero-shot/few-shot Information Extractor.☆31Updated 2 years ago
- 基于Qwen2模型进行通用信息抽取【实体/关系/事件抽取】☆30Updated 8 months ago
- 该项目主要是抽取病历文件中的一些关键信息。并将抽取的内容进行streamlit前端的展示。目前支持的文件类型:图片,pdf文件,word文件☆23Updated 2 years ago