hkmujj / stop_wordsLinks
公安网备 敏感词过滤词
☆14Updated 6 years ago
Alternatives and similar repositories for stop_words
Users that are interested in stop_words are comparing it to the libraries listed below
Sorting:
- 菜谱名语料库。☆16Updated 4 years ago
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆16Updated 2 years ago
- Large-scale exact string matching tool☆17Updated 6 months ago
- 中文纠错☆93Updated 3 years ago
- ☆25Updated 3 years ago
- ☆37Updated 6 years ago
- 🤖️🐱 一个基于 Rasa 的中文聊天机器人——「锅贴」☆22Updated 4 years ago
- Transformer模型训练的单轮对话聊天机器人☆85Updated 4 years ago
- 百度QA100万数据集☆48Updated last year
- Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。☆77Updated 5 years ago
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多卡、多worker、多客户端调用,开箱即用。☆12Updated 3 years ago
- 医疗语料库。医疗机构名语料库。药品本位码。☆69Updated last year
- 中文文本改写☆20Updated 4 years ago
- 仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记【文本匹配篇】☆13Updated 3 years ago
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)☆109Updated 2 years ago
- 基于腾讯TexSmart分词SDK的ES分词插件☆15Updated 5 years ago
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆53Updated last year
- 🤖 聊天机器人示例,定制聊天机器人,聊天机器人语料导入导出☆127Updated last year
- 京东/淘宝客服对话数据公开,seq2seq生成模型设计对话系统获第二名☆44Updated 2 years ago
- Let ChatGPT (Large Language Models) Serve As Data Annotator and Zero-shot/few-shot Information Extractor.☆32Updated 2 years ago
- 大语言模型ChatGLM-6B为基座,接入文档阅读功能进行实时问答,可上传txt/docx/pdf多种文件类型。☆42Updated 2 years ago
- English or Chinses GPT2Dialog model from GPT2-chitchat☆12Updated 5 years ago
- 中文、分词、词表、核心词典、事件词表、停用词、敏感词、问答、问答数据、知识图 谱、文本语料。☆170Updated 3 years ago
- 🤖️ 聊天机器人——夫子的「自然语言理解」模块☆90Updated 2 years ago
- aigc evals☆10Updated last year
- Chinese MobileBERT(中文MobileBERT模型)☆95Updated 3 years ago
- Source code and checkpoints for legal pre-trained language models.☆15Updated 4 years ago
- 网络表情NLP,颜文字识别,颜文字表情实体识别、属性检测、新颜发现☆43Updated 5 years ago
- 通过机器学习进行敏感词的识别☆29Updated 7 years ago
- 该项目主要是抽取病历文件中的一些关键信息。并将抽取的内容进行streamlit前端的展示。目前支持的文件类型:图片,pdf文件,word文件☆24Updated 2 years ago