yuanjie-ai / ChineseSensitiveVocabulary
Violence/terrorism and contraband, pornographic text, politically sensitive content, malicious promotion, vulgar abuse
☆98 Updated 3 years ago
Alternatives and similar repositories for ChineseSensitiveVocabulary
Users who are interested in ChineseSensitiveVocabulary are comparing it to the libraries listed below.
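- Chinese preprocessing corpus ☆109 Updated 6 years ago
- Performance benchmarks of the major Chinese word segmenters ☆157 Updated 6 years ago
- Chinese sensitive-word detection implemented with a DFA ☆101 Updated 2 years ago
- Chinese, word segmentation, vocabularies, core dictionary, event vocabulary, stop words, sensitive words, Q&A, Q&A data, knowledge graph, text corpora. ☆162 Updated 3 years ago
- Chinese homophone word/character library (Chinese Homophones) ☆105 Updated 5 years ago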
- PyTorch model for https://github.com/imcaspar/gpt2-ml ☆79 Updated 3 years ago
- ☆37 Updated 5 years ago
- A simple Pinyin2Chinese demo of pinyin segmentation and pinyin-to-Chinese conversion, based on a Trie and a hidden Markov model (HMM). ☆86 Updated 7 years ago
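- Short-text similarity ☆103 Updated 3 years ago
- Sensitive-word identification using machine learning ☆28 Updated 7 years ago
- Domain-specific lexicon construction / Chinese new-word discovery / domain lexicon discovery ☆29 Updated 5 years ago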
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (a Chinese chit-chat dialogue system with one-click deployment of a WeChat chit-chat bot) ☆107 Updated last year
- A corpus of Chinese abbreviations, including negative full forms. ☆196 Updated 3 years ago
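- Commonly used Chinese stop-word lists and a comparison of them ☆72 Updated 6 years ago
- TensorFlow + BERT + seq2seq Zhou Gong dream interpretation. AI meets metaphysics: describe your dream and the model automatically decodes its omens. Similar to a chatbot (QA): you ask, it answers. ☆128 Updated 5 years ago
- Commonly used Chinese stop-word lists ☆75 Updated 7 years ago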
- ☆79 Updated 8 years ago
- ☆58 Updated 3 years ago
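- A simple, easy-to-use Python module for manipulating dates and times through strings: regex-based time extraction, string time parsing, and extraction of Chinese time expressions from a sentence. ☆76 Updated 10 months ago
- A compiled collection of sensitive-word lists ☆176 Updated 9 years ago
- Chinese text rewriting ☆19 Updated 4 years ago
- 李傲龍's blog ☆81 Updated 10 months ago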
- Mirror of dongxiexidian/Chinese ☆300 Updated 6 years ago
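- A collection of Chinese offensive terms from common business-security scenarios, such as ad-driven traffic diversion, insults, violence/terrorism, and pornography, gambling, drugs, and politics. ☆59 Updated 2 years ago
- Regex extraction and normalization of time keywords ☆21 Updated 3 years ago
- Automatic spelling correction for Chinese words ☆121 Updated 4 years ago
- Compliment corpus, built from data of Douban's mutual-praise group ☆75 Updated 6 years ago
- Crawls summary-title pairs of news from portal sites and uses seq2seq to generate titles from summaries ☆45 Updated 7 years ago
- Fine-tunes pre-trained language models (BERT, Roberta, XLBert, etc.) to compute the similarity between two texts (cast as a sentence-pair classification task); suitable for Chinese text ☆89 Updated 4 years ago
- Chinese sentence similarity computation based on the gensim module ☆52 Updated 6 years ago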