hkmujj / stop_words
Sensitive-word filter list for public security network filing (公安网备)
☆14 Updated 6 years ago
Alternatives and similar repositories for stop_words
Users who are interested in stop_words are comparing it to the libraries listed below
- A comprehensive sensitive-word lexicon that can be used for prohibited-word detection ☆33 Updated 2 years ago
- Large-scale exact string matching tool ☆17 Updated 2 months ago
- Chinese text rewriting ☆19 Updated 4 years ago
- Baidu QA dataset with 1 million entries ☆47 Updated last year
- Qimen refers to the art of Qimen Dunjia (奇门遁甲); a tool for extracting various kinds of entities. ☆29 Updated 5 years ago
- Sentence-Transformers Information Retrieval example on Chinese ☆29 Updated last year
- Baidu Baike dataset with 5 million entries ☆35 Updated last year
- A tool for quickly determining the location that a text (news article) belongs to ☆18 Updated 4 years ago
- ☆37 Updated 5 years ago
- Source code and checkpoints for legal pre-trained language models. ☆15 Updated 4 years ago
- Sensitive-word recognition using machine learning ☆28 Updated 7 years ago
- Chinese text error correction ☆92 Updated 3 years ago
- Word-dictionary crawler and format converter for Sogou Input Method dictionaries ☆25 Updated 7 years ago
- Chinese Couplets Dataset without vulgar words; contains no sensitive content. ☆73 Updated 5 years ago
- Knowledge Graph Examples ☆18 Updated 10 months ago
- 🤖️🐱 A Rasa-based Chinese chatbot named "Guotie" (锅贴) ☆22 Updated 3 years ago
- Intelligent marketing copy generation ☆35 Updated last month
- Self-implemented Pinyin2Chinese demo using algorithms including a trie and an HMM; a simple demo of pinyin segmentation and pinyin-to-Chinese conversion based on a hidden Markov model and a trie. ☆86 Updated 7 years ago
- Stop-word and sensitive-word lexicons ☆17 Updated 4 years ago
- A corpus of recipe names. ☆15 Updated 3 years ago
- Time-expression extraction and normalization for spoken language ☆13 Updated 5 years ago
- Comprehensive evaluation of the Chinese version of the open-source Llama3 model, based on the SuperCLUE benchmark | Llama3 Chinese Evaluation with SuperCLUE ☆16 Updated last year
- PNW, a Chinese new-word discovery algorithm that can identify new words of any length. ☆15 Updated last year
- This repository mainly records reading notes on top-conference papers relevant to NLP algorithm engineers [text matching volume] ☆13 Updated 2 years ago
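- FinanceEventGraph, an open event-graph dataset for the financial domain that can be used for building and experimenting with event graphs; it includes 3865 acquire (merger and acquisition) events and 9093 invest (investment) events, 12960 events in total ☆19 Updated last year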
- Chinese named entity recognition and Chinese word segmentation based on Chinese TaCL-BERT ☆32 Updated 3 years ago
- An ES word-segmentation plugin based on the Tencent TexSmart word-segmentation SDK ☆14 Updated 4 years ago
- Extracts company names from Chinese text ☆8 Updated 6 years ago
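- Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model (~14MB). Inference based on the ppocr-v4-onnx model enables accurate, millisecond-level OCR prediction on CPU; Chinese/English OCR in general scenarios reaches open-source SO… ☆82 Updated 4 months ago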
- ☆23 Updated last year