hkmujj / stop_words
公安网备 敏感词过滤词
☆13Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for stop_words
- Large-scale exact string matching tool☆15Updated last week
- ☆37Updated 5 years ago
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆15Updated last year
- 词、句拼音转汉字、拼音分割、拼音补全、pygame输入中文☆14Updated 4 years ago
- Source code and checkpoints for legal pre-trained language models.☆15Updated 3 years ago
- 百度百科 500 万数据集☆30Updated 11 months ago
- ☆24Updated 2 years ago
- 有一个通用实体关系事件抽取的任务,需要使用到UIE模框架,而且需要将起部署到昇腾310服务器上,因为UIE模型底层使用的是ernie3.0,但是目前paddle官方还不支持ernie3.0模型在昇腾310上部署,所以才有了以下的操作,主要过程是,先试用paddle训练处模型…☆17Updated 2 years ago
- 中文文本改写☆19Updated 4 years ago
- 通过机器学习进行敏感词的识别☆29Updated 6 years ago
- Translation model based on sequence to sequence model. 基于seq2seq模型的翻译模型demo☆17Updated 6 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆70Updated last year
- Sentence-Transformers Information Retrieval example on Chinese☆29Updated 9 months ago
- worddict crawler and transfer for sougpuinput wordict , 搜狗输入法词库抓取与格式转换☆25Updated 6 years ago
- 过滤词向量的敏感词--->过滤敏感词☆7Updated 3 years ago
- 百度QA100万数据集☆49Updated 11 months ago
- 基于sentence-transformers实现文本转向量的机器人☆45Updated 2 years ago
- 大规模中文语料☆38Updated 5 years ago
- 仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记【文本匹配篇】☆11Updated 2 years ago
- ☆19Updated last year
- 基于 onnxruntime 推理引擎的中文 ltp 词法分析☆13Updated 2 years ago
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)☆107Updated last year
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆17Updated last year
- 基于腾讯TexSmart分词SDK的ES分词插件☆14Updated 4 years ago
- Chinese Grammatical Error Diagnosis☆11Updated 3 years ago
- 基于Roformer的文本相似度☆12Updated 3 years ago
- 医疗语料库。医疗机构名语料库。药品本位码。☆57Updated 7 months ago
- KuaiSearch PERKS☆11Updated 3 years ago
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Updated 2 years ago