wjx-git / IllegalTextDetection
☆75Updated last year
Alternatives and similar repositories for IllegalTextDetection:
Users that are interested in IllegalTextDetection are comparing it to the libraries listed below
- 根据维基中文语料库预训练 GloVe 中文词向量 ;Pre-train GloVe word-embedding From Chinese Wiki corpus☆71Updated last year
- The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection☆258Updated 2 years ago
- 评估自然语言的流畅度☆114Updated 3 years ago
- SimCSE有监督与无监督实验复现☆149Updated last year
- experiments of some semantic matching models and comparison of experimental results.☆160Updated last year
- 基于SpanBert的中文指代消解,pytorch实现☆97Updated 2 years ago
- 中文自然语言推理与语义相似度数据集☆346Updated 3 years ago
- dialogbot, provide search-based dialogue, task-based dialogue and generative dialogue model. 对话机器人,基于问答型对话、任务型对话、聊天型对话等模型实现,支持网络检索问答,领域知识…☆331Updated 11 months ago
- 3000000+语义理解与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型☆293Updated 2 years ago
- 收集了目前为止中文领域的MRC抽取式数据集☆118Updated 9 months ago
- SimCSE中文语义相似度对比学习模型☆84Updated 3 years ago
- 文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT☆73Updated 4 months ago
- 基于pytorch+bert的中文文本分类☆84Updated last year
- 基于 pytorch 的 bert 实现和下游任务微调☆50Updated 2 years ago
- SimBERT升级版(SimBERTv2)!☆441Updated 3 years ago
- 无监督中文关键词抽取(Keyphrase Extraction),基于统计,基于图【LDA与PageRank(TextRank, TPR, Salience Rank, Single TPR等)】,基于嵌入【SIFRank等】,开箱即用!☆104Updated 2 years ago
- 超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题☆130Updated 3 years ago
- A framework for cleaning Chinese dialog data☆267Updated 3 years ago
- 句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。☆98Updated 2 years ago
- SimCSE在中文上的复现,有监督+无监督☆274Updated last month
- 基于Pytorch的文本分类框架,支持TextCNN、Bert、Electra等。☆61Updated 2 years ago
- ☆88Updated 3 years ago
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening、SBERT、SmiCSE☆176Updated 3 years ago
- 一个基于预训练的句向量生成工具☆136Updated 2 years ago
- 中文数据集下SimCSE+ESimCSE的实现☆191Updated 2 years ago
- ☆278Updated 2 years ago
- CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)☆233Updated 2 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆113Updated last year
- 中文无监督SimCSE Pytorch实现☆134Updated 3 years ago
- 基于prompt的中文文本分类。☆54Updated last year