entropy2333 / 2022BytedanceSecurityAICompetition_track1Links
2022字节跳动安全AI挑战赛赛道一冠军—— 基于文本和多模态数据的风险识别 题目名称:Emoji复杂文本识别
☆13Updated 2 years ago
Alternatives and similar repositories for 2022BytedanceSecurityAICompetition_track1
Users that are interested in 2022BytedanceSecurityAICompetition_track1 are comparing it to the libraries listed below
Sorting:
- 2021字节跳动安全AI挑战赛赛道一亚军—— 基于文本和多模态数据的风险识别 题目名称:色情导流用户识别☆19Updated 2 years ago
- 超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题☆132Updated 3 years ago
- 基于pytorch_bert的中文多标签分类☆91Updated 3 years ago
- THUCNews中文文本分类数据集,该数据集包含84万篇新闻文档,总计14类;在该模型的基础上测试多个版本bert分类效果。☆66Updated 4 years ago
- PyTorch使用BERT进行英语多标签文本分类☆34Updated 3 years ago
- SimCSE中文语义相似度对比学习模型☆89Updated 3 years ago
- experiments of some semantic matching models and comparison of experimental results.☆163Updated 2 years ago
- ☆86Updated last year
- Chinese NLP Data Augmentation, BERT Contextual Augmentation☆111Updated 3 years ago
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening、SBERT、SmiCSE☆178Updated 3 years ago
- The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection☆288Updated 2 years ago
- 基于scrapy的层次优先队列方法爬取中文维基百科,并自动抽取结构和半结构数据☆156Updated 2 years ago
- 抽取式NLP模型(阅读理解模型,MRC)实现词义消歧(WSD)☆13Updated 3 years ago
- ☆29Updated 2 years ago
- 基于NER的文本纠错☆14Updated last year
- 利用huggingface实现文本分类☆58Updated 3 years ago
- A PyTorch implementation of a BiLSTM \ BERT \ Roberta (+ BiLSTM + CRF) model for Chinese Word Segmentation (中文分词) .☆212Updated 3 years ago
- 利用指针网络进行信息抽取,包含命名实体识别、关系抽取、事件抽取。☆129Updated 2 years ago
- All NLP you Need Here. 目前包含15个NLP demo的pytorch实现(大量代码借鉴于其他开源项目,原先是自己玩的,后来干脆也开源出来)☆289Updated this week
- 文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT☆75Updated 10 months ago
- 文本相似度(匹配)计算,提供Baseline、训练、推理、指标分析...代码包含TensorFlow/Pytorch双版本☆179Updated 3 years ago
- 中文无监督SimCSE Pytorch实现☆135Updated 4 years ago
- ☆279Updated 3 years ago
- 各大文本摘要模型-中文文本可运行的解决方案☆69Updated 2 years ago
- 2020 CCF大数据与计算智能大赛-非结构化商业文本信息中隐私信息识别-第7名方案☆73Updated 4 years ago
- SimCSE在中文上的复现,有监督+无监督☆279Updated 7 months ago
- 基于Bilstm + CRF的信息抽取模型☆36Updated 4 years ago
- SimCSE有监督与无监督实验复现☆149Updated last year
- 中文文本纠错相关的论文、比赛和工具。☆63Updated last week
- 基于pytorch+bert的中文文本分类☆89Updated 2 years ago