DengBoCong / text-similarityLinks
文本相似度(匹配)计算,提供Baseline、训练、推理、指标分析...代码包含TensorFlow/Pytorch双版本
☆177Updated 3 years ago
Alternatives and similar repositories for text-similarity
Users that are interested in text-similarity are comparing it to the libraries listed below
Sorting:
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening、SBERT、SmiCSE☆176Updated 3 years ago
- 中文问题句子相似度计算比赛及方案汇总☆300Updated 4 years ago
- experiments of some semantic matching models and comparison of experimental results.☆161Updated last year
- SimCSE在中文上的复现,有监督+无监督☆277Updated 3 months ago
- 基于词汇信息融合的中文NER模型☆168Updated 3 years ago
- Bert预训练模型fine-tune计算文本相似度☆104Updated 2 years ago
- 本人项目进行中搜集的数据集,包含原始数据和经过处理后的数据,项目持续更新。☆114Updated 4 years ago
- 利用指针网络进行信息抽取,包含命名实体识别、关系抽取、事件抽取。☆126Updated 2 years ago
- 基于pytorch_bert的中文多标签分类☆91Updated 3 years ago
- 微调预训练语言模型,解决多标签分类任务(可加载BERT、Roberta、Bert-wwm以及albert等知名开源tf格式的模型)☆141Updated 4 years ago
- 超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题☆130Updated 3 years ago
- multi-label-classification-4-event-type☆136Updated 2 years ago
- 中文无监督SimCSE Pytorch实现☆134Updated 3 years ago
- bert pytorch模型微调用于的多标签文本分类☆133Updated 5 years ago
- implementation several deep text match (text similarly) models for keras . cdssm, arc-ii,match_pyramid, mvlstm ,esim, drcn ,bimpm, bert, …☆290Updated 4 years ago
- ☆278Updated 3 years ago
- 文本分类baseline:BERT、半监督学习UDA、对抗学习、数据增强☆102Updated 4 years ago
- 中文文本分类、序列标注工具包(pytorch),支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Chinese text classification and sequence labeling toolk…☆346Updated 10 months ago
- 文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT☆73Updated 6 months ago
- Modify Chinese text, modified on LaserTagger Model. 文本复述,基于lasertagger做中文文本数据增强。☆320Updated last year
- 天池 疫情相似句对判定大赛 线上第一名方案☆433Updated 4 years ago
- 基于GlobalPointer的实体/关系/事件抽取☆147Updated 3 years ago
- 中文NER的那些事儿☆318Updated last year
- 端到端的长本文摘要模型(法研杯2020司法摘要赛道)☆398Updated last year
- Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained La…☆429Updated 5 years ago
- 以词为基本单位的中文BERT☆467Updated 3 years ago
- pytorch中文语言模型预训练☆389Updated 4 years ago
- NEZHA: Neural Contextualized Representation for Chinese Language Understanding☆261Updated 3 years ago
- CoSENT、STS、SentenceBERT☆168Updated 3 months ago
- ☆87Updated 3 years ago