hgliyuhao / cluster
Clustering text with Bert
☆56Updated 4 years ago
Alternatives and similar repositories for cluster
Users that are interested in cluster are comparing it to the libraries listed below
Sorting:
- 微调预训练语言模型,解决多标签分类任务(可加载BERT、Roberta、Bert-wwm以及albert等知名开源tf格式的模型)☆141Updated 4 years ago
- pytorch implementation of multi-label text classification, includes kinds of models and pretrained. Especially for Chinese preprocessing.☆75Updated 5 years ago
- bert pytorch模型微调用于的多标签文本分类☆133Updated 5 years ago
- multi-label,classifier,text classification,多标签文本分类,文本分类,BERT,ALBERT,multi-label-classification,seq2seq,attention,beam search☆32Updated 2 years ago
- 中文无监督SimCSE Pytorch实现☆134Updated 3 years ago
- 文本分类baseline:BERT、半监督学习UDA、对抗学习、数据增强☆102Updated 4 years ago
- Bert预训练模型fine-tune计算文本相似度☆104Updated 2 years ago
- multi-label-classification-4-event-type☆136Updated 2 years ago
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening、SBERT、SmiCSE☆176Updated 3 years ago
- 本NER项目包含多个中文数据集,模型采用BiLSTM+CRF、BERT+Softmax、BERT+Cascade、BERT+WOL等,最后用TFServing进行模型部署,线上推理和线下推理。☆80Updated 3 years ago
- NER任务SOTA模型BERT_MRC☆61Updated last year
- Pytorch进行长文本分类。这里用到的网络有:FastText、TextCNN、TextRNN、TextRCNN、Transformer☆48Updated 4 years ago
- ☆90Updated 4 years ago
- lic2020关系抽取比赛,使用Pytorch实现苏神的模型。☆101Updated 4 years ago
- baidu aistudio event extraction competition☆224Updated 2 years ago
- 本项目是NLP领域一些任务的基准模型实现,包括文本分类、命名实体识别、实体关系抽取、NL2SQL、CKBQA以及BERT的各种下游任务应用。☆47Updated 4 years ago
- ☆87Updated 3 years ago
- ☆60Updated 5 years ago
- Use deep models including BiLSTM, ABCNN, ESIM, RE2, BERT, etc. and evaluate on 5 Chinese NLP datasets: LCQMC, BQ Corpus, ChineseSTS, OCN…☆76Updated 3 years ago
- 中文文本句对相似度匹配-ATEC数据集☆22Updated 4 years ago
- some baselines for lic2020 (http://lic2020.cipsc.org.cn/)☆218Updated 5 years ago
- 本项目采用Keras和Keras-bert实现文本多标签分类任务,对BERT进行微调。☆67Updated 4 years ago
- 中文数据集下SimCSE+ESimCSE的实现☆192Updated 2 years ago
- 中国中文信息学会社会媒体处理专业委员会举办的2019届中文人机对话之自然语言理解竞赛☆74Updated 5 years ago
- NLP Predtrained Embeddings, Models and Datasets Collections(NLP_PEMDC). The collection will keep updating.☆64Updated 5 years ago
- 百度2020语言与智能技术竞赛:事件抽取赛道方案代码☆53Updated 4 years ago
- Pattern-Exploiting Training在中文上的简单实验☆171Updated 4 years ago
- 多标签文本分类☆30Updated 3 years ago
- 新闻文本自动摘要, 以Textrank 为基础,融入 标题特征,单句位置特征,重要实体特征,线索词特征,做句子的综合权重计算,并使用MMR算法,兼顾自动摘要的主题相关性和摘要多样性。☆26Updated 3 years ago
- 机器检索阅读联合学习,莱斯杯:全国第二届“军事智能机器阅读”挑战赛 rank6 方案☆127Updated 4 years ago