orangetwo / simple_distillLinks
Distilling Task-Specific Knowledge from BERT into Simple Neural Networks
☆17Updated 3 years ago
Alternatives and similar repositories for simple_distill
Users that are interested in simple_distill are comparing it to the libraries listed below
Sorting:
- SimCSE☆15Updated 3 years ago
- Label Mask for Multi-label Classification☆57Updated 4 years ago
- 使用Mask LM预训练任务来预训练Bert模型。训 练垂直领域语料的模型表征,提升下游任务的表现。☆47Updated 2 years ago
- SimCSE有监督与无监督实验复现☆149Updated last year
- WoBERT_pytorch☆41Updated 4 years ago
- ☆44Updated 2 years ago
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening、SBERT、SmiCSE☆178Updated 3 years ago
- Baselines for CCKS 2022 Task "Commonsense Knowledge Salience Evaluation"☆33Updated 2 years ago
- 苏神SPACE pytorch版本复现☆42Updated 3 years ago
- 从头训练MASK BERT☆138Updated 2 years ago
- 中文无监督SimCSE Pytorch实现☆135Updated 4 years ago
- A concise implementation of SimCSE☆16Updated 4 years ago
- 中文数据集下SimCSE+ESimCSE的实现☆193Updated 3 years ago
- 句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。☆100Updated 2 years ago
- Pytorch version of BERT-whitening☆309Updated 3 years ago
- 中文bigbird预训练模型☆96Updated 3 years ago
- TIANCHI-小布助手对话短文本语义匹配BERT baseline☆32Updated 4 years ago
- Source code for paper "LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching", AAAI2021.☆48Updated 4 years ago
- 2021搜狐校园文本匹配算法大赛Top2方案☆37Updated last year
- GAIIC2022商品标题实体识别Baseline,使用GlobalPointer实现,线上0.80349☆54Updated 3 years ago
- 文本分类baseline:BERT、半监督学 习UDA、对抗学习、数据增强☆104Updated 4 years ago
- Official implementation of AAAI-21 paper "Label Confusion Learning to Enhance Text Classification Models"☆118Updated 2 years ago
- ☆279Updated 3 years ago
- 全球人工智能技术创新大赛-赛道三-冠军方案☆239Updated 4 years ago
- 基于prompt的中文文本分类。☆55Updated 2 years ago
- The implementation our EMNLP 2021 paper "Enhanced Language Representation with Label Knowledge for Span Extraction".☆114Updated 2 years ago
- ☆41Updated 3 years ago
- Cascade bert+word vec and one layer FLAT, trained by adversarial FGM and Stochastic Weight Averaging☆23Updated 3 years ago
- ☆26Updated 2 years ago
- Code for our paper "Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation"☆162Updated 3 years ago