dropreg / R-DropLinks
☆879Updated last year
Alternatives and similar repositories for R-Drop
Users that are interested in R-Drop are comparing it to the libraries listed below
Sorting:
- RoFormer V1 & V2 pytorch☆502Updated 3 years ago
- UDA(Unsupervised Data Augmentation) implemented by pytorch☆278Updated 5 years ago
- Implementation of some unbalanced loss like focal_loss, dice_loss, DSC Loss, GHM Loss et.al☆265Updated 2 years ago
- Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer☆542Updated 3 years ago
- Pytorch version of BERT-whitening☆306Updated 3 years ago
- Rotary Transformer☆979Updated 3 years ago
- The score code of FastBERT (ACL2020)☆606Updated 3 years ago
- 简单的向量白化改善句向量质量☆483Updated 4 years ago
- TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)☆532Updated 4 years ago
- A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.☆934Updated 2 years ago
- Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021☆302Updated last year
- The repo contains the code of the ACL2020 paper `Dice Loss for Data-imbalanced NLP Tasks`☆274Updated 2 years ago
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆313Updated 2 years ago
- ☆494Updated last year
- Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)☆535Updated 3 years ago
- 简洁易用版TinyBert:基于Bert进行知识蒸馏的预训练语言模型☆266Updated 4 years ago
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723☆728Updated 2 years ago
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression☆203Updated 5 years ago
- A curated list of research papers in Sentence Reprsentation Learning and a sts leaderboard of sentence embeddings.☆316Updated last year
- ☆201Updated 2 years ago
- SimCSE在中文任务上的简单实验☆604Updated last year
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,662Updated 2 years ago
- 超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新☆414Updated 3 years ago
- CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation☆489Updated 2 years ago
- ☆251Updated 2 years ago
- QQ浏览器2021AI算法大赛赛道一 第1名 方案☆265Updated 3 years ago
- Prefix-Tuning: Optimizing Continuous Prompts for Generation☆934Updated last year
- Implement the paper "Self-Attention with Relative Position Representations"☆135Updated 4 years ago
- MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification☆358Updated 5 years ago
- 这里是改进了pytorch的DataParallel, 用来平衡第一个GPU的显存使用量☆232Updated 4 years ago