(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.
☆21Jul 13, 2022Updated 3 years ago
Alternatives and similar repositories for BERT_Convolutions
Users that are interested in BERT_Convolutions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The baseline model code for WMT 2021 Triangular MT☆13Apr 7, 2021Updated 4 years ago
- Goldfish: Monolingual language models for 350 languages.☆24Mar 4, 2026Updated 3 weeks ago
- Word acquisition in neural language models (TACL 2022).☆20Jan 30, 2025Updated last year
- ☆16May 6, 2021Updated 4 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- Relative Positional Encoding for Transformers with Linear Complexity☆65Apr 1, 2022Updated 3 years ago
- A concise implementation of SimCSE☆16Aug 2, 2021Updated 4 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18May 10, 2023Updated 2 years ago
- ☆13Nov 7, 2021Updated 4 years ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Aug 28, 2023Updated 2 years ago
- Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022☆16Jun 22, 2022Updated 3 years ago
- A pytorch implementation of the ICCV2021 workshop paper SimDis: Simple Distillation Baselines for Improving Small Self-supervised Models☆14Jul 15, 2021Updated 4 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆48May 25, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Bag of MLP☆20May 31, 2021Updated 4 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆17Jan 5, 2021Updated 5 years ago
- ☆13Jun 7, 2022Updated 3 years ago
- Reproduction package of the paper "DeepCVA: Automated Commit-level Vulnerability Assessment with Deep Multi-task Learning" in Automated S…☆20Feb 25, 2023Updated 3 years ago
- Code for Depth-wise Separable Convolutions: Performance Investigations☆19Jan 28, 2020Updated 6 years ago
- ☆14Jan 4, 2021Updated 5 years ago
- pytorch版simcse无监督语义相似模型☆22May 13, 2021Updated 4 years ago
- simple translate☆12Mar 7, 2020Updated 6 years ago
- ☆48Nov 19, 2025Updated 4 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- ☆10Nov 17, 2020Updated 5 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Nov 2, 2020Updated 5 years ago
- Python implementation for paper: Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples☆11Jun 12, 2018Updated 7 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆26Jun 2, 2021Updated 4 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Nov 29, 2021Updated 4 years ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- ☆11Jun 4, 2021Updated 4 years ago
- decontamination☆27Mar 4, 2026Updated 3 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ICLR 2021, Fair Mixup: Fairness via Interpolation☆59Sep 6, 2021Updated 4 years ago
- The project is about predicting sets (of classes) from images.☆23Aug 31, 2021Updated 4 years ago
- Code Release for "On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies"☆16Apr 13, 2021Updated 4 years ago
- Implementation of semi-supervised learning using PyTorch Lightning☆14Jul 25, 2024Updated last year
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Mar 9, 2022Updated 4 years ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches☆20Mar 31, 2023Updated 2 years ago
- Implementation of a Quantized Transformer Model☆19Mar 20, 2019Updated 7 years ago