(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.
☆21Jul 13, 2022Updated 3 years ago
Alternatives and similar repositories for BERT_Convolutions
Users that are interested in BERT_Convolutions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The baseline model code for WMT 2021 Triangular MT☆13Apr 7, 2021Updated 5 years ago
- Word acquisition in neural language models (TACL 2022).☆21Jan 30, 2025Updated last year
- ☆16May 6, 2021Updated 5 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 5 years ago
- ☆15Dec 10, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Relative Positional Encoding for Transformers with Linear Complexity☆65Apr 1, 2022Updated 4 years ago
- This is the official code used for WAT 2017 Description Paper titled A Bag of Useful Tricks for Practical Neural Machine Translation: Emb…☆12Oct 24, 2017Updated 8 years ago
- pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsup…☆18Mar 23, 2020Updated 6 years ago
- A concise implementation of SimCSE☆16Aug 2, 2021Updated 4 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18May 10, 2023Updated 3 years ago
- ☆13Nov 7, 2021Updated 4 years ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Aug 28, 2023Updated 2 years ago
- Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022☆16Jun 22, 2022Updated 3 years ago
- A pytorch implementation of the ICCV2021 workshop paper SimDis: Simple Distillation Baselines for Improving Small Self-supervised Models☆14Jul 15, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆48May 25, 2022Updated 4 years ago
- Bag of MLP☆20May 31, 2021Updated 5 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆16Jan 5, 2021Updated 5 years ago
- ☆13Jun 7, 2022Updated 4 years ago
- Code for Depth-wise Separable Convolutions: Performance Investigations☆19Jan 28, 2020Updated 6 years ago
- ☆14Jan 4, 2021Updated 5 years ago
- ☆31Jun 29, 2022Updated 3 years ago
- simple translate☆12Mar 7, 2020Updated 6 years ago
- ☆49Nov 19, 2025Updated 6 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Webpage☆16Feb 16, 2024Updated 2 years ago
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- ☆10Nov 17, 2020Updated 5 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Nov 2, 2020Updated 5 years ago
- Python implementation for paper: Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples☆11Jun 12, 2018Updated 8 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆26Jun 2, 2021Updated 5 years ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- ☆11Jun 4, 2021Updated 5 years ago
- The project is about predicting sets (of classes) from images.☆23Aug 31, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ICLR 2021, Fair Mixup: Fairness via Interpolation☆59Sep 6, 2021Updated 4 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆94Jun 8, 2022Updated 4 years ago
- The SimSite3D Software tools are designed to quickly search a database of three dimensional structures, in Protein Data Bank format, with…☆11Oct 18, 2018Updated 7 years ago
- Implementation of semi-supervised learning using PyTorch Lightning☆14Jul 25, 2024Updated last year
- ☆19Feb 4, 2024Updated 2 years ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches☆20Mar 31, 2023Updated 3 years ago
- [Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks☆13Feb 26, 2023Updated 3 years ago