mlpc-ucsd / BERT_Convolutions

(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.

☆21

Alternatives and similar repositories for BERT_Convolutions

Users who are interested in BERT_Convolutions are comparing it to the repositories listed below.

romebert / RomeBERT
☆16 · Updated 4 years ago
cheneydon / efficient-bert
This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …
☆33 · Updated 2 years ago
CyndxAI / QKNorm
Code for the paper "Query-Key Normalization for Transformers"
☆45 · Updated 4 years ago
bojone / univae
A single-model, multi-scale VAE based on the Transformer
☆57 · Updated 4 years ago
lucidrains / coco-lm-pytorch
Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch
☆46 · Updated 4 years ago
thunlp / TR-BERT
Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"
☆47 · Updated 3 years ago
intersun / CoDIR
Code for the EMNLP 2020 paper CoDIR
☆41 · Updated 2 years ago
haorannlp / mix
Code for "Mixed Cross Entropy Loss for Neural Machine Translation"
☆20 · Updated 4 years ago
lioutasb / TaLKConvolutions
Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)
☆29 · Updated 4 years ago
MGheini / xattn-transfer-for-mt
Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra…
☆32 · Updated 3 years ago
yxuansu / Awesome_Diffusions
☆17 · Updated 2 years ago
nuaa-nlp / Multimodality
☆15 · Updated 3 years ago
lucidrains / distilled-retriever-pytorch
Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"
☆32 · Updated 4 years ago
iedwardwangi / MetaAdapter
☆22 · Updated 4 years ago
salesforce / FactLM
☆11 · Updated last month
JetRunner / PABEE
Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".
☆65 · Updated 4 years ago
QData / TextAttack-Search-Benchmark
EMNLP BlackBox NLP 2020: Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples
☆24 · Updated 4 years ago
IBM / PoWER-BERT
Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…
☆61 · Updated 3 months ago
CharizardAcademy / convtransformer
Code for the ACL 2020 paper "Character-Level Translation with Self-Attention"
☆31 · Updated 4 years ago
lzy1732008 / GaussionTransformer
Code for the paper "Gaussian Transformer: A Lightweight Approach for Natural Language Inference"
☆28 · Updated 5 years ago
LIJUNYI95 / SuperAdam
Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'
☆17 · Updated 3 years ago
nng555 / ssmba
☆62 · Updated 3 years ago
lifu-tu / ENGINE
ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation
☆25 · Updated 4 years ago
tatsu-lab / mlm_inductive_bias
Code Release for "On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies"
☆16 · Updated 4 years ago
acmi-lab / pretraining-with-nonsense
Pretraining summarization models using a corpus of nonsense
☆13 · Updated 3 years ago
yxuansu / TaCL
[NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning
☆93 · Updated 3 years ago
linzehui / Curriculum-Learning-PaperList-Materials
Curriculum Learning related papers and materials
☆54 · Updated 4 years ago
jungokasai / T2R
☆14 · Updated 2 years ago
LooperXX / ManagerTower
Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
☆11 · Updated 7 months ago
allenai / better-promptability
☆11 · Updated 2 years ago