Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".
☆66Jun 19, 2021Updated 4 years ago
Alternatives and similar repositories for PABEE
Users that are interested in PABEE are comparing it to the libraries listed below
Sorting:
- DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference☆162Mar 25, 2022Updated 3 years ago
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering☆36Apr 20, 2021Updated 4 years ago
- Our implementation of Shampoo optimizer based on https://arxiv.org/pdf/1802.09568.pdf☆12Dec 23, 2019Updated 6 years ago
- Implementing activation functions from scratch in Tensorflow.☆36Feb 13, 2022Updated 4 years ago
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆315Jun 12, 2023Updated 2 years ago
- Compressing Representations for Self-Supervised Learning☆80Feb 18, 2021Updated 5 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- ☆19Jan 27, 2021Updated 5 years ago
- ☆20Nov 20, 2020Updated 5 years ago
- A collection of deep learning models (PyTorch implemtation)☆19Aug 30, 2024Updated last year
- Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…☆62Sep 17, 2025Updated 5 months ago
- Official PyTorch implementation for our ICCV 2019 paper - Fooling Network Interpretation in Image Classification☆24Nov 21, 2019Updated 6 years ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 3 years ago
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 5 years ago
- ☆10Jun 3, 2019Updated 6 years ago
- Spell and pronounce words with a neural network☆10Feb 13, 2017Updated 9 years ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms".☆12Jan 30, 2021Updated 5 years ago
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- Repo to reproduce results for Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning☆25Apr 14, 2023Updated 2 years ago
- ☆47Jan 11, 2021Updated 5 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆16Oct 11, 2021Updated 4 years ago
- It is deep recommendation model with attribute-level co-attention, which has been accepted as a short paper in SIGIR2020.☆10Aug 13, 2020Updated 5 years ago
- Convert 3D Human Pose to VMD file☆14Apr 21, 2019Updated 6 years ago
- Simple Structured Perceptron tagger in Python☆10May 30, 2017Updated 8 years ago
- ☆48Jun 8, 2020Updated 5 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Apr 28, 2020Updated 5 years ago
- Code for the UCL Statistical NLP course☆11Jan 19, 2015Updated 11 years ago
- Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Jan 12, 2022Updated 4 years ago
- ☆23Oct 27, 2019Updated 6 years ago
- Hybrid Approaches to Detect Comments Violating Macro Norms on Reddit☆28Jul 18, 2019Updated 6 years ago
- The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Ans…☆43Sep 7, 2020Updated 5 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆48May 25, 2022Updated 3 years ago
- Implementation of "Structured Multi-Hashing for Model Compression" (CVPR 2020)☆12Feb 18, 2021Updated 5 years ago
- Voice Conversion using Tacotron.☆11Dec 29, 2022Updated 3 years ago
- awesome unsupervised learning paper list☆12Jan 4, 2018Updated 8 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- Code & Data for our Paper "PATTERN-BASED CHINESE HYPERNYM-HYPONYM RELATION EXTRACTION METHOD"☆12Jan 29, 2020Updated 6 years ago
- Official PyTorch implementation of the paper : ProbAct: A Probabilistic Activation Function for Deep Neural Networks.☆13Jun 10, 2019Updated 6 years ago
- McKernel: A Library for Approximate Kernel Expansions in Log-linear Time.☆13Sep 3, 2022Updated 3 years ago