Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".
☆66Jun 19, 2021Updated 4 years ago
Alternatives and similar repositories for PABEE
Users that are interested in PABEE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference☆161Mar 25, 2022Updated 4 years ago
- [ICLR 2021: Spotlight] Source code for the paper "A Panda? No, It's a Sloth: Slowdown Attacks on Adaptive Multi-Exit Neural Network Infer…☆14Feb 16, 2022Updated 4 years ago
- An implementation of the paper 'Using Deep Networks for Scientific Discovery in Physiological Signals'☆12Aug 24, 2020Updated 5 years ago
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering☆36Apr 20, 2021Updated 5 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…☆62Sep 17, 2025Updated 8 months ago
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆315Jun 12, 2023Updated 3 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆48May 25, 2022Updated 4 years ago
- ☆48Jun 8, 2020Updated 6 years ago
- source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.☆57Apr 6, 2021Updated 5 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- Repo to reproduce results for Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning☆25Apr 14, 2023Updated 3 years ago
- ☆15Oct 10, 2021Updated 4 years ago
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA☆16May 11, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆15Dec 19, 2023Updated 2 years ago
- ☆23Oct 27, 2019Updated 6 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 4 years ago
- [EMNLP 2022] RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees☆11Jul 15, 2023Updated 2 years ago
- A Constrained Text Generation Challenge Towards Generative Commonsense Reasoning☆142Jan 5, 2024Updated 2 years ago
- Notes of my introduction about NLP in Fudan University☆37Jul 6, 2021Updated 4 years ago
- ☆21May 24, 2024Updated 2 years ago
- Source Code for ICML 2019 Paper "Shallow-Deep Networks: Understanding and Mitigating Network Overthinking"☆37Dec 22, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆18Apr 19, 2024Updated 2 years ago
- Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT☆207Sep 22, 2020Updated 5 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆17Oct 11, 2021Updated 4 years ago
- ☆19Jan 27, 2021Updated 5 years ago
- ☆12Sep 26, 2019Updated 6 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆471Jun 22, 2022Updated 3 years ago
- Compressing Representations for Self-Supervised Learning☆80Feb 18, 2021Updated 5 years ago
- McKernel: A Library for Approximate Kernel Expansions in Log-linear Time.☆13Sep 3, 2022Updated 3 years ago
- Spell and pronounce words with a neural network☆10Feb 13, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for ICLR 2021 Paper, "Anytime Sampling for Autoregressive Models via Ordered Autoencoding"☆26Jun 6, 2023Updated 3 years ago
- This repository contains some of the codes for paper "Combining DNN partitioning and early exit" published in EdgeSys '22: Proceedings of…☆12Jul 20, 2023Updated 2 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Apr 28, 2020Updated 6 years ago
- Code for Episodic Memory Reader (EMR) https://arxiv.org/abs/1903.06164☆15Nov 16, 2022Updated 3 years ago
- Our implementation of Shampoo optimizer based on https://arxiv.org/pdf/1802.09568.pdf☆13Dec 23, 2019Updated 6 years ago
- Code for "Understanding and Improving Layer Normalization"☆46Dec 8, 2019Updated 6 years ago
- Release of the ConditionalQA dataset☆21Nov 2, 2021Updated 4 years ago