Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022
☆16Jun 22, 2022Updated 3 years ago
Alternatives and similar repositories for SkipBERT
Users that are interested in SkipBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The jiant toolkit for general-purpose text understanding models☆22Oct 8, 2020Updated 5 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 5 years ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Single-Life Reinforcement Learning☆14Dec 17, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆14Jun 25, 2022Updated 3 years ago
- Implementation and evaluation of Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic), an algorithm for multi-agent reinforcement l…☆10May 5, 2022Updated 4 years ago
- ☆13Feb 5, 2025Updated last year
- [Findings of EMNLP 2022] Holistic Sentence Embeddings for Better Out-of-Distribution Detection☆18Jun 14, 2023Updated 3 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation☆41Oct 19, 2022Updated 3 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- ☆12Nov 16, 2020Updated 5 years ago
- Thinking is hard - automate it☆18Aug 24, 2022Updated 3 years ago
- ☆14Jul 21, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆15Jul 24, 2022Updated 3 years ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆75Jul 13, 2025Updated 11 months ago
- ECIR'21: Simplified TinyBERT: Knowledge Distillation for Document Retrieval☆17Apr 25, 2021Updated 5 years ago
- code for Explicit Sparse Transformer☆60Jul 21, 2023Updated 2 years ago
- Implementation of the research paper Consistent Representation Learning for Continual Relation Extraction (Findings of ACL 2022)☆26May 16, 2022Updated 4 years ago
- Text Content Manipulation☆45Nov 16, 2020Updated 5 years ago
- LibOCXL is an access library which allows the user to implement a userspace driver for an OpenCAPI accelerator.☆13Jul 1, 2024Updated last year
- Self-supervised algorithm for learning representations from ego-centric video data. Code is tested on EPIC-Kitchens-100 and Ego4D in PyTo…☆13Oct 23, 2022Updated 3 years ago
- This repository contains the implementation of the paper -- KNOT: Knowledge Distillation using Optimal Transport for Solving NLP Tasks☆15Sep 15, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation for <Understanding Robust Overftting of Adversarial Training and Beyond> in ICML'22.☆13Jul 1, 2022Updated 3 years ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code☆17Aug 23, 2024Updated last year
- Code for the paper "SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness" (NeurIPS 2021)☆21Sep 27, 2022Updated 3 years ago
- 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.☆17Jun 5, 2025Updated last year
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 3 years ago
- Effective Attention Sheds Light On Interpretability - Findings of ACL2021☆11May 16, 2021Updated 5 years ago
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models☆19Apr 1, 2025Updated last year
- Official Code for Scaling Adversarial Training to Large Perturbation Bounds (ECCV-2022)☆11Nov 25, 2022Updated 3 years ago
- A simple editor for discrete orthogonal geodesic nets☆18Mar 7, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Take neural networks as APIs for human-like AI.☆20Dec 4, 2019Updated 6 years ago
- [ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval☆34May 16, 2023Updated 3 years ago
- Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning☆15May 26, 2022Updated 4 years ago
- Code for ACL 2018 paper "Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference".☆17Aug 5, 2018Updated 7 years ago
- Collaborative Execution Strategies for Heterogeneous CPU-FPGA Architectures☆11Apr 23, 2019Updated 7 years ago
- ☆14Feb 27, 2021Updated 5 years ago
- Code for ACL22 findings paper: Inverse is Better! Fast and Accurate Prompt for Slot Tagging☆27Jul 13, 2022Updated 3 years ago