A pre-trained model with multi-exit transformer architecture.
☆56 · Dec 10, 2022 · Updated 3 years ago
Alternatives and similar repositories for ElasticBERT
Users that are interested in ElasticBERT are comparing it to the libraries listed below.
- Code for the paper: Code Completion with Neural Attention and Pointer Networks ☆13 · Mar 21, 2018 · Updated 8 years ago
- ☆17 · Apr 7, 2025 · Updated last year
- Source code for our AAAI'22 paper "From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression" ☆25 · Dec 15, 2021 · Updated 4 years ago
- A handy Python wrapper for common NLP evaluation scripts like BLEU. ☆14 · Feb 10, 2020 · Updated 6 years ago
- A curated list of Early Exiting papers, benchmarks, and misc. ☆119 · Oct 26, 2023 · Updated 2 years ago
- Information Extraction related tools and models ☆10 · Mar 16, 2023 · Updated 3 years ago
- Combining encoder-based language models ☆11 · Nov 11, 2021 · Updated 4 years ago
- This repo contains the code for Late Prompt Tuning. ☆12 · Dec 22, 2025 · Updated 3 months ago
- Love Algorithms, Java edition: a complete collection of algorithm interview problem solutions ☆18 · May 17, 2020 · Updated 5 years ago
- ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Model… ☆271 · Nov 8, 2022 · Updated 3 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ ☆85 · Feb 1, 2026 · Updated 2 months ago
- [ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor ☆10 · Jul 10, 2023 · Updated 2 years ago
- [EMNLP 2022] RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees ☆11 · Jul 15, 2023 · Updated 2 years ago
- A curated list of early exiting (LLM, CV, NLP, etc) ☆71 · Aug 21, 2024 · Updated last year
- [EMNLP 2023] Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts ☆27 · Nov 4, 2023 · Updated 2 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex… ☆15 · Nov 25, 2023 · Updated 2 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper ☆14 · Aug 9, 2021 · Updated 4 years ago
- [Findings of EMNLP 2022] Holistic Sentence Embeddings for Better Out-of-Distribution Detection ☆18 · Jun 14, 2023 · Updated 2 years ago
- Mamba support for transformer lens ☆19 · Sep 17, 2024 · Updated last year
- ALBERT for the Conversational Question Answering Challenge ☆22 · Jun 12, 2023 · Updated 2 years ago
- ☆16 · Dec 14, 2022 · Updated 3 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation ☆41 · Oct 19, 2022 · Updated 3 years ago
- Use the XLNet language model for sequence tagging / sequence labelling / named entity recognition (NER) / noun extraction ☆18 · Sep 30, 2019 · Updated 6 years ago
- NexAU (AU for Agent Universe), a general-purpose agent framework for building intelligent agents with tool capabilities. ☆55 · Apr 7, 2026 · Updated last week
- Notes from my introduction to NLP at Fudan University ☆37 · Jul 6, 2021 · Updated 4 years ago
- An arXiv-based paper search and reading tool ☆25 · Jan 4, 2022 · Updated 4 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆175 · Jun 6, 2021 · Updated 4 years ago
- ☆13 · Apr 27, 2022 · Updated 3 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers" ☆19 · Aug 17, 2022 · Updated 3 years ago
- ASSIST: Towards Label Noise-Robust Dialogue State Tracking ☆10 · Apr 11, 2022 · Updated 4 years ago
- Machine Reading Comprehension Leaderboard Summary ☆12 · Jan 4, 2021 · Updated 5 years ago
- Light local website for displaying performances from different chat models. ☆86 · Nov 13, 2023 · Updated 2 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation ☆152 · May 10, 2023 · Updated 2 years ago
- ☆12 · Mar 18, 2019 · Updated 7 years ago
- ☆147 · Jun 23, 2022 · Updated 3 years ago
- Code for Document-level Entity-based Extraction as Template Generation (EMNLP 2021) ☆29 · Sep 23, 2021 · Updated 4 years ago
- Convert pdf to pages of images ☆13 · Apr 18, 2020 · Updated 5 years ago
- [Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks ☆13 · Feb 26, 2023 · Updated 3 years ago
- Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding. ☆25 · Dec 3, 2020 · Updated 5 years ago