A pre-trained model with a multi-exit transformer architecture.
☆56 · Dec 10, 2022 · Updated 3 years ago
Alternatives and similar repositories for ElasticBERT
Users who are interested in ElasticBERT are comparing it to the repositories listed below.
- This repo holds the code for the paper: Code Completion with Neural Attention and Pointer Networks ☆13 · Mar 21, 2018 · Updated 7 years ago
- Combining encoder-based language models ☆11 · Nov 11, 2021 · Updated 4 years ago
- Source code for our AAAI'22 paper "From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression" ☆25 · Dec 15, 2021 · Updated 4 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ ☆85 · Feb 1, 2026 · Updated last month
- ASSIST: Towards Label Noise-Robust Dialogue State Tracking ☆10 · Apr 11, 2022 · Updated 3 years ago
- ☆16 · Dec 14, 2022 · Updated 3 years ago
- This repo contains the code for Late Prompt Tuning. ☆12 · Dec 22, 2025 · Updated 2 months ago
- ☆17 · Apr 7, 2025 · Updated 10 months ago
- A Handy Python wrapper for common NLP evaluation scripts like BLEU. ☆14 · Feb 10, 2020 · Updated 6 years ago
- [Findings of EMNLP 2022] Holistic Sentence Embeddings for Better Out-of-Distribution Detection ☆18 · Jun 14, 2023 · Updated 2 years ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp… ☆18 · Mar 30, 2022 · Updated 3 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex… ☆15 · Nov 25, 2023 · Updated 2 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers" ☆19 · Aug 17, 2022 · Updated 3 years ago
- Embedding Recycling for Language models ☆38 · Jul 11, 2023 · Updated 2 years ago
- ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Model… ☆271 · Nov 8, 2022 · Updated 3 years ago
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts? ☆16 · Aug 8, 2023 · Updated 2 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆175 · Jun 6, 2021 · Updated 4 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation ☆41 · Oct 19, 2022 · Updated 3 years ago
- Albert for Conversational Question Answering Challenge ☆22 · Jun 12, 2023 · Updated 2 years ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists" ☆50 · May 31, 2022 · Updated 3 years ago
- NexAU (AU for Agent Universe), a general-purpose agent framework for building intelligent agents with tool capabilities. ☆49 · Updated this week
- This is a helper for PyTorch-BigGraph ☆22 · Apr 7, 2020 · Updated 5 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021) ☆48 · Aug 2, 2021 · Updated 4 years ago
- A field-tested Hebrew tokenizer for dirty texts (ben-yehuda project, bible, cc100, mc4, opensubs, oscar, twitter) focused on multi-word e… ☆23 · Aug 13, 2022 · Updated 3 years ago
- ☆147 · Jun 23, 2022 · Updated 3 years ago
- A Transformer Framework Based Couplet Task ☆24 · Oct 29, 2023 · Updated 2 years ago
- This is a pre-trained LSTM model that helps segment unpunctuated historical Chinese texts. ☆28 · Nov 19, 2021 · Updated 4 years ago
- Biomedical and Clinical BERT for Portuguese Language ☆62 · Dec 12, 2024 · Updated last year
- https://pypi.org/project/intent-suggestions/ ☆10 · Sep 6, 2022 · Updated 3 years ago
- An arXiv-based paper search and reading tool ☆25 · Jan 4, 2022 · Updated 4 years ago
- Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding. ☆25 · Dec 3, 2020 · Updated 5 years ago
- [EMNLP 2023] Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts ☆27 · Nov 4, 2023 · Updated 2 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆28 · Oct 3, 2021 · Updated 4 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 · May 25, 2024 · Updated last year
- 🚀🤗 A collection of templates for Hugging Face Spaces ☆35 · Oct 9, 2023 · Updated 2 years ago
- Website for Learning from "Big Code" ☆30 · Jun 19, 2021 · Updated 4 years ago
- Code for ACL 2021 main conference paper "Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances". ☆94 · Jun 30, 2021 · Updated 4 years ago
- CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models ☆32 · Jul 2, 2023 · Updated 2 years ago
- Chinese text error correction based on seq2edit (Gector). ☆29 · Nov 15, 2022 · Updated 3 years ago