A pre-trained model with multi-exit transformer architecture.
☆56Dec 10, 2022Updated 3 years ago
Alternatives and similar repositories for ElasticBERT
Users that are interested in ElasticBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a repo holding codes for the paper: Code Completion with Neural Attention and Pointer Networks☆13Mar 21, 2018Updated 8 years ago
- ☆17Apr 7, 2025Updated last year
- Code for CascadeBERT, Findings of EMNLP 2021☆12Mar 30, 2022Updated 4 years ago
- Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》☆25Dec 15, 2021Updated 4 years ago
- A Handy Python wrapper for common NLP evaluation scripts like BLEU.☆14Feb 10, 2020Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A curated list of Early Exiting papers, benchmarks, and misc.☆119Oct 26, 2023Updated 2 years ago
- Information Extraction related tools and models☆10Mar 16, 2023Updated 3 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- This repo contains the code for Late Prompt Tuning.☆12Dec 22, 2025Updated 5 months ago
- 恋上算法,Java版算法面试题解大全集☆18May 17, 2020Updated 6 years ago
- ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Model…☆272Nov 8, 2022Updated 3 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Feb 1, 2026Updated 3 months ago
- ☆29Nov 9, 2025Updated 6 months ago
- [ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor☆10Jul 10, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [EMNLP 2022] RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees☆11Jul 15, 2023Updated 2 years ago
- [EMNLP 2023] Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts☆27Nov 4, 2023Updated 2 years ago
- ☆12Oct 5, 2022Updated 3 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Aug 9, 2021Updated 4 years ago
- [Findings of EMNLP 2022] Holistic Sentence Embeddings for Better Out-of-Distribution Detection☆18Jun 14, 2023Updated 2 years ago
- Albert for Conversational Question Answering Challenge☆22Jun 12, 2023Updated 2 years ago
- ☆16Dec 14, 2022Updated 3 years ago
- Use the famous language model, xlnet, to do sequence tagging/ sequence labelling/ named entity recognition(NER) / noun extraction;☆18Sep 30, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models☆342Feb 17, 2024Updated 2 years ago
- Notes of my introduction about NLP in Fudan University☆37Jul 6, 2021Updated 4 years ago
- Dump TheMovieDB☆28Oct 26, 2021Updated 4 years ago
- 基于arxiv的论文检索和阅读工具☆25Jan 4, 2022Updated 4 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆174Jun 6, 2021Updated 4 years ago
- ☆13Apr 27, 2022Updated 4 years ago
- NexAU (AU for Agent Universe), a general-purpose agent framework for building intelligent agents with tool capabilities.☆100Updated this week
- Repo to reproduce results for Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning☆25Apr 14, 2023Updated 3 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ASSIST: Towards Label Noise-Robust Dialogue State Tracking☆10Apr 11, 2022Updated 4 years ago
- Light local website for displaying performances from different chat models.☆86Nov 13, 2023Updated 2 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation☆152May 10, 2023Updated 3 years ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp…☆18Mar 30, 2022Updated 4 years ago
- Code for EMNLP'21 paper "Types of Out-of-Distribution Texts and How to Detect Them"☆26Nov 13, 2021Updated 4 years ago
- ☆147Jun 23, 2022Updated 3 years ago
- Code for Document-level Entity-based Extraction as Template Generation (EMNLP 2021)☆29Sep 23, 2021Updated 4 years ago