A pre-trained model with multi-exit transformer architecture.
☆56Dec 10, 2022Updated 3 years ago
Alternatives and similar repositories for ElasticBERT
Users that are interested in ElasticBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a repo holding codes for the paper: Code Completion with Neural Attention and Pointer Networks☆13Mar 21, 2018Updated 8 years ago
- Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》☆25Dec 15, 2021Updated 4 years ago
- A Handy Python wrapper for common NLP evaluation scripts like BLEU.☆14Feb 10, 2020Updated 6 years ago
- Information Extraction related tools and models☆10Mar 16, 2023Updated 3 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repo contains the code for Late Prompt Tuning.☆12Dec 22, 2025Updated 5 months ago
- 恋上算法,Java版算法面试题解大全集☆18May 17, 2020Updated 6 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Feb 1, 2026Updated 4 months ago
- ☆30Nov 9, 2025Updated 7 months ago
- [ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor☆11Jul 10, 2023Updated 2 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Aug 9, 2021Updated 4 years ago
- [Findings of EMNLP 2022] Holistic Sentence Embeddings for Better Out-of-Distribution Detection☆18Jun 14, 2023Updated 3 years ago
- Mamba support for transformer lens☆20Sep 17, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Albert for Conversational Question Answering Challenge☆21Jun 12, 2023Updated 3 years ago
- ☆16Dec 14, 2022Updated 3 years ago
- Use the famous language model, xlnet, to do sequence tagging/ sequence labelling/ named entity recognition(NER) / noun extraction;☆18Sep 30, 2019Updated 6 years ago
- ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models☆343Feb 17, 2024Updated 2 years ago
- Notes of my introduction about NLP in Fudan University☆37Jul 6, 2021Updated 4 years ago
- Repo to reproduce results for Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning☆26Apr 14, 2023Updated 3 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- Machine Reading Comprehension Leadboard Summary☆12Jan 4, 2021Updated 5 years ago
- Light local website for displaying performances from different chat models.☆86Nov 13, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation☆152May 10, 2023Updated 3 years ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp…☆18Mar 30, 2022Updated 4 years ago
- ☆12Mar 18, 2019Updated 7 years ago
- NexAU (AU for Agent Universe), a general-purpose agent framework for building intelligent agents with tool capabilities.☆136Jun 11, 2026Updated last week
- ☆12Mar 22, 2025Updated last year
- ☆147Jun 23, 2022Updated 3 years ago
- [Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks☆13Feb 26, 2023Updated 3 years ago
- Paradigm shift in natural language processing☆42May 29, 2022Updated 4 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Sep 26, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆18Dec 19, 2024Updated last year
- [Findings of ACL 2023] Communication Efficient Federated Learning for Multilingual Machine Translation with Adapter☆12Sep 4, 2023Updated 2 years ago
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?☆16Aug 8, 2023Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Aug 2, 2021Updated 4 years ago
- A Java annotation processor used for automatically generating better builder codes.☆11Feb 16, 2021Updated 5 years ago
- CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation☆494Dec 30, 2022Updated 3 years ago
- [Under Progress] Code & Data for the AAAI 2020 Paper "Likelihood Ratios and Generative Classifiers For Unsupervised OOD Detection In Task…☆10Jul 25, 2024Updated last year