XingLuxi / Cal-FLOPs-for-PLM
Calculating FLOPs of Pre-trained Models in NLP
☆18Updated 3 years ago
Alternatives and similar repositories for Cal-FLOPs-for-PLM:
Users that are interested in Cal-FLOPs-for-PLM are comparing it to the libraries listed below
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"☆28Updated last year
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆44Updated 2 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆27Updated 2 years ago
- [NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li…☆21Updated last year
- ☆43Updated 3 years ago
- domain adaptation in NLP☆52Updated 3 years ago
- Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts"☆43Updated 2 years ago
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"☆40Updated 2 years ago
- ☆39Updated last year
- The sources codes of the DR-BERT model and baselines☆37Updated 3 years ago
- ☆65Updated 8 months ago
- Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》☆57Updated 3 years ago
- The official repo for the paper "Teacher Forcing Recovers Reward Functions for Text Generation"☆30Updated last year
- ☆116Updated 2 years ago
- Must-read papers on improving efficiency for pre-trained language models.☆102Updated 2 years ago
- ACL'23: Unified Demonstration Retriever for In-Context Learning☆34Updated last year
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Updated 2 years ago
- ☆15Updated 3 years ago
- ☆59Updated last year
- ☆94Updated 4 months ago
- ☆53Updated 2 years ago
- DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization (ACL 2022)☆50Updated last year
- ACL 2023 Dual-Alignment Pre-training for Cross-lingual Sentence Embedding☆22Updated 5 months ago
- Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"☆24Updated 2 years ago
- ☆78Updated 2 years ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Updated 2 years ago
- Paradigm shift in natural language processing☆42Updated 2 years ago
- Group Meeting Record for Baobao Chang Group in Peking University☆25Updated 3 years ago
- ☆32Updated 3 years ago
- ☆35Updated last year