A PyTorch-based model pruning toolkit for pre-trained language models
☆389Aug 31, 2023Updated 2 years ago
Alternatives and similar repositories for TextPruner
Users that are interested in TextPruner are comparing it to the libraries listed below
Sorting:
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,697May 8, 2023Updated 2 years ago
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Jul 12, 2023Updated 2 years ago
- Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)☆705Jul 15, 2025Updated 8 months ago
- ☆313Apr 6, 2023Updated 2 years ago
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆198May 9, 2023Updated 2 years ago
- PERT: Pre-training BERT with Permuted Language Model☆367Jul 15, 2025Updated 8 months ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,439Jul 15, 2025Updated 8 months ago
- ExpMRC: Explainability Evaluation for Machine Reading Comprehension☆62Aug 30, 2023Updated 2 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Feb 1, 2026Updated last month
- a baseline to practice☆45Jul 6, 2021Updated 4 years ago
- The score code of FastBERT (ACL2020)☆609Oct 29, 2021Updated 4 years ago
- LERT: A Linguistically-motivated Pre-trained Language Model(语言学信息增强的预训练模型LERT)☆223Jul 15, 2025Updated 8 months ago
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,302May 16, 2023Updated 2 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,104May 9, 2024Updated last year
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆433Aug 17, 2022Updated 3 years ago
- [NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers☆192Feb 28, 2023Updated 3 years ago
- Pre-Trained Chinese XLNet(中文XLNet预训练模型)☆1,648Jul 15, 2025Updated 8 months ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,643Oct 16, 2024Updated last year
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,183Jul 15, 2025Updated 8 months ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,155Jan 22, 2024Updated 2 years ago
- 一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda☆1,878Mar 18, 2025Updated last year
- 高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型☆816Jul 8, 2020Updated 5 years ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆589Apr 24, 2023Updated 2 years ago
- a bert for retrieval and generation☆859Feb 26, 2021Updated 5 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆253Jan 7, 2024Updated 2 years ago
- ☆272Jul 26, 2024Updated last year
- Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting.☆119Nov 10, 2020Updated 5 years ago
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆8,286Oct 16, 2024Updated last year
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,938Jun 12, 2023Updated 2 years ago
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…☆2,800Dec 12, 2023Updated 2 years ago
- Collections of resources from Joint Laboratory of HIT and iFLYTEK Research (HFL)☆376Mar 9, 2023Updated 3 years ago
- Open Language Pre-trained Model Zoo☆1,005Nov 18, 2021Updated 4 years ago
- RoFormer升级版☆155Aug 11, 2022Updated 3 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,687Oct 23, 2024Updated last year
- ☆880May 24, 2024Updated last year
- [NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baich…☆1,111Oct 7, 2024Updated last year
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- Data augmentation for NLP☆4,652Jun 24, 2024Updated last year
- Conversational Word Embedding for Retrieval-based Dialog System (ACL2020)☆30Sep 2, 2020Updated 5 years ago