A PyTorch-based model pruning toolkit for pre-trained language models
☆390 · Aug 31, 2023 · Updated 2 years ago
Alternatives and similar repositories for TextPruner
Users who are interested in TextPruner are comparing it to the libraries listed below.
- A PyTorch-based knowledge distillation toolkit for natural language processing ☆1,697 · May 8, 2023 · Updated 2 years ago
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models ☆19 · Jul 12, 2023 · Updated 2 years ago
- Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT) ☆709 · Apr 19, 2026 · Updated last week
- ☆310 · Apr 6, 2023 · Updated 3 years ago
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408 ☆198 · May 9, 2023 · Updated 2 years ago
- PERT: Pre-training BERT with Permuted Language Model ☆370 · Apr 19, 2026 · Updated last week
- Pre-trained Chinese ELECTRA (中文ELECTRA预训练模型) ☆1,437 · Apr 19, 2026 · Updated last week
- ExpMRC: Explainability Evaluation for Machine Reading Comprehension ☆62 · Aug 30, 2023 · Updated 2 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ ☆85 · Feb 1, 2026 · Updated 3 months ago
- a baseline to practice ☆45 · Jul 6, 2021 · Updated 4 years ago
- The source code of FastBERT (ACL2020) ☆608 · Oct 29, 2021 · Updated 4 years ago
- LERT: A Linguistically-motivated Pre-trained Language Model (语言学信息增强的预训练模型LERT) ☆224 · Apr 19, 2026 · Updated last week
- LightSeq: A High Performance Library for Sequence Processing and Generation ☆3,300 · May 16, 2023 · Updated 2 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo ☆3,105 · May 9, 2024 · Updated last year
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p… ☆435 · Aug 17, 2022 · Updated 3 years ago
- [NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers ☆193 · Feb 28, 2023 · Updated 3 years ago
- Pre-Trained Chinese XLNet (中文XLNet预训练模型) ☆1,648 · Apr 19, 2026 · Updated last week
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821 ☆3,649 · Oct 16, 2024 · Updated last year
- Pre-Training with Whole Word Masking for Chinese BERT (中文BERT-wwm系列模型) ☆10,202 · Apr 19, 2026 · Updated last week
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab. ☆3,160 · Jan 22, 2024 · Updated 2 years ago
- One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda ☆1,881 · Mar 18, 2025 · Updated last year
- A collection of high-quality Chinese pre-trained models: state-of-the-art large models, fastest small models, and dedicated similarity models ☆816 · Jul 8, 2020 · Updated 5 years ago
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x. ☆588 · Apr 24, 2023 · Updated 3 years ago
- a bert for retrieval and generation ☆860 · Feb 26, 2021 · Updated 5 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework ☆253 · Jan 7, 2024 · Updated 2 years ago
- ☆271 · Jul 26, 2024 · Updated last year
- Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting. ☆119 · Nov 10, 2020 · Updated 5 years ago
- BELLE: Be Everyone's Large Language model Engine (开源中文对话大模型) ☆8,284 · Oct 16, 2024 · Updated last year
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models ☆1,941 · Jun 12, 2023 · Updated 2 years ago
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin… ☆2,800 · Dec 12, 2023 · Updated 2 years ago
- Collections of resources from Joint Laboratory of HIT and iFLYTEK Research (HFL) ☆376 · Mar 9, 2023 · Updated 3 years ago
- Open Language Pre-trained Model Zoo ☆1,006 · Nov 18, 2021 · Updated 4 years ago
- Upgraded version of RoFormer ☆155 · Aug 11, 2022 · Updated 3 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀 ☆1,687 · Oct 23, 2024 · Updated last year
- ☆881 · May 24, 2024 · Updated last year
- [NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baich… ☆1,123 · Oct 7, 2024 · Updated last year
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost ☆42 · Nov 15, 2023 · Updated 2 years ago
- Data augmentation for NLP ☆4,656 · Jun 24, 2024 · Updated last year
- Conversational Word Embedding for Retrieval-based Dialog System (ACL2020) ☆30 · Sep 2, 2020 · Updated 5 years ago