airaria / TextPruner
A PyTorch-based model pruning toolkit for pre-trained language models
☆388Updated last year
Alternatives and similar repositories for TextPruner
Users who are interested in TextPruner are comparing it to the libraries listed below
- A framework for cleaning Chinese dialog data☆274Updated 4 years ago
- Multi-language Enhanced LLaMA☆302Updated 2 years ago
- text embedding☆147Updated last year
- Code used for sourcing and cleaning the BigScience ROOTS corpus☆314Updated 2 years ago
- PromptBERT: Improving BERT Sentence Embeddings with Prompts☆339Updated last year
- Introduction to CPM☆166Updated 3 years ago
- LERT: A Linguistically-motivated Pre-trained Language Model☆217Updated last month
- ☆310Updated 2 years ago
- Mengzi Pretrained Models☆537Updated 2 years ago
- sentence-transformers to ONNX: faster inference for SBERT models☆165Updated 3 years ago
- This is the official repo for paper "CSDS: A Fine-grained Chinese Dataset for Customer Service Dialogue Summarization", accepted by EMNLP…☆96Updated 2 years ago
- CCL 2022 evaluation on text error correction for learners of Chinese☆142Updated 2 years ago
- Zero-shot NLU & NLG based on the mengzi-t5-base-mt pretrained model☆75Updated 2 years ago
- The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"☆229Updated 2 years ago
- ☆219Updated 2 years ago
- CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation☆490Updated 2 years ago
- ParaGen is a PyTorch deep learning framework for parallel sequence generation.☆186Updated 2 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model☆118Updated 8 months ago
- A simple implementation of fine-tuning chatglm-6b with LoRA from the peft library☆84Updated 2 years ago
- ☆281Updated last year
- MD5 links for a Chinese book corpus☆216Updated last year
- ☆460Updated last year
- An upgraded version of RoFormer☆154Updated 3 years ago
- A sentence embedding approach more effective than Sentence-BERT☆375Updated 2 years ago
- A large-scale Chinese natural language inference and semantic similarity dataset☆432Updated 5 years ago
- Implementation of Chinese ChatGPT☆287Updated last year
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆116Updated 2 months ago
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆224Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Updated 2 years ago
- Chinese machine reading comprehension datasets☆104Updated 4 years ago