airaria / TextPruner
A PyTorch-based model pruning toolkit for pre-trained language models
☆386Updated last year
Alternatives and similar repositories for TextPruner:
Users that are interested in TextPruner are comparing it to the libraries listed below
- sentence-transformers to onnx 让sbert模型推理效率更快☆164Updated 2 years ago
- A framework for cleaning Chinese dialog data☆267Updated 3 years ago
- text embedding☆145Updated last year
- ☆306Updated last year
- CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation☆487Updated 2 years ago
- LERT: A Linguistically-motivated Pre-trained Language Model(语言学信息增强的预训练模型LERT)☆203Updated last year
- A curated list of research papers in Sentence Reprsentation Learning and a sts leaderboard of sentence embeddings.☆313Updated last year
- ☆265Updated 7 months ago
- PromptBERT: Improving BERT Sentence Embeddings with Prompts☆333Updated last year
- Collections of resources from Joint Laboratory of HIT and iFLYTEK Research (HFL)☆368Updated last year
- A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.☆927Updated 2 years ago
- ☆278Updated 10 months ago
- Code used for sourcing and cleaning the BigScience ROOTS corpus☆308Updated last year
- 比Sentence-BERT更有效的句向量方案☆366Updated 2 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆114Updated last month
- The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"☆228Updated 2 years ago
- 简单的向量白化改善句向量质量☆484Updated 3 years ago
- PERT: Pre-training BERT with Permuted Language Model☆357Updated last year
- SimBERT升级版(SimBERTv2)!☆441Updated 2 years ago
- CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)☆229Updated last year
- Codebase for RetroMAE and beyond.☆251Updated 8 months ago
- A Multi-modal Model Chinese Spell Checker Released on ACL2021.☆155Updated last year
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆259Updated last year
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆469Updated 11 months ago
- 3000000+语义理解与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型☆292Updated 2 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆307Updated last year
- 中文图书语料MD5链接☆216Updated last year
- 使用sentencepiece中BPE训练中文词表,并在transformers中进行使用。☆116Updated last year
- ☆456Updated 8 months ago
- Tracking the progress in NLG for task-oriented dialogue system (resources, code, and new frontiers etc.)☆134Updated 3 years ago