this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large
☆67Mar 30, 2020Updated 6 years ago
Alternatives and similar repositories for roberta-wwm-base-distill
Users that are interested in roberta-wwm-base-distill are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆61Nov 14, 2019Updated 6 years ago
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆96Dec 5, 2019Updated 6 years ago
- BERT distillation(基于BERT的蒸馏实验 )☆316Jul 30, 2020Updated 5 years ago
- 多语言降噪预训练模型MBart的中文生成任务☆11May 27, 2021Updated 4 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,783Jul 22, 2024Updated last year
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆192Dec 15, 2021Updated 4 years ago
- 基于20W金融资讯训练得到的词向量☆26Jan 19, 2018Updated 8 years ago
- Open Language Pre-trained Model Zoo☆1,006Nov 18, 2021Updated 4 years ago
- Feel free to fine tune large BERT models with Multi-GPU and FP16 support.☆192Mar 9, 2020Updated 6 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,696May 8, 2023Updated 2 years ago
- ☆278Dec 8, 2020Updated 5 years ago
- Implementation of the cw2vec model☆29Jul 20, 2018Updated 7 years ago
- A simple Keras implementation of Paper "Text Matching as Image Recognition"☆28Jun 28, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Python toolkit for Chinese Language Understanding Evaluation benchmark.☆15May 22, 2023Updated 2 years ago
- Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark☆133May 22, 2023Updated 2 years ago
- Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer☆541Dec 10, 2021Updated 4 years ago
- 中文预训练XLNet模型: Pre-Trained Chinese XLNet_Large☆229Sep 13, 2019Updated 6 years ago
- roBERTa training for SQuAD☆50Mar 2, 2020Updated 6 years ago
- ☆47Jan 21, 2021Updated 5 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE☆146Jun 5, 2019Updated 6 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆121May 22, 2023Updated 2 years ago
- datagrand 2019 information extraction competition rank9☆130Dec 29, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3☆184Jun 4, 2020Updated 5 years ago
- ACL 2019论文复现:Improving Multi-turn Dialogue Modelling with Utterance ReWriter☆138Jan 23, 2020Updated 6 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,157Jan 22, 2024Updated 2 years ago
- Knowledge Distillation from BERT☆54Jan 7, 2019Updated 7 years ago
- 24*2个预训练的小型BERT模型,NLPer炼丹利器☆51Apr 12, 2020Updated 6 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,984Nov 21, 2022Updated 3 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 4 years ago
- Ensemble of 10 modified BERT Base models for prediction of best answers for queries on search engines.☆16Jan 1, 2019Updated 7 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- reimplementing Neural Summarization by Extracting Sentences and Words☆16Dec 12, 2018Updated 7 years ago
- The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Ans…☆43Sep 7, 2020Updated 5 years ago
- NLU: domain-intent-slot; text2SQL☆74Apr 18, 2020Updated 6 years ago
- Keras implementation of CoVe☆50Sep 17, 2018Updated 7 years ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated last year
- Cross-Lingual Machine Reading Comprehension (EMNLP 2019)☆67Nov 6, 2019Updated 6 years ago
- Easy Data Augmentation for NLP on Chinese☆16Aug 3, 2019Updated 6 years ago