This is a distilled RoBERTa-wwm-base model, distilled from RoBERTa-wwm with RoBERTa-wwm-large as the teacher.
☆67 · Mar 30, 2020 · Updated 6 years ago
Alternatives and similar repositories for roberta-wwm-base-distill
Users that are interested in roberta-wwm-base-distill are comparing it to the libraries listed below.
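- ☆61 · Nov 14, 2019 · Updated 6 years ago
- DistilBERT for Chinese: distilled BERT models pre-trained on a large-scale Chinese corpus ☆96 · Dec 5, 2019 · Updated 6 years ago
- BERT distillation (distillation experiments based on BERT) ☆317 · Jul 30, 2020 · Updated 5 years ago
- Chinese generation tasks with MBart, a multilingual denoising pre-trained model ☆11 · May 27, 2021 · Updated 4 years ago
- LGEB: Benchmark of Language Generation Evaluation ☆16 · Oct 21, 2022 · Updated 3 years ago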
- Chinese pre-trained RoBERTa models: RoBERTa for Chinese ☆2,786 · Jul 22, 2024 · Updated last year
- Reproduction of the ACL 2020 FastBERT paper (paper: //arxiv.org/pdf/2004.02178.pdf) ☆192 · Dec 15, 2021 · Updated 4 years ago
- Word vectors trained on 200,000 financial news articles ☆26 · Jan 19, 2018 · Updated 8 years ago
- (AAAI'20) The source code for the paper "Joint Parsing and Generation for Abstractive Summarization". ☆24 · Apr 22, 2020 · Updated 6 years ago
- Open Language Pre-trained Model Zoo ☆1,006 · Nov 18, 2021 · Updated 4 years ago
- Feel free to fine tune large BERT models with Multi-GPU and FP16 support. ☆192 · Mar 9, 2020 · Updated 6 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing ☆1,698 · May 8, 2023 · Updated 3 years ago
- ☆278 · Dec 8, 2020 · Updated 5 years ago
- Implementation of the cw2vec model ☆29 · Jul 20, 2018 · Updated 7 years ago
- A simple Keras implementation of Paper "Text Matching as Image Recognition" ☆28 · Jun 28, 2023 · Updated 2 years ago
- Python toolkit for Chinese Language Understanding Evaluation benchmark. ☆15 · May 22, 2023 · Updated 2 years ago
- Python toolkit for Chinese Language Understanding (CLUE) Evaluation benchmark ☆133 · May 22, 2023 · Updated 2 years ago
- Chinese pre-trained XLNet model: Pre-Trained Chinese XLNet_Large ☆229 · Sep 13, 2019 · Updated 6 years ago
- RoBERTa training for SQuAD ☆50 · Mar 2, 2020 · Updated 6 years ago
- ☆47 · Jan 21, 2021 · Updated 5 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE ☆146 · Jun 5, 2019 · Updated 6 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering ☆121 · May 22, 2023 · Updated 2 years ago
- datagrand 2019 information extraction competition, rank 9 ☆130 · Dec 29, 2019 · Updated 6 years ago
- BERT fine-tuning for CMRC2018, CJRC, DRCD, CHID, C3 ☆184 · Jun 4, 2020 · Updated 5 years ago
- Reproduction of the ACL 2019 paper: Improving Multi-turn Dialogue Modelling with Utterance ReWriter ☆138 · Jan 23, 2020 · Updated 6 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab. ☆3,163 · Jan 22, 2024 · Updated 2 years ago
- Knowledge Distillation from BERT ☆54 · Jan 7, 2019 · Updated 7 years ago
- 24*2 pre-trained small BERT models, a handy tool for NLP practitioners training models ☆51 · Apr 12, 2020 · Updated 6 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, large-scale Chinese pre-trained ALBERT models ☆3,981 · Nov 21, 2022 · Updated 3 years ago
- A BERT-based semantic matching model built on a pre-trained Chinese model; the dataset is the official LCQMC data ☆198 · Dec 19, 2019 · Updated 6 years ago
- ACL 2021: HiTransformer ☆13 · May 29, 2021 · Updated 4 years ago
- Ensemble of 10 modified BERT Base models for prediction of best answers for queries on search engines. ☆16 · Jan 1, 2019 · Updated 7 years ago
- KuaiSearch PERKS ☆12 · Nov 16, 2021 · Updated 4 years ago
- reimplementing Neural Summarization by Extracting Sentences and Words ☆16 · Dec 12, 2018 · Updated 7 years ago
- The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Ans… ☆43 · Sep 7, 2020 · Updated 5 years ago
- NLU: domain-intent-slot; text2SQL ☆74 · Apr 18, 2020 · Updated 6 years ago
- Keras implementation of CoVe ☆50 · Sep 17, 2018 · Updated 7 years ago
- Cross-Lingual Machine Reading Comprehension (EMNLP 2019) ☆67 · Nov 6, 2019 · Updated 6 years ago
- Easy Data Augmentation for NLP on Chinese ☆16 · Aug 3, 2019 · Updated 6 years ago