this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large
☆67Mar 30, 2020Updated 6 years ago
Alternatives and similar repositories for roberta-wwm-base-distill
Users that are interested in roberta-wwm-base-distill are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆61Nov 14, 2019Updated 6 years ago
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆96Dec 5, 2019Updated 6 years ago
- BERT distillation(基于BERT的蒸馏实验 )☆317Jul 30, 2020Updated 5 years ago
- 多语言降噪预训练模型MBart的中文生成任务☆11May 27, 2021Updated 5 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,788Jul 22, 2024Updated last year
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆192Dec 15, 2021Updated 4 years ago
- Open Language Pre-trained Model Zoo☆1,006Nov 18, 2021Updated 4 years ago
- Feel free to fine tune large BERT models with Multi-GPU and FP16 support.☆192Mar 9, 2020Updated 6 years ago
- ☆278Dec 8, 2020Updated 5 years ago
- Implementation of the cw2vec model☆29Jul 20, 2018Updated 7 years ago
- A simple Keras implementation of Paper "Text Matching as Image Recognition"☆27Jun 28, 2023Updated 2 years ago
- Python toolkit for Chinese Language Understanding Evaluation benchmark.☆15May 22, 2023Updated 3 years ago
- Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark☆133May 22, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer☆541Dec 10, 2021Updated 4 years ago
- 中文预训练XLNet模型: Pre-Trained Chinese XLNet_Large☆229Sep 13, 2019Updated 6 years ago
- ☆47Jan 21, 2021Updated 5 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE☆146Jun 5, 2019Updated 6 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆121May 22, 2023Updated 3 years ago
- datagrand 2019 information extraction competition rank9☆130Dec 29, 2019Updated 6 years ago
- Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3☆184Jun 4, 2020Updated 5 years ago
- ACL 2019论文复现:Improving Multi-turn Dialogue Modelling with Utterance ReWriter☆138Jan 23, 2020Updated 6 years ago
- Knowledge Distillation from BERT☆54Jan 7, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 24*2个预训练的小型BERT模型,NLPer炼丹利器☆51Apr 12, 2020Updated 6 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中 文预训练ALBERT模型☆3,980Nov 21, 2022Updated 3 years ago
- 利用预训练的中文模型实现基于bert的语义匹配模型 数据集为LCQMC官方数据☆198Dec 19, 2019Updated 6 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 5 years ago
- Ensemble of 10 modified BERT Base models for prediction of best answers for queries on search engines.☆16Jan 1, 2019Updated 7 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- reimplementing Neural Summarization by Extracting Sentences and Words☆16Dec 12, 2018Updated 7 years ago
- The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Ans…☆43Sep 7, 2020Updated 5 years ago
- NLU: domain-intent-slot; text2SQL☆74Apr 18, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Keras implementation of CoVe☆50Sep 17, 2018Updated 7 years ago
- Cross-Lingual Machine Reading Comprehension (EMNLP 2019)☆67Nov 6, 2019Updated 6 years ago
- Easy Data Augmentation for NLP on Chinese☆16Aug 3, 2019Updated 6 years ago
- 高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型☆812Jul 8, 2020Updated 5 years ago
- finetune bert for small dataset text classification in a few-shot learning manner using ProtoNet☆27Nov 25, 2020Updated 5 years ago
- A transformer model that should be able to solve a simple NER task☆11Mar 7, 2019Updated 7 years ago
- Code for Fact-level Extractive Summarization with Hierarchical Graph Mask on BERT (coling 2020)☆16Mar 25, 2023Updated 3 years ago