xiongma / roberta-wwm-base-distill
This is a distilled RoBERTa-wwm base model, which was distilled from RoBERTa-wwm by RoBERTa-wwm-large.
☆66 Mar 30, 2020 Updated 5 years ago
Alternatives and similar repositories for roberta-wwm-base-distill
Users that are interested in roberta-wwm-base-distill are comparing it to the libraries listed below
- ☆61 Nov 14, 2019 Updated 6 years ago
- DistilBERT for Chinese: a distilled BERT model pre-trained on a large-scale Chinese corpus ☆95 Dec 5, 2019 Updated 6 years ago
- BERT distillation (distillation experiments based on BERT) ☆314 Jul 30, 2020 Updated 5 years ago
- RoBERTa Chinese pre-trained models: RoBERTa for Chinese ☆2,773 Jul 22, 2024 Updated last year
- (AAAI'20) The source code for the paper "Joint Parsing and Generation for Abstractive Summarization". ☆24 Apr 22, 2020 Updated 5 years ago
- LGEB: Benchmark of Language Generation Evaluation ☆16 Oct 21, 2022 Updated 3 years ago
- datagrand 2019 information extraction competition rank9 ☆130 Dec 29, 2019 Updated 6 years ago
- ☆279 Dec 8, 2020 Updated 5 years ago
- Word vectors trained on 200,000 financial news articles ☆25 Jan 19, 2018 Updated 8 years ago
- AIR retriever for Multi-Hop QA (ACL 2020 paper) ☆30 Jul 18, 2020 Updated 5 years ago
- A reproduction of the ACL 2020 FastBERT paper; paper at //arxiv.org/pdf/2004.02178.pdf ☆193 Dec 15, 2021 Updated 4 years ago
- Open Language Pre-trained Model Zoo ☆1,004 Nov 18, 2021 Updated 4 years ago
- Feel free to fine tune large BERT models with Multi-GPU and FP16 support. ☆192 Mar 9, 2020 Updated 5 years ago
- A simple Keras implementation of Paper "Text Matching as Image Recognition" ☆28 Jun 28, 2023 Updated 2 years ago
- roBERTa training for SQuAD ☆50 Mar 2, 2020 Updated 5 years ago
- Implementation of the cw2vec model ☆29 Jul 20, 2018 Updated 7 years ago
- finetune bert for small dataset text classification in a few-shot learning manner using ProtoNet ☆27 Nov 25, 2020 Updated 5 years ago
- ACL 2021: HiTransformer ☆13 May 29, 2021 Updated 4 years ago
- A transformer model that should be able to solve a simple NER task ☆11 Mar 7, 2019 Updated 6 years ago
- Chinese generation tasks with MBart, a multilingual denoising pre-trained model ☆12 May 27, 2021 Updated 4 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering ☆121 May 22, 2023 Updated 2 years ago
- Knowledge Distillation from BERT ☆54 Jan 7, 2019 Updated 7 years ago
- Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark ☆133 May 22, 2023 Updated 2 years ago
- Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer ☆542 Dec 10, 2021 Updated 4 years ago
- Chinese pre-trained XLNet model: Pre-Trained Chinese XLNet_Large ☆229 Sep 13, 2019 Updated 6 years ago
- Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3 ☆184 Jun 4, 2020 Updated 5 years ago
- Reproduction of the ACL 2019 paper: Improving Multi-turn Dialogue Modelling with Utterance ReWriter ☆138 Jan 23, 2020 Updated 6 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing ☆1,698 May 8, 2023 Updated 2 years ago
- Ensemble of 10 modified BERT Base models for prediction of best answers for queries on search engines. ☆16 Jan 1, 2019 Updated 7 years ago
- Easy Data Augmentation for NLP on Chinese ☆16 Aug 3, 2019 Updated 6 years ago
- Python toolkit for Chinese Language Understanding Evaluation benchmark. ☆15 May 22, 2023 Updated 2 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, ALBERT models pre-trained on a large-scale Chinese corpus ☆3,986 Nov 21, 2022 Updated 3 years ago
- A large-scale cleaned Chinese chitchat corpus and Chinese dialogpt models ☆35 Aug 6, 2020 Updated 5 years ago
- keras sparse implement of margin-softmax ☆100 Jul 31, 2018 Updated 7 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab. ☆3,155 Jan 22, 2024 Updated 2 years ago
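- A BERT-based semantic matching model built on a pre-trained Chinese model; the dataset is the official LCQMC data ☆198 Dec 19, 2019 Updated 6 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE ☆146 Jun 5, 2019 Updated 6 years ago
- Data augmentation based on back-translation; currently integrates Baidu, Youdao, and Google (requires a VPN) translation. ☆21 Nov 5, 2020 Updated 5 years ago
- Reading tfrecords in pytorch to build a data pipeline ☆18 May 1, 2019 Updated 6 years ago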