xiongma/roberta-wwm-base-distill

This is a distilled RoBERTa-wwm-base model, obtained by distilling RoBERTa-wwm with RoBERTa-wwm-large as the teacher.

☆67

Alternatives and similar repositories for roberta-wwm-base-distill

Users who are interested in roberta-wwm-base-distill are comparing it to the repositories listed below.

saurabhkulkarni77 / DistillBERT
☆61 · Nov 14, 2019 · Updated 6 years ago
CLUEbenchmark / DistilBert
DistilBERT for Chinese: a large-scale Chinese pre-trained distilled BERT model
☆95 · Dec 5, 2019 · Updated 6 years ago
qiangsiwei / bert_distill
BERT distillation (distillation experiments based on BERT)
☆316 · Jul 30, 2020 · Updated 5 years ago
momo-journey / mbart-chinese
Chinese generation tasks with MBart, the multilingual denoising pre-trained model
☆11 · May 27, 2021 · Updated 5 years ago
brightmart / roberta_zh
Pre-trained Chinese RoBERTa models: RoBERTa for Chinese
☆2,793 · Jul 22, 2024 · Updated 2 years ago
CLUEbenchmark / LGEB
LGEB: Benchmark of Language Generation Evaluation
☆16 · Oct 21, 2022 · Updated 3 years ago
BitVoyage / FastBERT
A reproduction of the ACL 2020 FastBERT paper; paper URL: //arxiv.org/pdf/2004.02178.pdf
☆191 · Dec 15, 2021 · Updated 4 years ago
KaiQiangSong / joint_parse_summ
(AAAI'20) The source code for the paper "Joint Parsing and Generation for Abstractive Summarization".
☆24 · Apr 22, 2020 · Updated 6 years ago
ZhuiyiTechnology / pretrained-models
Open Language Pre-trained Model Zoo
☆1,003 · Nov 18, 2021 · Updated 4 years ago
haoyuhu / bert-multi-gpu
Feel free to fine tune large BERT models with Multi-GPU and FP16 support.
☆193 · Mar 9, 2020 · Updated 6 years ago
MatthewZhuang / word2vec-Finance
Word vectors trained on 200,000 financial news articles
☆27 · Jan 19, 2018 · Updated 8 years ago
airaria / TextBrewer
A PyTorch-based knowledge distillation toolkit for natural language processing
☆1,705 · May 8, 2023 · Updated 3 years ago
DataTerminatorX / Keyword-BERT
☆277 · Dec 8, 2020 · Updated 5 years ago
noobiegz / cw2vec
Implementation of the cw2vec model
☆29 · Jul 20, 2018 · Updated 8 years ago
ChineseGLUE / PyCLUE
Python toolkit for Chinese Language Understanding Evaluation benchmark.
☆15 · May 22, 2023 · Updated 3 years ago
wyu-du / MatchPyramid-for-semantic-matching
A simple Keras implementation of the paper "Text Matching as Image Recognition"
☆27 · Jun 28, 2023 · Updated 3 years ago
CLUEbenchmark / PyCLUE
Python toolkit for Chinese Language Understanding (CLUE) Evaluation benchmark
☆133 · May 22, 2023 · Updated 3 years ago
yym6472 / ConSERT
Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
☆542 · Dec 10, 2021 · Updated 4 years ago
brightmart / xlnet_zh
Pre-trained Chinese XLNet model: Pre-Trained Chinese XLNet_Large
☆228 · Sep 13, 2019 · Updated 6 years ago
ecchochan / roberta-squad
RoBERTa training for SQuAD
☆49 · Mar 2, 2020 · Updated 6 years ago
shuohangwang / Cross-Thought
☆47 · Jan 21, 2021 · Updated 5 years ago
wipen / bert_and_ernie
TensorFlow code and pre-trained models for BERT and ERNIE
☆147 · Jun 5, 2019 · Updated 7 years ago
StonyBrookNLP / deformer
[ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
☆120 · May 22, 2023 · Updated 3 years ago
lonePatient / daguan_2019_rank9
DataGrand 2019 information extraction competition, rank 9
☆130 · Dec 29, 2019 · Updated 6 years ago
ewrfcas / bert_cn_finetune
Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3
☆185 · Jun 4, 2020 · Updated 6 years ago
liu-nlper / dialogue-utterance-rewriter
Reproduction of the ACL 2019 paper: Improving Multi-turn Dialogue Modelling with Utterance ReWriter
☆138 · Jan 23, 2020 · Updated 6 years ago
huawei-noah / Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
☆3,162 · Jan 22, 2024 · Updated 2 years ago
kevinmtian / distill-bert
Knowledge Distillation from BERT
☆54 · Jan 7, 2019 · Updated 7 years ago
sfzhou5678 / PretrainedLittleBERTs
24*2 pre-trained small BERT models, a handy tool for NLP practitioners training models
☆51 · Apr 12, 2020 · Updated 6 years ago
pengming617 / bert_textMatching
A BERT-based semantic matching model built on a pre-trained Chinese model; the dataset is the official LCQMC data
☆196 · Dec 19, 2019 · Updated 6 years ago
brightmart / albert_zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, large-scale Chinese pre-trained ALBERT models
☆3,981 · Nov 21, 2022 · Updated 3 years ago
wuch15 / HiTransformer
ACL 2021: HiTransformer
☆13 · May 29, 2021 · Updated 5 years ago
KuaiSearchPERKS / PERKS
KuaiSearch PERKS
☆12 · Nov 16, 2021 · Updated 4 years ago
naiveHobo / HoboBERT
Ensemble of 10 modified BERT Base models for prediction of best answers for queries on search engines.
☆16 · Jan 1, 2019 · Updated 7 years ago
duo0301 / TextSumma
Reimplementing Neural Summarization by Extracting Sentences and Words
☆16 · Dec 12, 2018 · Updated 7 years ago
WHUIR / MATINF
The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Ans…
☆42 · Sep 7, 2020 · Updated 5 years ago
sz128 / Natural-language-understanding-papers
NLU: domain-intent-slot; text2SQL
☆74 · Apr 18, 2020 · Updated 6 years ago
rgsachin / CoVe
Keras implementation of CoVe
☆50 · Sep 17, 2018 · Updated 7 years ago
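CLUEbenchmark / CLUEPretrainedModels
A collection of high-quality Chinese pre-trained models: state-of-the-art large models, the fastest small models, and dedicated similarity models
☆810 · Jul 8, 2020 · Updated 6 years ago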