MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
☆72May 19, 2020Updated 6 years ago
Alternatives and similar repositories for MobileBert_PyTorch
Users that are interested in MobileBert_PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆315Jun 12, 2023Updated 2 years ago
- CLUE baseline pytorch CLUE的pytorch版本基线☆75Apr 3, 2020Updated 6 years ago
- Chinese MobileBERT(中文MobileBERT模型)☆99Mar 2, 2022Updated 4 years ago
- 简洁易用版TinyBert:基于Bert进行知识蒸馏的预训练语言模型☆272Oct 24, 2020Updated 5 years ago
- 针对NER领域提供从线下训练到线上部署的一整套闭环流程☆14Jun 16, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- NEZHA: Neural Contextualized Representation for Chinese Language Understanding☆259Aug 13, 2021Updated 4 years ago
- 这里是用torch写的简洁版的GLUE评测代码。This is the simpler code written by torch in GLUE.☆19Jul 10, 2022Updated 3 years ago
- Semi-supervised Domain Adaptation of Machine Translation☆12Dec 8, 2022Updated 3 years ago
- Bootstrapping loss function implementation in pytorch☆36Dec 3, 2020Updated 5 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆60Jun 1, 2020Updated 5 years ago
- pytorch implement of Label Smoothing☆32Dec 16, 2019Updated 6 years ago
- Tuning BERT☆10Jun 28, 2022Updated 3 years ago
- Calculating Expected Time for training LLM.☆39Apr 17, 2023Updated 3 years ago
- IEEE Investment Ranking Challenge solution (4th place)☆10Jun 1, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Unofficial Pytorch implementation of MiniLM and MiniLMv2☆23Jan 30, 2022Updated 4 years ago
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression☆204Sep 20, 2019Updated 6 years ago
- ☆12Oct 10, 2021Updated 4 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Aug 24, 2021Updated 4 years ago
- ☆15Sep 10, 2019Updated 6 years ago
- Elasticsearch, MongoDB, Tornado Server, RESTful API, Python, Information Retrieval, Machine Learning, Web Crawler☆17Jan 9, 2023Updated 3 years ago
- Datasets and source codes for paper "Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability"☆17Nov 17, 2021Updated 4 years ago
- Keras implementation of Padam from "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks"☆17Sep 6, 2018Updated 7 years ago
- A temporary repo to share the DMBERT code for Event Detection☆13Apr 19, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆257Oct 4, 2022Updated 3 years ago
- Utility code for easier django admin development☆25Jan 26, 2026Updated 3 months ago
- Training HuggingFace models using fastai☆11Jul 22, 2021Updated 4 years ago
- This repo contains a PyTorch implementation of a pretrained BERT model for text classification.☆109Aug 30, 2019Updated 6 years ago
- Al-Qur'an yang dikemas dalam bentuk ChatBot☆15Dec 1, 2020Updated 5 years ago
- 基于BERT的预训练语言模型实现,分为两步:预训练和微调。目前已包括BERT、Roberta、ALbert三个模型,且皆可支持Whole Word Mask模式。☆17Feb 1, 2020Updated 6 years ago
- Tensorflow implementation of Attention-over-Attention Neural Networks for Reading Comprehension☆28Sep 25, 2016Updated 9 years ago
- This is the source code for Efficient Sequential Recommendation for Long Term User Interest Via Personalization.☆31Nov 18, 2025Updated 6 months ago
- 多模型中文cnews新闻文本分类☆59Mar 25, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Evolving Normalization-Activation Layers☆19Apr 10, 2020Updated 6 years ago
- DIY_resnet+迁移学习+风格迁移☆18May 20, 2019Updated 7 years ago
- A collection of various NLP datasets, mainly Indonesia-related languages.☆15Apr 23, 2022Updated 4 years ago
- Quora Paraphrasing Dataset Bahasa Indonesia Version☆11Apr 18, 2021Updated 5 years ago
- ☆53Mar 25, 2020Updated 6 years ago
- ☆17Sep 10, 2021Updated 4 years ago
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆192Dec 15, 2021Updated 4 years ago