24*2个预训练的小型BERT模型,NLPer炼丹利器
☆51Apr 12, 2020Updated 5 years ago
Alternatives and similar repositories for PretrainedLittleBERTs
Users that are interested in PretrainedLittleBERTs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- EMNLP 2020: Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots☆12Dec 15, 2020Updated 5 years ago
- record and share my reading everyday☆12Apr 1, 2016Updated 9 years ago
- ☆11Mar 22, 2020Updated 6 years ago
- An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate …☆248Jun 12, 2023Updated 2 years ago
- 中文生成式预训练模型☆99Aug 28, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 针对NER领域提供从线下训练到线上部署的一整套闭环流程☆14Jun 16, 2021Updated 4 years ago
- 2021搜狐校园文本匹配算法大赛☆16Jun 4, 2021Updated 4 years ago
- 本项目是CCKS2020实体链指比赛baseline(pytorch)☆19Aug 15, 2020Updated 5 years ago
- motivation: 系统整理NLP各个方向需要阅读的论文☆34Oct 28, 2020Updated 5 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆66Mar 30, 2020Updated 6 years ago
- solve text generation tasks by the language model GPT2, including papers, code, demo demos, and hands-on tutorials. 使用语言模型GPT2来解决文本生成任务的…☆26Aug 27, 2019Updated 6 years ago
- 中文短句相似度☆17Nov 6, 2017Updated 8 years ago
- ☆63Jan 2, 2020Updated 6 years ago
- PyTorch version for Sequential Matching Network☆21Mar 29, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- multi-gpu pre-training in one machine for BERT without horovod (Data Parallelism)☆171Dec 27, 2025Updated 3 months ago
- some demos of Knowledge Distillation in NLP☆23Dec 31, 2020Updated 5 years ago
- ☆12Jul 20, 2022Updated 3 years ago
- Source Code for paper "NERO: A Neural Rule Grounding Framework for Label-Efficient Relation Extraction", WWW 2020☆46May 6, 2020Updated 5 years ago
- CCKS2021答非所问竞赛冠军方案☆27Oct 8, 2021Updated 4 years ago
- Webpage for the DSTC8 - NOESIS II: Predicting Responses☆48Mar 24, 2023Updated 3 years ago
- Leaderboards, Datasets and Papers for Multi-Turn Response Selection in Retrieval-Based Chatbots☆202May 31, 2021Updated 4 years ago
- Structural Pre-training for Dialogue Comprehension (ACL 2021)☆10Apr 25, 2022Updated 3 years ago
- ☆167Apr 19, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- 高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型☆816Jul 8, 2020Updated 5 years ago
- Implementation for paper "A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation"☆24Mar 1, 2020Updated 6 years ago
- 基于轻量级的albert实现albert+BiLstm+CRF☆93May 25, 2023Updated 2 years ago
- Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers☆165Jun 12, 2022Updated 3 years ago
- An attempt to make Google BERT closer to production before Hugging Face Transformers etc.☆28Sep 10, 2020Updated 5 years ago
- Witwicky: An implementation of Transformer in PyTorch.☆22Aug 17, 2020Updated 5 years ago
- ☆440Apr 25, 2025Updated 11 months ago
- Distilling BERT using natural language generation.