24*2 pre-trained small BERT models, a handy tool for NLP practitioners training models
☆51Apr 12, 2020Updated 6 years ago
Alternatives and similar repositories for PretrainedLittleBERTs
Users that are interested in PretrainedLittleBERTs are comparing it to the libraries listed below.
- EMNLP 2020: Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots☆12Dec 15, 2020Updated 5 years ago
- record and share my reading every day☆12Apr 1, 2016Updated 10 years ago
- ☆11Mar 22, 2020Updated 6 years ago
- An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate …☆248Jun 12, 2023Updated 2 years ago
- Chinese generative pre-trained model☆99Aug 28, 2020Updated 5 years ago
- A complete closed-loop pipeline for NER, from offline training to online deployment☆14Jun 16, 2021Updated 4 years ago
- 2021 Sohu Campus Text Matching Algorithm Competition☆16Jun 4, 2021Updated 4 years ago
- The code for Template-GPT-2 Generation Model for Logic2Text Dataset☆18Jun 1, 2020Updated 5 years ago
- Baseline for the CCKS2020 entity linking competition (pytorch)☆19Aug 15, 2020Updated 5 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆67Mar 30, 2020Updated 6 years ago
- solve text generation tasks by the language model GPT2, including papers, code, demos, and hands-on tutorials. Using the GPT2 language model to solve text generation tasks…☆26Aug 27, 2019Updated 6 years ago
- Implementing activation functions from scratch in Tensorflow.☆36Feb 13, 2022Updated 4 years ago
- ☆63Jan 2, 2020Updated 6 years ago
- multi-gpu pre-training in one machine for BERT without horovod (Data Parallelism)☆171Dec 27, 2025Updated 4 months ago
- PyTorch version for Sequential Matching Network☆21Mar 29, 2019Updated 7 years ago
- Multi-modal data augmentation for machine learning☆16Jun 4, 2019Updated 6 years ago
- some demos of Knowledge Distillation in NLP☆23Dec 31, 2020Updated 5 years ago
- HSML Dynamic version for ICML 2019☆12Jul 11, 2019Updated 6 years ago
- ☆12Jul 20, 2022Updated 3 years ago
- Source Code for paper "NERO: A Neural Rule Grounding Framework for Label-Efficient Relation Extraction", WWW 2020☆46May 6, 2020Updated 6 years ago
- Champion solution for the CCKS2021 irrelevant-answer detection competition☆27Oct 8, 2021Updated 4 years ago
- Webpage for the DSTC8 - NOESIS II: Predicting Responses☆48Mar 24, 2023Updated 3 years ago
- Leaderboards, Datasets and Papers for Multi-Turn Response Selection in Retrieval-Based Chatbots☆202May 31, 2021Updated 4 years ago
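- A collection of high-quality Chinese pre-trained models: state-of-the-art large models, fastest small models, and specialized similarity models☆816Jul 8, 2020Updated 5 years ago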
- Structural Pre-training for Dialogue Comprehension (ACL 2021)☆10Apr 25, 2022Updated 4 years ago
- Implementation for paper "A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation"☆24Mar 1, 2020Updated 6 years ago
- albert+BiLstm+CRF implemented on top of the lightweight albert☆93May 25, 2023Updated 2 years ago
- Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers☆165Jun 12, 2022Updated 3 years ago
- Open solution to the Santander Value Prediction Challenge☆39Jun 22, 2022Updated 3 years ago
- TensorFlow code and pre-trained models for BERT☆24Apr 19, 2019Updated 7 years ago
- Champion of the automotive-topic sentiment analysis competition☆27Dec 10, 2018Updated 7 years ago
- An attempt to make Google BERT closer to production before Hugging Face Transformers etc.☆28Sep 10, 2020Updated 5 years ago
- ☆440Apr 25, 2025Updated last year
- code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"☆14Nov 17, 2023Updated 2 years ago
- Regression Guided by Relative Ranking Using Convolutional Neural Network (R^3 CNN) for Facial Beauty Prediction☆16Feb 5, 2026Updated 3 months ago
- Distilling BERT using natural language generation.☆39Aug 13, 2023Updated 2 years ago
- IPRE: a Dataset for Inter-Personal Relationship Extraction☆95Aug 10, 2019Updated 6 years ago
- A tool for annotating and editing box positions in tesseract, and for training on the boxes to generate a character library☆13Nov 18, 2020Updated 5 years ago
- MT Tutorial for the JSALT 2019 Summer School☆48Jun 24, 2019Updated 6 years ago