Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher model. Reduced the size of the original BERT by 40%.
☆26Feb 13, 2021Updated 5 years ago
Alternatives and similar repositories for Question-Answering-with-BERT-and-Knowledge-Distillation
Users that are interested in Question-Answering-with-BERT-and-Knowledge-Distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Use BERT for Question Answering and finetune train with SQuAD 2.0☆15Oct 12, 2019Updated 6 years ago
- Reading comprehension based question-answering model for news articles.☆11Jun 22, 2022Updated 4 years ago
- BERT based pretrained model using SQuAD 2.0 Dataset for Question-Answering☆30Mar 19, 2021Updated 5 years ago
- ☆19Apr 5, 2022Updated 4 years ago
- bert蒸馏实践,包含BiLSTM蒸馏BERT和TinyBert☆13Apr 23, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- NER with Deep contextualized word representations (Elmo)☆23Jul 22, 2018Updated 7 years ago
- CS172 Final project: Text Image Super-Resolution Reconstruction☆14Jun 15, 2020Updated 6 years ago
- This repo contains the code for Late Prompt Tuning.☆12Dec 22, 2025Updated 6 months ago
- Patient Letter Generation☆12Aug 22, 2024Updated last year
- 자연어 처리 기반 [한글 서술형 수학문제 데이터셋] 공개 저장소입니다.☆14Jun 12, 2023Updated 3 years ago
- Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre…☆35Aug 19, 2021Updated 4 years ago
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- ☆17May 31, 2023Updated 3 years ago
- Re-Implementation of SPARTA model☆13Oct 1, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆22May 18, 2024Updated 2 years ago
- Code for "Contextualized Embeddings in Named-Entity Recognition", ECIR 2020☆13Jul 25, 2024Updated last year
- Official PyTorch implementation of https://arxiv.org/abs/2210.06340 (NeurIPS ‘22)☆21Nov 14, 2022Updated 3 years ago
- A quick and easy-to-use Rust library for Excel file editing.☆32Apr 17, 2026Updated 2 months ago
- ☆15Dec 20, 2020Updated 5 years ago
- ☆13Nov 9, 2021Updated 4 years ago
- knowledge distillation on BERT☆29Apr 11, 2020Updated 6 years ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Jun 22, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementations of different neural network pruning techniques☆14Aug 10, 2023Updated 2 years ago
- Official source code repository for QueryBlazer: Efficient Query Autocompletion Framework☆20Jun 16, 2025Updated last year
- 使用Few-Shot方法来做文本分类任务,基于THUCNews数据☆10Jun 4, 2020Updated 6 years ago
- Sample data generated by Synthea☆16May 28, 2026Updated last month
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆12Apr 17, 2023Updated 3 years ago
- Bidirectional Encoder Representations from Transformers (BERT) transfer learning for named entity recognition and de-identification of se…☆10Aug 3, 2019Updated 6 years ago
- A PyTorch Implementation of YOLOv3☆14Apr 16, 2019Updated 7 years ago
- Master's thesis project in collaboration with Rasa, focusing on knowledge distillation from BERT into different very small networks and a…☆13Sep 30, 2022Updated 3 years ago
- Korean-Math-Word-Problem-Solver☆22Dec 21, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Anomaly Detection In An IoT-Acquired Environmental Sensor Data☆12Oct 2, 2018Updated 7 years ago
- 한글 단어 혹은 문장 이미지를 받아 텍스트를 반환하는 Text Recognition Model☆20Jun 14, 2020Updated 6 years ago
- ☆13Jun 2, 2022Updated 4 years ago
- Keypoint dataset for airplane☆10Dec 28, 2019Updated 6 years ago
- This is project to analyze korquad 2.0☆23Jun 22, 2022Updated 4 years ago
- Code for the CIKM 2019 Paper: How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations☆32Jun 12, 2023Updated 3 years ago
- ☆11Jul 17, 2020Updated 5 years ago