Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher model. Reduced the size of the original BERT by 40%.
☆26Feb 13, 2021Updated 5 years ago
Alternatives and similar repositories for Question-Answering-with-BERT-and-Knowledge-Distillation
Users that are interested in Question-Answering-with-BERT-and-Knowledge-Distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Use BERT for Question Answering and finetune train with SQuAD 2.0☆15Oct 12, 2019Updated 6 years ago
- Reading comprehension based question-answering model for news articles.☆11Jun 22, 2022Updated 3 years ago
- BERT based pretrained model using SQuAD 2.0 Dataset for Question-Answering☆30Mar 19, 2021Updated 5 years ago
- ☆19Apr 5, 2022Updated 4 years ago
- bert蒸馏实践,包含BiLSTM蒸馏BERT和TinyBert☆13Apr 23, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- CS172 Final project: Text Image Super-Resolution Reconstruction☆14Jun 15, 2020Updated 5 years ago
- This repo contains the code for Late Prompt Tuning.☆12Dec 22, 2025Updated 5 months ago
- 자연어 처리 기반 [한글 서술형 수학문제 데이터셋] 공개 저장소입니다.☆14Jun 12, 2023Updated 2 years ago
- Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre…☆35Aug 19, 2021Updated 4 years ago
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]☆17Apr 7, 2024Updated 2 years ago
- ☆17May 31, 2023Updated 3 years ago
- Re-Implementation of SPARTA model☆13Oct 1, 2021Updated 4 years ago
- Official PyTorch implementation of https://arxiv.org/abs/2210.06340 (NeurIPS ‘22)☆21Nov 14, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A quick and easy-to-use Rust library for Excel file editing.☆32Apr 17, 2026Updated last month
- ☆15Dec 20, 2020Updated 5 years ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Jun 1, 2026Updated last week
- Distilling Task-Specific Knowledge from Teacher Model into BiLSTM☆31Dec 14, 2024Updated last year
- ☆33Jun 2, 2026Updated last week
- Implementations of different neural network pruning techniques☆14Aug 10, 2023Updated 2 years ago
- MFAQ: a Multilingual FAQ Dataset☆18Sep 17, 2023Updated 2 years ago
- Very basic Flask application with an interactive form, using Flask-WTF and Flask-Bootstrap☆11Mar 26, 2017Updated 9 years ago
- Sample data generated by Synthea☆15May 28, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆12Apr 17, 2023Updated 3 years ago
- A Tight-fisted Optimizer (Tiger), implemented in PyTorch.☆12Jun 26, 2024Updated last year
- Bidirectional Encoder Representations from Transformers (BERT) transfer learning for named entity recognition and de-identification of se…☆10Aug 3, 2019Updated 6 years ago
- A PyTorch Implementation of YOLOv3☆14Apr 16, 2019Updated 7 years ago
- Master's thesis project in collaboration with Rasa, focusing on knowledge distillation from BERT into different very small networks and a…☆13Sep 30, 2022Updated 3 years ago
- assignments and group case studies from PGDMLAI course by upGrad & IIITB☆13Aug 7, 2023Updated 2 years ago
- Korean-Math-Word-Problem-Solver☆22Dec 21, 2023Updated 2 years ago
- ☆13Jun 2, 2022Updated 4 years ago
- Keypoint dataset for airplane☆10Dec 28, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is project to analyze korquad 2.0☆23Jun 22, 2022Updated 3 years ago
- ☆11Jul 17, 2020Updated 5 years ago
- A novel BERT model with online Elastic Weight Loss that learns and consolidates information to mimic memory while minimizing catastrophic…☆10Jul 25, 2024Updated last year
- ☆15Oct 24, 2023Updated 2 years ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year
- Pytorch implementation of our paper accepted by NeurIPS 2022 -- Learning Best Combination for Efficient N:M Sparsity☆22Jan 13, 2023Updated 3 years ago
- Repository for managing python tools that model standoff annotations for i2b2 2014 challenge☆14May 12, 2015Updated 11 years ago