Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher model. Reduced the size of the original BERT by 40%.
☆26Feb 13, 2021Updated 5 years ago
Alternatives and similar repositories for Question-Answering-with-BERT-and-Knowledge-Distillation
Users that are interested in Question-Answering-with-BERT-and-Knowledge-Distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Use BERT for Question Answering and finetune train with SQuAD 2.0☆15Oct 12, 2019Updated 6 years ago
- Reading comprehension based question-answering model for news articles.☆11Jun 22, 2022Updated 3 years ago
- ☆19Apr 5, 2022Updated 4 years ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆11Sep 3, 2024Updated last year
- This repo contains the code for Late Prompt Tuning.☆12Dec 22, 2025Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Patient Letter Generation☆12Aug 22, 2024Updated last year
- Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre…☆35Aug 19, 2021Updated 4 years ago
- ☆92Sep 29, 2021Updated 4 years ago
- Reading comprehension with ALBERT transformer model☆15Dec 14, 2021Updated 4 years ago
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- ☆17May 31, 2023Updated 2 years ago
- Re-Implementation of SPARTA model☆13Oct 1, 2021Updated 4 years ago
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆22May 18, 2024Updated last year
- Code for "Contextualized Embeddings in Named-Entity Recognition", ECIR 2020☆13Jul 25, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official PyTorch implementation of https://arxiv.org/abs/2210.06340 (NeurIPS ‘22)☆21Nov 14, 2022Updated 3 years ago
- Caching library with support for multiple cache backends☆16Updated this week
- ☆15Dec 20, 2020Updated 5 years ago
- Build end-to-end Machine Learning pipeline to predict accessibility of playgrounds in NYC☆15Jul 9, 2020Updated 5 years ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Updated this week
- Distilling Task-Specific Knowledge from Teacher Model into BiLSTM☆31Dec 14, 2024Updated last year
- Implementations of different neural network pruning techniques☆14Aug 10, 2023Updated 2 years ago
- MFAQ: a Multilingual FAQ Dataset☆18Sep 17, 2023Updated 2 years ago
- ☆14Jul 13, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official source code repository for QueryBlazer: Efficient Query Autocompletion Framework☆20Jun 16, 2025Updated 10 months ago
- Speaker Diarization using GRU in PyTorch☆11Aug 29, 2020Updated 5 years ago
- Very basic Flask application with an interactive form, using Flask-WTF and Flask-Bootstrap☆11Mar 26, 2017Updated 9 years ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆12Apr 17, 2023Updated 3 years ago
- Sample data generated by Synthea☆15Mar 5, 2026Updated last month
- Master's thesis project in collaboration with Rasa, focusing on knowledge distillation from BERT into different very small networks and a…☆13Sep 30, 2022Updated 3 years ago
- assignments and group case studies from PGDMLAI course by upGrad & IIITB☆13Aug 7, 2023Updated 2 years ago
- Anomaly Detection In An IoT-Acquired Environmental Sensor Data☆12Oct 2, 2018Updated 7 years ago
- 한글 단어 혹은 문장 이미지를 받아 텍스트를 반환하는 Text Recognition Model☆19Jun 14, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Keypoint dataset for airplane☆10Dec 28, 2019Updated 6 years ago
- This is project to analyze korquad 2.0☆23Jun 22, 2022Updated 3 years ago
- Code accompanying AWS blog post "Build a Semantic Search Engine for Tabular Columns with Transformers and Amazon OpenSearch Service"☆18Nov 9, 2023Updated 2 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- A bert-fusing architecture for twitter sentiment analysis. accepted in AACL-IJCNLP 2020 Student Research Workshop.☆11Jun 12, 2023Updated 2 years ago
- Facebook Hatebook Memes Challenge☆12Jan 28, 2021Updated 5 years ago
- A collection of my data science articles published in Towards Data Science and Towards AI.☆16Sep 19, 2025Updated 7 months ago