AristotelisPap / Question-Answering-with-BERT-and-Knowledge-Distillation
Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher model. Reduced the size of the original BERT by 40%.
☆25Updated 4 years ago
Alternatives and similar repositories for Question-Answering-with-BERT-and-Knowledge-Distillation:
Users that are interested in Question-Answering-with-BERT-and-Knowledge-Distillation are comparing it to the libraries listed below
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆81Updated 2 years ago
- ☆27Updated 4 months ago
- ☆42Updated 4 years ago
- DSTC9 Submission☆18Updated 4 years ago
- Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction☆43Updated 3 years ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)☆52Updated 2 years ago
- Improved version of GECToR☆60Updated last year
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆93Updated 2 years ago
- Implementation of paper "Learning to Encode Text as Human-Readable Summaries using GAN"☆66Updated 5 years ago
- ☆38Updated 2 years ago
- ☆25Updated 4 years ago
- Use BERT for Question Answering and finetune train with SQuAD 2.0☆15Updated 5 years ago
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆139Updated 2 years ago
- Question Answering task using Deep Learning on SQuAD dataset☆21Updated 2 years ago
- The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper☆56Updated last year
- This is a simple implementation of how to leverage a Language Model for a prompt-based learning model☆44Updated 3 years ago
- Source code for SummaReranker☆37Updated last year
- Gradient accumulation on tf.estimator☆12Updated 4 years ago
- ☆92Updated 3 years ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆48Updated 2 years ago
- ☆10Updated 3 years ago
- Code for "A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies."☆28Updated 3 years ago
- Official PyTorch Implementation of SSMix (Findings of ACL 2021)☆62Updated 3 years ago
- ☆30Updated 4 years ago
- ☆29Updated 2 years ago
- Bert for CoQA based on PyTorch.☆43Updated last year
- This repository contains the PyTorch implementation of the paper STaCK: Sentence Ordering with Temporal Commonsense Knowledge appearing a…☆28Updated 2 years ago
- ☆33Updated 2 years ago
- Finetune multiple pre-trained Transformer-based models to solve Vietnamese Fake News Detection problem (ReINTEL) in VLSP2020 shared task☆18Updated 4 years ago
- "End-to-End Abstractive Summarization for Meetings" paper - Unofficial PyTorch Implementation☆53Updated 2 years ago