AristotelisPap / Question-Answering-with-BERT-and-Knowledge-Distillation
Fine-tuned BERT on the SQuAD 2.0 dataset. Applied Knowledge Distillation (KD) to fine-tune DistilBERT (the student) using BERT as the teacher model, yielding a model 40% smaller than the original BERT.
☆25 · Updated 4 years ago
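The description above mentions knowledge distillation with BERT as teacher and DistilBERT as student. As a rough illustration only, the following sketch shows one common form of a distillation loss: a temperature-softened KL term between teacher and student distributions, blended with the usual cross-entropy on the true label. The function names, the `temperature` and `alpha` values, and the use of plain Python lists are assumptions made for illustration; the repository's actual loss and hyperparameters are not stated here. For extractive question answering, the logits could be read as scores over candidate answer-start positions.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of floats, softened by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, true_index,
                      temperature=2.0, alpha=0.5):
    """Blend a soft-target KL term with hard-label cross-entropy.

    temperature and alpha are illustrative defaults, not values from the repository.
    """
    # Soft targets: teacher and student distributions at the same temperature.
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)

    # KL(teacher || student); terms with zero teacher probability contribute nothing.
    kl = sum(t * (math.log(t) - math.log(s))
             for t, s in zip(p_teacher, p_student) if t > 0)

    # Hard-label cross-entropy of the student at temperature 1.
    hard = -math.log(softmax(student_logits)[true_index])

    # The T^2 factor keeps the soft-term gradient scale comparable across temperatures.
    return alpha * temperature ** 2 * kl + (1 - alpha) * hard
```

With `alpha=1.0` the loss depends only on how closely the student matches the teacher, so identical logits give a loss of zero; lowering `alpha` shifts weight toward the ground-truth label.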
Alternatives and similar repositories for Question-Answering-with-BERT-and-Knowledge-Distillation:
Users who are interested in Question-Answering-with-BERT-and-Knowledge-Distillation are comparing it to the repositories listed below:
- ☆42 · Updated 4 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper ☆52 · Updated last year
- ☆27 · Updated 3 months ago
- PyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with PyTorch … ☆80 · Updated 2 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper ☆135 · Updated last year
- DSTC9 Submission ☆18 · Updated 3 years ago
- ☆25 · Updated 4 years ago
- Question Answering task using Deep Learning on SQuAD dataset ☆21 · Updated 2 years ago
- Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems ☆22 · Updated 3 years ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021) ☆52 · Updated 2 years ago
- Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction ☆43 · Updated 3 years ago
- Use BERT for Question Answering and finetune train with SQuAD 2.0 ☆15 · Updated 5 years ago
- Implementation of paper "Learning to Encode Text as Human-Readable Summaries using GAN" ☆66 · Updated 5 years ago
- Named Entity Recognition with Pretrained XLM-RoBERTa ☆88 · Updated 3 years ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021) ☆30 · Updated 2 years ago
- Abstractive summarization using Bert2Bert framework. ☆31 · Updated 4 years ago
- Transformer Encoder with Char information for text classification ☆15 · Updated 5 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly. ☆47 · Updated 2 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning ☆93 · Updated 2 years ago
- This is a simple implementation of how to leverage a Language Model for a prompt-based learning model ☆44 · Updated 3 years ago
- ☆22 · Updated 4 years ago
- ☆38 · Updated 2 years ago
- a large scientific paraphrase dataset for longer paraphrase generation ☆38 · Updated 2 years ago
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking". ☆50 · Updated 3 years ago
- The PyTorch implementation of ReCoSa (the Relevant Contexts with Self-attention) for dialogue generation using the multi-head attention an… ☆22 · Updated last year
- Official PyTorch Implementation of SSMix (Findings of ACL 2021) ☆62 · Updated 3 years ago
- ☆57 · Updated last year
- Codes for our ACL21 paper: Language Model as an Annotator: Exploring DialoGPT for Dialogue Summarization ☆94 · Updated 3 years ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models. ☆47 · Updated 2 years ago
- Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (ACL 2022) ☆159 · Updated last year