AristotelisPap / Question-Answering-with-BERT-and-Knowledge-Distillation
Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher model. Reduced the size of the original BERT by 40%.
☆25Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Question-Answering-with-BERT-and-Knowledge-Distillation
- Use BERT for Question Answering and finetune train with SQuAD 2.0☆15Updated 5 years ago
- Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction☆43Updated 3 years ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)☆52Updated 2 years ago
- This is a simple implementation of how to leverage a Language Model for a prompt-based learning model☆44Updated 2 years ago
- Improved version of GECToR☆60Updated last year
- ☆42Updated 4 years ago
- Prompt-learning methods used BERT4Keras (PET, EFL and NSP-BERT), both for Chinese and English.☆29Updated 2 years ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆79Updated 2 years ago
- ☆25Updated 4 years ago
- The PyTorch implementation of ReCoSa(the Relevant Contexts with Self-attention) for dialogue generation using the multi-head attention an…☆21Updated last year
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆45Updated 2 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆133Updated last year
- Implementation of paper "Learning to Encode Text as Human-Readable Summaries using GAN"☆65Updated 5 years ago
- This repository contains materials for our tutorial on automatic grammatical error correction: R. Grundkiewicz, C. Bryant, M. Felice: A C…☆38Updated 3 years ago
- KE-Dialogue: Injecting knowledge graph into a fully end-to-end dialogue system.☆45Updated 2 years ago
- ☆26Updated 10 months ago
- The code repository for the paper "Dimsum @LaySumm 20: BART-based Approach for Scientific Document Summarization".☆24Updated 4 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆51Updated last year
- DSTC9 Submission☆18Updated 3 years ago
- ☆40Updated 3 years ago
- PyTorch implementations of algorithms for knowledge distillation.☆56Updated 4 years ago
- TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition (ACL 2021)☆47Updated 2 years ago
- ☆31Updated last year
- SpanNER: Named EntityRe-/Recognition as Span Prediction☆124Updated 2 years ago
- Named Entity Recognition with Pretrained XLM-RoBERTa☆87Updated 3 years ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)☆30Updated 2 years ago
- Library of various Few-Shot Learning frameworks for text classification☆60Updated last year
- Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling☆10Updated 2 years ago
- Code for EMNLP2020 paper: "Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space"☆26Updated 3 years ago