AristotelisPap / Question-Answering-with-BERT-and-Knowledge-Distillation
Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher model. Reduced the size of the original BERT by 40%.
☆25Updated 4 years ago
Alternatives and similar repositories for Question-Answering-with-BERT-and-Knowledge-Distillation:
Users that are interested in Question-Answering-with-BERT-and-Knowledge-Distillation are comparing it to the libraries listed below
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)☆52Updated 2 years ago
- Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction☆43Updated 3 years ago
- ☆42Updated 4 years ago
- ☆33Updated 2 years ago
- This is a simple implementation of how to leverage a Language Model for a prompt-based learning model☆44Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆52Updated last year
- Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling☆10Updated 2 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆134Updated last year
- ☆27Updated 4 months ago
- Implementation of paper "Learning to Encode Text as Human-Readable Summaries using GAN"☆66Updated 5 years ago
- ☆52Updated 4 years ago
- ☆47Updated 2 years ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆81Updated 2 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 3 years ago
- Prompt-learning methods used BERT4Keras (PET, EFL and NSP-BERT), both for Chinese and English.☆29Updated 2 years ago
- Improved version of GECToR☆60Updated last year
- Library of various Few-Shot Learning frameworks for text classification☆60Updated 2 years ago
- Robust Self-augmentation for NER with Meta-reweighting☆29Updated 2 years ago
- [ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling☆72Updated 2 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 2 years ago
- Source codes for the paper "Multi-View Sequence-to-Sequence Models with Conversational Structure for Abstractive Dialogue Summarization"☆90Updated last year
- This is the official code for Extractive Summarization of Long Documents by Combining Global and Local Context☆69Updated 4 years ago
- a large scientific paraphrase dataset for longer paraphrase generation☆38Updated 2 years ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆48Updated 2 years ago
- A long version of BART model based on Longformer model☆23Updated last year
- DSTC9 Submission☆18Updated 4 years ago
- Transformer Encoder with Char information for text classification☆15Updated 5 years ago
- Text summarization with python and transformer☆13Updated last year
- ☆12Updated 5 years ago
- The code repository for the paper "Dimsum @LaySumm 20: BART-based Approach for Scientific Document Summarization".☆24Updated 4 years ago