AristotelisPap / Question-Answering-with-BERT-and-Knowledge-DistillationLinks
Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher model. Reduced the size of the original BERT by 40%.
☆25Updated 4 years ago
Alternatives and similar repositories for Question-Answering-with-BERT-and-Knowledge-Distillation
Users that are interested in Question-Answering-with-BERT-and-Knowledge-Distillation are comparing it to the libraries listed below
Sorting:
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆82Updated 2 years ago
- BERTserini☆26Updated 2 years ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)☆53Updated 3 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- This is the official code for Extractive Summarization of Long Documents by Combining Global and Local Context☆69Updated 4 years ago
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper☆108Updated 4 years ago
- This is a simple implementation of how to leverage a Language Model for a prompt-based learning model☆46Updated 3 years ago
- ☆27Updated 9 months ago
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆141Updated 2 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 2 years ago
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines …☆151Updated 2 years ago
- The jiant toolkit for general-purpose text understanding models☆22Updated 4 years ago
- Use BERT for Question Answering and finetune train with SQuAD 2.0☆15Updated 5 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆78Updated 3 years ago
- ☆42Updated 4 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago
- Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data☆100Updated 2 years ago
- Using BERT For Classifying Documents with Long Texts, check my latest post: https://armandolivares.tech/☆41Updated 5 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆135Updated 2 years ago
- A long version of BART model based on Longformer model☆23Updated 2 years ago
- ☆42Updated 5 years ago
- Entity extraction using BERT + CRF for single-tun / multi-turn setting in dialogues☆30Updated 3 years ago
- PyTorch – SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models.☆62Updated 3 years ago
- Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction☆43Updated 4 years ago
- ☆47Updated 2 years ago
- MobileBERT and DistilBERT for extractive summarization☆91Updated 2 years ago
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…☆96Updated 3 years ago
- [EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification☆130Updated 2 years ago
- BERT for joint intent classification and slot filling☆39Updated 5 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆147Updated 4 months ago