AristotelisPap / Question-Answering-with-BERT-and-Knowledge-DistillationLinks

Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher model. Reduced the size of the original BERT by 40%.

☆25

Alternatives and similar repositories for Question-Answering-with-BERT-and-Knowledge-Distillation

Users that are interested in Question-Answering-with-BERT-and-Knowledge-Distillation are comparing it to the libraries listed below

Sorting:

MaksTarnavskyi / gector-large
Improved version of GECToR
☆59Updated 2 years ago
helmy-elrais / RoBERT_Recurrence_over_BERT
pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …
☆82Updated 2 years ago
GeorgeLuImmortal / Hierarchical-BERT-Model-with-Limited-Labelled-Data
☆42Updated 3 years ago
cofe-ai / fast-gector
☆59Updated 2 years ago
varunkumar-dev / TransformersDataAugmentation
Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
☆134Updated 2 years ago
jianguoz / Few-Shot-Intent-Detection
Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines …
☆147Updated 2 years ago
JudeLee19 / HMNet-End-to-End-Abstractive-Summarization-for-Meetings
"End-to-End Abstractive Summarization for Meetings" paper - Unofficial PyTorch Implementation
☆53Updated 2 years ago
mawentao277 / CharBERT
CharBERT: Character-aware Pre-trained Language Model (COLING2020)
☆120Updated 4 years ago
cylnlp / dialogsum
DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021
☆179Updated 8 months ago
jasonwei20 / triplet-loss
☆42Updated 4 years ago
shahrukhx01 / multitask-learning-transformers
A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…
☆97Updated 3 years ago
microsoft / DialogLM
Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."
☆141Updated 2 years ago
fursovia / self-adj-dice
Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper
☆108Updated 4 years ago
michiyasunaga / LM-Critic
[EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction
☆119Updated 3 years ago
facebookresearch / bart_ls
Long-context pretrained encoder-decoder models
☆96Updated 2 years ago
laihuiyuan / pre-trained-formality-transfer
Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)
☆30Updated 2 years ago
facebookresearch / ELECTRA-Fewshot-Learning
This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.
☆48Updated 3 years ago
amzn / multiconer-baseline
☆47Updated 2 years ago
nusnlp / esc
The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper
☆56Updated last year
gentaiscool / few-shot-lm
The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)
☆53Updated 3 years ago
neulab / SpanNER
SpanNER: Named EntityRe-/Recognition as Span Prediction
☆132Updated 3 years ago
patil-suraj / longbart
A long version of BART model based on Longformer model
☆23Updated 2 years ago
ahmetgunduz / pytorch-nlp-project-template
A Light and Modular PyTorch NLP Project Template
☆59Updated 4 years ago
lancopku / text-autoaugment
[EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
☆129Updated 2 years ago
andreamad8 / FSB
The Few-Shot Bot: Prompt-Based Learning for Dialogue Systems
☆118Updated 3 years ago
THUDM / FewNLU
☆66Updated 3 years ago
Wendy-Xiao / Extsumm_local_global_context
This is the official code for Extractive Summarization of Long Documents by Combining Global and Local Context
☆69Updated 4 years ago
clovaai / ssmix
Official PyTorch Implementation of SSMix (Findings of ACL 2021)
☆62Updated 4 years ago
amazon-science / fact-check-summarization
☆78Updated last year
rrmenon10 / ADAPET
[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training
☆153Updated 3 years ago