JosselinSomervilleRoberts / BERT-Multitask-learning
Multitask learning with a BERT backbone. Makes it easy to train a BERT model with state-of-the-art methods such as PCGrad, Gradient Vaccine, PALs, scheduling, class imbalance handling, and many optimizations.
☆19 Updated last year
Alternatives and similar repositories for BERT-Multitask-learning:
Users who are interested in BERT-Multitask-learning compare it to the repositories listed below.
- Fine-tuning LLM with LoRA (Low-Rank Adaptation) from scratch (Oct 2023) ☆19 Updated last year
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa… ☆96 Updated 2 years ago
- Minimalist BERT implementation assignment for CS11-711 ☆82 Updated 2 years ago
- ☆71 Updated 7 months ago
- An extension of the Transformers library to include a T5ForSequenceClassification class. ☆38 Updated 2 years ago
- ConTextual Mask Auto-Encoder for Dense Passage Retrieval ☆35 Updated 6 months ago
- Code for the paper "Zero-Shot Text Classification with Self-Training" for EMNLP 2022 ☆49 Updated 2 years ago
- This repo contains the dataset and description for Ruddit and its variants. ☆34 Updated 3 years ago
- [WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations ☆88 Updated 3 years ago
- PyTorch – SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models. ☆62 Updated 2 years ago
- Define Transformers, T5 model and RoBERTa Encoder decoder model for product names generation ☆48 Updated 3 years ago
- CS 224N Winter 2023 Default Final Project: Multitask BERT ☆25 Updated 2 years ago
- Uncertainty-Aware Reliable Text Classification (KDD 2021) ☆17 Updated 2 years ago
- PyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with PyTorch … ☆82 Updated 2 years ago
- ☆20 Updated 3 years ago
- Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing ☆288 Updated 2 years ago
- ☆61 Updated 2 years ago
- Official Implementation for "Self-Guided Contrastive Learning for BERT Sentence Representations (ACL 2021)" ☆27 Updated 2 years ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir… ☆37 Updated 6 months ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining ☆118 Updated last year
- ☆19 Updated 3 years ago
- ☆17 Updated 4 years ago
- [COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization" ☆22 Updated 2 years ago
- ☆42 Updated 3 years ago
- ☆78 Updated 2 years ago
- ☆35 Updated 3 years ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆37 Updated last year
- The purpose of this repository is to introduce new dialogue-level commonsense inference datasets and tasks. We chose dialogues as the dat… ☆64 Updated 2 years ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd… ☆56 Updated 4 months ago
- My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensor… ☆12 Updated 3 years ago