JosselinSomervilleRoberts / BERT-Multitask-learning
Multitask learning with a BERT backbone. Makes it easy to train a BERT model with state-of-the-art methods such as PCGrad, Gradient Vaccine, PALs, scheduling, class imbalance handling, and many optimizations.
☆19Updated 2 years ago
Alternatives and similar repositories for BERT-Multitask-learning
Users that are interested in BERT-Multitask-learning are comparing it to the libraries listed below
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…☆98Updated 3 years ago
- code for the paper "Zero-Shot Text Classification with Self-Training" for EMNLP 2022☆51Updated 3 months ago
- Python implementation of an N-gram language model with Laplace smoothing and sentence generation.☆87Updated 7 years ago
- ☆19Updated 4 years ago
- PyTorch – SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models.☆62Updated 3 years ago
- This is a simple implementation of how to leverage a Language Model for a prompt-based learning model☆45Updated 3 years ago
- PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models☆110Updated last week
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆134Updated 4 months ago
- Code for ACL 2023 paper "HiTIN: Hierarchy-aware Tree Isomorphism Network for Hierarchical Text Classification"☆37Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…☆78Updated 2 years ago
- An extension of the Transformers library to include a T5ForSequenceClassification class.☆40Updated 2 years ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆63Updated last year
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning☆154Updated last year
- ☆42Updated 4 years ago
- [WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations☆91Updated 3 years ago
- Minimalist BERT implementation assignment for CS11-711☆83Updated 3 years ago
- ☆93Updated last year
- Code for our paper "Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation"☆165Updated 3 years ago
- PyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with PyTorch …☆82Updated 3 years ago
- This repo contains the dataset and description for Ruddit and its variants.☆36Updated 3 years ago
- Early solution for Google AI4Code competition☆76Updated 3 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆255Updated last year
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆62Updated 3 years ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗☆219Updated 4 years ago
- Neural information retrieval / Semantic search / Bi-encoders☆174Updated 2 years ago
- [COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization"☆22Updated 3 years ago
- Define Transformers, T5 model and RoBERTa Encoder decoder model for product names generation☆48Updated 4 years ago
- This repository is the official implementation of our paper MVP: Multi-task Supervised Pre-training for Natural Language Generation.☆73Updated 3 years ago
- The model implementations for T5 encoder decoder soft prompt tuning for text generation.☆25Updated 3 years ago
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆11Updated 3 years ago