JosselinSomervilleRoberts / BERT-Multitask-learning
Multitask learning with a BERT backbone. Makes it easy to train a BERT model with state-of-the-art methods such as PCGrad, Gradient Vaccine, PALs, scheduling, class imbalance handling, and many optimizations.
☆19 Updated last year
Alternatives and similar repositories for BERT-Multitask-learning
Users who are interested in BERT-Multitask-learning are comparing it to the repositories listed below
- code for the paper "Zero-Shot Text Classification with Self-Training" for EMNLP 2022 ☆50 Updated 4 months ago
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa… ☆96 Updated 3 years ago
- Define Transformers, T5 model and RoBERTa Encoder decoder model for product names generation ☆48 Updated 3 years ago
- PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models ☆109 Updated 3 years ago
- PyTorch – SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models. ☆62 Updated 3 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations ☆134 Updated last month
- Python implementation of an N-gram language model with Laplace smoothing and sentence generation. ☆87 Updated 7 years ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention) ☆62 Updated 3 years ago
- Code for "Finetuning Pretrained Transformers into Variational Autoencoders" ☆39 Updated 3 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework ☆255 Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra… ☆77 Updated last year
- Code for our paper "Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation" ☆162 Updated 3 years ago
- ☆46 Updated 3 years ago
- Minimalist BERT implementation assignment for CS11-711 ☆83 Updated 2 years ago
- ☆19 Updated 4 years ago
- data collator for UL2 and U-PaLM ☆29 Updated 2 years ago
- ☆33 Updated 3 years ago
- ☆25 Updated last year
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation ☆121 Updated 2 years ago
- Interpretable unified language safety checking with large language models ☆31 Updated 2 years ago
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings" ☆296 Updated 2 years ago
- Fine tune a T5 transformer model using PyTorch & Transformers🤗 ☆218 Updated 4 years ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd… ☆61 Updated 9 months ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024] ☆97 Updated 2 years ago
- ☆60 Updated 2 years ago
- This repo contains the dataset and description for Ruddit and its variants. ☆34 Updated 3 years ago
- Text classification with Foundation Language Model LLaMA ☆114 Updated 2 years ago
- [COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization" ☆22 Updated 2 years ago
- Leveraging ChatGPT for Text Data Augmentation ☆48 Updated 11 months ago
- ☆52 Updated 4 years ago