JosselinSomervilleRoberts / BERT-Multitask-learning
Multitask learning on a BERT backbone. Makes it easy to train a BERT model with state-of-the-art methods such as PCGrad, Gradient Vaccine, PALs, scheduling, class imbalance handling, and many optimizations.
☆18 · Updated last year
Alternatives and similar repositories for BERT-Multitask-learning:
Users who are interested in BERT-Multitask-learning are comparing it to the repositories listed below
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa… ☆95 · Updated 2 years ago
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra… ☆69 · Updated last year
- ☆61 · Updated 2 years ago
- Fine-tuning LLM with LoRA (Low-Rank Adaptation) from scratch (Oct 2023) ☆19 · Updated last year
- ☆44 · Updated 2 years ago
- PyTorch – SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models. ☆61 · Updated 2 years ago
- Interpretable unified language safety checking with large language models ☆30 · Updated last year
- Define Transformers, T5 model and RoBERTa Encoder decoder model for product names generation ☆48 · Updated 3 years ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention) ☆62 · Updated 2 years ago
- Minimalist BERT implementation assignment for CS11-711 ☆81 · Updated 2 years ago
- My solutions for 2019/2021 CS224n Assignments. ☆33 · Updated 4 years ago
- PyTorch implementation of “Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement” ☆62 · Updated 4 years ago
- Paper List for Contrastive Learning for Natural Language Processing ☆554 · Updated last year
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries" ☆16 · Updated last year
- ☆11 · Updated 3 years ago
- Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces ☆58 · Updated 4 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self… ☆202 · Updated 2 years ago
- PyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with PyTorch … ☆80 · Updated 2 years ago
- A PyTorch implementation of "Graph Convolutional Networks for Text Classification." (AAAI 2019) ☆126 · Updated 4 years ago
- Code for the paper "Zero-Shot Text Classification with Self-Training" for EMNLP 2022 ☆49 · Updated 2 years ago
- [NeurIPS 2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory ☆59 · Updated last year
- ☆175 · Updated last year
- Hierarchical Attention Transformers (HAT) ☆51 · Updated last year
- Long Document Summarization Papers ☆145 · Updated last year
- Domain adaptation in NLP ☆53 · Updated 3 years ago
- Efficient Attention for Long Sequence Processing ☆93 · Updated last year
- Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing ☆287 · Updated 2 years ago
- ☆23 · Updated 8 months ago
- ☆78 · Updated 2 years ago
- Official Implementation for "Self-Guided Contrastive Learning for BERT Sentence Representations" (ACL 2021) ☆27 · Updated 2 years ago