JosselinSomervilleRoberts / BERT-Multitask-learningLinks

Multitask-learning of a BERT backbone. Allows to easily train a BERT model with state-of-the-art method such as PCGrad, Gradient Vaccine, PALs, Scheduling, Class imbalance handling and many optimizations

☆19

Alternatives and similar repositories for BERT-Multitask-learning

Users that are interested in BERT-Multitask-learning are comparing it to the libraries listed below

Sorting:

alexriggio / BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra…
☆77Updated last year
zcgzcgzcg1 / WSDM2023_Knowledge_NLP_Tutorial
☆61Updated 2 years ago
BYU-PCCL / leveraging-llms-for-mcqa
This is the code for the ICLR 2023 paper "Leveraging Large Language Models for Multiple Choice Question Answering."
☆40Updated 2 years ago
tsmatz / finetune_llm_with_lora
Fine-tuning LLM with LoRA (Low-Rank Adaptation) from scratch (Oct 2023)
☆22Updated this week
snowood1 / BERT-ENN
Uncertainty-Aware Reliable Text Classification (KDD 2021)
☆18Updated 2 years ago
claws-lab / XLingEval
Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"
☆16Updated last year
shahrukhx01 / multitask-learning-transformers
A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…
☆97Updated 2 years ago
WhereIsAI / BiLLM
Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…
☆60Updated 6 months ago
yueyu1030 / Patron
[ACL 2023] The code for our ACL'23 paper Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Pr…
☆22Updated last year
Hannibal046 / SelfMemory
[Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory
☆60Updated 2 years ago
osainz59 / t5-encoder
A extension of Transformers library to include T5ForSequenceClassification class.
☆38Updated 2 years ago
fabrahman / ReBART
Code for EMNLP 2021 paper: "Is Everything in Order? A Simple Way to Order Sentences"
☆42Updated last year
zhehengluoK / Biomedical-Text-Summarization-Survey
This repository lists papers, codes, and datasets in Biomedical Text Summarisation based on PLM
☆23Updated 2 years ago
faridlazuarda / cultural-llm-papers
A curated list of research papers and resources on Cultural LLM.
☆44Updated 9 months ago
HrishikeshVish / Fairpy
☆23Updated 11 months ago
zjunlp / knowledge-rumination
[EMNLP 2023] Knowledge Rumination for Pre-trained Language Models
☆17Updated 2 years ago
caskcsg / ir
Collections of IR Research
☆35Updated last month
nianlonggu / Local-Citation-Recommendation
Code for ECIR 2022 paper Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking
☆26Updated 11 months ago
viswavi / few-shot-clustering
☆76Updated 9 months ago
kasnerz / tabgenie
A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.
☆55Updated last year
kayoyin / interpret-lm
Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)
☆62Updated 3 years ago
ServiceNow / data-augmentation-with-llms
Data Augmentation for Intent Classification with Off-the-Shelf Large Language Models is a ServiceNow Research project
☆29Updated 2 years ago
princeton-nlp / MABEL
EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975
☆38Updated last year
Ziems / llm-url
☆35Updated last year
caskcsg / TextSmoothing
☆35Updated 3 years ago
4AI / LS-LLaMA
A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning
☆155Updated last year
Ravoxsg / efficient_unified_crs
Source code for PECRS (EACL 2024)
☆10Updated last year
jpwahle / emnlp23-paraphrase-types
The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"
☆13Updated 8 months ago
yixinL7 / Direct-Style-Transfer
☆8Updated 4 years ago
mukhal / PromptRank
[ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting
☆27Updated 2 years ago