shahrukhx01 / multitask-learning-transformers

A simple recipe for training and running inference with Transformer architectures for multi-task learning on custom datasets. The repo contains two approaches for achieving this.

☆97

Alternatives and similar repositories for multitask-learning-transformers

Users who are interested in multitask-learning-transformers are comparing it to the repositories listed below.

helmy-elrais / RoBERT_Recurrence_over_BERT
PyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with PyTorch …
☆82 Updated 2 years ago
archinetai / smart-pytorch
PyTorch – SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models.
☆63 Updated 3 years ago
yueyu1030 / COSINE
[NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…
☆203 Updated 2 years ago
GeorgeLuImmortal / Hierarchical-BERT-Model-with-Limited-Labelled-Data
☆42 Updated 3 years ago
georgian-io / Transformers-Domain-Adaptation
[DEPRECATED] Adapt Transformer-based language models to new text domains
☆87 Updated last year
edumunozsala / RoBERTa_Encoder_Decoder_Product_Names
Defines Transformer, T5, and RoBERTa encoder-decoder models for product name generation
☆48 Updated 3 years ago
DFKI-NLP / thermostat
Collection of NLP model explanations and accompanying analysis tools
☆144 Updated 2 years ago
helboukkouri / character-bert
Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"
☆201 Updated last year
amzn / multiconer-baseline
☆47 Updated 2 years ago
allenai / sequential_sentence_classification
https://arxiv.org/pdf/1909.04054
☆79 Updated 2 years ago
patil-suraj / exploring-T5
A repo to explore different NLP tasks which can be solved using T5
☆172 Updated 4 years ago
cgmhaicenter / exBERT
☆59 Updated 2 years ago
FreddeFrallan / Contrastive-Tension
State-of-the-art semantic sentence embeddings
☆99 Updated 3 years ago
yg211 / bert_nli
A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)
☆132 Updated last year
forest-snow / alps
Code accompanying EMNLP 2020 paper "Cold-start Active Learning through Self-supervised Language Modeling".
☆40 Updated 4 years ago
kuutsav / information-retrieval
Neural information retrieval / Semantic search / Bi-encoders
☆173 Updated 2 years ago
esdurmus / Wikilingua
Multilingual abstractive summarization dataset extracted from WikiHow.
☆94 Updated 4 months ago
IBM / low-resource-text-classification-framework
Research framework for low resource text classification that allows the user to experiment with classification models and active learning…
☆101 Updated 3 years ago
allenai / PRIMER
The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization
☆157 Updated 2 years ago
wzhouad / NLL-IE
Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021
☆55 Updated 3 years ago
abrazinskas / sigir2022-opinion-summarization-tutorial
This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.
☆34 Updated 3 years ago
fursovia / self-adj-dice
Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper
☆108 Updated 4 years ago
jianguoz / Few-Shot-Intent-Detection
Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines …
☆147 Updated 2 years ago
amazon-science / efficient-longdoc-classification
☆45 Updated 3 years ago
AdamStein97 / Semi-Supervised-BERT-NER
☆33 Updated 2 years ago
csebuetnlp / xl-sum
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 4…
☆273 Updated last year
Guzpenha / transformer_rankers
A library to conduct ranking experiments with transformers.
☆160 Updated 2 years ago
fabrahman / ReBART
Code for EMNLP 2021 paper: "Is Everything in Order? A Simple Way to Order Sentences"
☆42 Updated last year
ccdv-ai / convert_checkpoint_to_lsg
Efficient Attention for Long Sequence Processing
☆97 Updated last year
sf-wa-326 / phrase-bert-topic-model
☆87 Updated 3 years ago