A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approaches for achieving this in this repo.
☆101Jul 14, 2022Updated 3 years ago
Alternatives and similar repositories for multitask-learning-transformers
Users that are interested in multitask-learning-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- A simple project training 3 separate NLP tasks simultaneously using Multitask-Learning☆23Jun 12, 2023Updated 2 years ago
- Multi-task modelling extensions for huggingface transformers☆14Jul 8, 2025Updated 9 months ago
- Easy modernBERT fine-tuning and multi-task learning☆65Mar 13, 2026Updated last month
- BERT for Multitask Learning☆544Apr 12, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Jul 26, 2023Updated 2 years ago
- ☆12Jun 6, 2020Updated 5 years ago
- Multitask Learning with Pretrained Transformers☆40Mar 20, 2021Updated 5 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- ☆10Jul 27, 2018Updated 7 years ago
- ☆13Nov 19, 2022Updated 3 years ago
- Implementation of Semantic Parsing with BERT and compositional pre-training on GeoQuery☆11Mar 20, 2019Updated 7 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- Generate multiple choice fill-in-the-blank questions from any article.☆13Dec 8, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The Code of Aspect-based Sentiment Analysis via Multitask Learning for Online Reviews☆17Jan 8, 2023Updated 3 years ago
- Data and code for "Understanding Linearity of Cross-Lingual Word Embedding Mappings" (TMLR 2022)☆12Jun 8, 2022Updated 3 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Jun 24, 2022Updated 3 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆33May 13, 2024Updated last year
- Course for Interpreting ML Models☆52Feb 16, 2023Updated 3 years ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention".☆15Jun 22, 2021Updated 4 years ago
- MFAQ: a Multilingual FAQ Dataset☆18Sep 17, 2023Updated 2 years ago
- ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"☆21Mar 22, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- NLP command-line assistant powered by OpenAI☆21Jan 27, 2024Updated 2 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆20Feb 22, 2021Updated 5 years ago
- ☆63Nov 27, 2022Updated 3 years ago
- ☆13Mar 30, 2026Updated last month
- The NLPStatTest project☆12Mar 12, 2022Updated 4 years ago
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Apr 25, 2022Updated 4 years ago
- GERNERMED++ is a transfer-learning-based open neural NER model for medical entities designed for German data.☆10Oct 20, 2023Updated 2 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆378Apr 21, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 4 years ago
- NERC-fr: Supervised Named Entity Recognition for French☆13Jul 10, 2015Updated 10 years ago
- multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.☆374Nov 21, 2022Updated 3 years ago
- Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher m…☆26Feb 13, 2021Updated 5 years ago
- Agent observability and replay tooling for AI safety & interpretability research.☆103Mar 19, 2026Updated last month
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.☆15Oct 5, 2023Updated 2 years ago