A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approaches for achieving this in this repo.
☆101Jul 14, 2022Updated 3 years ago
Alternatives and similar repositories for multitask-learning-transformers
Users that are interested in multitask-learning-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple project training 3 separate NLP tasks simultaneously using Multitask-Learning☆23Jun 12, 2023Updated 2 years ago
- Easy modernBERT fine-tuning and multi-task learning☆65Mar 13, 2026Updated last month
- ☆13Jul 26, 2023Updated 2 years ago
- ☆12Jun 6, 2020Updated 5 years ago
- Multitask Learning with Pretrained Transformers☆40Mar 20, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- ☆13Nov 19, 2022Updated 3 years ago
- Implementation of Semantic Parsing with BERT and compositional pre-training on GeoQuery☆11Mar 20, 2019Updated 7 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- Data and code for "Understanding Linearity of Cross-Lingual Word Embedding Mappings" (TMLR 2022)☆12Jun 8, 2022Updated 3 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Jun 24, 2022Updated 3 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆33May 13, 2024Updated last year
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Course for Interpreting ML Models☆52Feb 16, 2023Updated 3 years ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention".☆15Jun 22, 2021Updated 4 years ago
- Tài liệu học tập tại Khoa CNTT, Trường ĐH Khoa học Tự nhiên, ĐHQG-HCM của 1 sinh viên K23☆23Mar 1, 2026Updated last month
- A neural text style transfer model☆12Jun 23, 2019Updated 6 years ago
- MFAQ: a Multilingual FAQ Dataset☆18Sep 17, 2023Updated 2 years ago
- NLP command-line assistant powered by OpenAI☆21Jan 27, 2024Updated 2 years ago
- ☆63Nov 27, 2022Updated 3 years ago
- ☆13Mar 30, 2026Updated 2 weeks ago
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Apr 25, 2022Updated 3 years ago
- GERNERMED++ is a transfer-learning-based open neural NER model for medical entities designed for German data.☆10Oct 20, 2023Updated 2 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆378Apr 21, 2023Updated 2 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Jul 23, 2020Updated 5 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated last year
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 3 years ago
- NERC-fr: Supervised Named Entity Recognition for French☆13Jul 10, 2015Updated 10 years ago
- multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.☆373Nov 21, 2022Updated 3 years ago
- Fine-tuned BERT on SQuAd 2.0 Dataset. Applied Knowledge Distillation (KD) and fine-tuned DistilBERT (student) using BERT as the teacher m…☆26Feb 13, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.☆16Oct 5, 2023Updated 2 years ago
- This is the Grammarly's Yahoo Answers Formality Corpus☆108Jul 7, 2025Updated 9 months ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- English-Thai Machine Translation with OPUS data☆19Feb 10, 2020Updated 6 years ago
- Ensembling Hugging Face transformers made easy☆61Dec 24, 2022Updated 3 years ago
- data collator for UL2 and U-PaLM☆29Aug 20, 2023Updated 2 years ago