A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approaches for achieving this in this repo.
☆99Jul 14, 2022Updated 3 years ago
Alternatives and similar repositories for multitask-learning-transformers
Users that are interested in multitask-learning-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- A simple project training 3 separate NLP tasks simultaneously using Multitask-Learning☆23Jun 12, 2023Updated 2 years ago
- Multi-task modelling extensions for huggingface transformers☆13Jul 8, 2025Updated 8 months ago
- Easy modernBERT fine-tuning and multi-task learning☆64Mar 13, 2026Updated last week
- ☆15Oct 19, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- BERT for Multitask Learning☆544Apr 12, 2023Updated 2 years ago
- ☆13Jul 26, 2023Updated 2 years ago
- ☆12Jun 6, 2020Updated 5 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- ☆10Jul 27, 2018Updated 7 years ago
- ☆13Nov 19, 2022Updated 3 years ago
- Loss-Balanced Task Weighting to Reduce Negative Transfer in Multi-Task Learning, AAAI-SA'19☆30Sep 23, 2019Updated 6 years ago
- Implementation of Semantic Parsing with BERT and compositional pre-training on GeoQuery☆11Mar 20, 2019Updated 7 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The Code of Aspect-based Sentiment Analysis via Multitask Learning for Online Reviews☆17Jan 8, 2023Updated 3 years ago
- Data and code for "Understanding Linearity of Cross-Lingual Word Embedding Mappings" (TMLR 2022)☆12Jun 8, 2022Updated 3 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Jun 24, 2022Updated 3 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆33May 13, 2024Updated last year
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 8 months ago
- Course for Interpreting ML Models☆52Feb 16, 2023Updated 3 years ago
- Question Answering on Tabular Data with HuggingFace Transformers Pipeline & TAPAS☆25Dec 25, 2021Updated 4 years ago
- Tài liệu học tập tại Khoa CNTT, Trường ĐH Khoa học Tự nhiên, ĐHQG-HCM của 1 sinh viên K23☆23Mar 1, 2026Updated 3 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A neural text style transfer model☆12Jun 23, 2019Updated 6 years ago
- MFAQ: a Multilingual FAQ Dataset☆18Sep 17, 2023Updated 2 years ago
- ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"☆21Mar 22, 2024Updated 2 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆20Feb 22, 2021Updated 5 years ago
- ☆13Apr 25, 2025Updated 11 months ago
- The NLPStatTest project☆12Mar 12, 2022Updated 4 years ago
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Apr 25, 2022Updated 3 years ago
- ☆16Dec 25, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆378Apr 21, 2023Updated 2 years ago
- PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).☆840Jan 13, 2022Updated 4 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Apr 17, 2024Updated last year
- Agent observability and replay tooling for AI safety & interpretability research.☆79Updated this week
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 3 years ago
- multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.☆373Nov 21, 2022Updated 3 years ago
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.☆16Oct 5, 2023Updated 2 years ago