A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approaches for achieving this in this repo.
☆101Jul 14, 2022Updated 3 years ago
Alternatives and similar repositories for multitask-learning-transformers
Users that are interested in multitask-learning-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- A simple project training 3 separate NLP tasks simultaneously using Multitask-Learning☆23Jun 12, 2023Updated 2 years ago
- Multi-task modelling extensions for huggingface transformers☆14Jul 8, 2025Updated 10 months ago
- Easy modernBERT fine-tuning and multi-task learning☆65Mar 13, 2026Updated 2 months ago
- ☆12Jun 6, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- ☆13Nov 19, 2022Updated 3 years ago
- Implementation of Semantic Parsing with BERT and compositional pre-training on GeoQuery☆11Mar 20, 2019Updated 7 years ago
- Generate multiple choice fill-in-the-blank questions from any article.☆13Dec 8, 2022Updated 3 years ago
- Data and code for "Understanding Linearity of Cross-Lingual Word Embedding Mappings" (TMLR 2022)☆12Jun 8, 2022Updated 3 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Jun 24, 2022Updated 3 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆33May 13, 2024Updated 2 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Course for Interpreting ML Models☆52Feb 16, 2023Updated 3 years ago
- Tài liệu học tập tại Khoa CNTT, Trường ĐH Khoa học Tự nhiên, ĐHQG-HCM của 1 sinh viên K23☆21Mar 1, 2026Updated 2 months ago
- A neural text style transfer model☆12Jun 23, 2019Updated 6 years ago
- MFAQ: a Multilingual FAQ Dataset☆18Sep 17, 2023Updated 2 years ago
- ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"☆21Mar 22, 2024Updated 2 years ago
- NLP command-line assistant powered by OpenAI☆21Jan 27, 2024Updated 2 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆19Feb 22, 2021Updated 5 years ago
- ☆64Nov 27, 2022Updated 3 years ago
- The NLPStatTest project☆12Mar 12, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Apr 25, 2022Updated 4 years ago
- ☆16Dec 25, 2021Updated 4 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆378Apr 21, 2023Updated 3 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Jul 23, 2020Updated 5 years ago
- Abstractive and Extractive Text summarization using Transformers.☆86Jun 9, 2023Updated 2 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 4 years ago
- ☆13Mar 30, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.☆374Nov 21, 2022Updated 3 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.☆15Oct 5, 2023Updated 2 years ago
- jiant is an nlp toolkit☆1,675Jul 6, 2023Updated 2 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Mar 16, 2021Updated 5 years ago
- English-Thai Machine Translation with OPUS data☆19Feb 10, 2020Updated 6 years ago
- Efficient few-shot learning with cross-encoders.☆65Feb 16, 2024Updated 2 years ago