Multitask-learning of a BERT backbone. Allows to easily train a BERT model with state-of-the-art method such as PCGrad, Gradient Vaccine, PALs, Scheduling, Class imbalance handling and many optimizations
☆20Oct 8, 2023Updated 2 years ago
Alternatives and similar repositories for BERT-Multitask-learning
Users that are interested in BERT-Multitask-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CS 224N Winter 2023 Default Final Project: Multitask BERT☆26Mar 23, 2023Updated 3 years ago
- A simple project training 3 separate NLP tasks simultaneously using Multitask-Learning☆23Jun 12, 2023Updated 2 years ago
- Easy modernBERT fine-tuning and multi-task learning☆65Mar 13, 2026Updated last month
- All my experiments with the various transformers and various transformer frameworks available☆14Apr 30, 2021Updated 5 years ago
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Updated this week
- Implements several Markov chain Monte Carlo (MCMC) algorithms for the latent Dirichlet allocation (LDA) model☆11Feb 11, 2020Updated 6 years ago
- Sentence VAE using the Transformer encoder-decoder architecture.☆12Nov 30, 2024Updated last year
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- A transformer model to predict pathogenic mutations☆12Jun 25, 2025Updated 10 months ago
- A variational autoencoder is trained on motion capture data and used to generate humanoid animations in Unity3D by sampling the latent sp…☆16May 13, 2020Updated 5 years ago
- ☆13Jul 11, 2018Updated 7 years ago
- Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).☆11Apr 13, 2021Updated 5 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Jun 12, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 命名实体识别☆13Jul 28, 2020Updated 5 years ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Apr 28, 2023Updated 3 years ago
- Research code for "Towards multi-task learning of speech and speaker recognition" at https://arxiv.org/pdf/2302.12773.pdf☆12Dec 2, 2024Updated last year
- ☆15Oct 19, 2020Updated 5 years ago
- 📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer…☆11May 30, 2019Updated 6 years ago
- Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP …☆17Oct 17, 2023Updated 2 years ago
- This is the official repository of our ECCV 2022 paper, "Point MixSwap: Attentional Point Cloud Mixing via Swapping Matched Structural Di…☆13Oct 26, 2022Updated 3 years ago
- [ICLR 2025] Learning-Guided Rolling Horizon Optimization for Long-Horizon Flexible Job-Shop Scheduling☆20Oct 20, 2025Updated 6 months ago
- Code for the paper "Predict-then-optimize or predict-and-optimize? An empirical evaluation of cost-sensitive learning strategies".☆19Feb 7, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- An Egghead collection to teach product-building with JAMStack and Serverless☆12Jun 7, 2020Updated 5 years ago
- ☆17Nov 8, 2024Updated last year
- ☆17Sep 24, 2018Updated 7 years ago
- PyTorch – SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models.☆62Jun 28, 2022Updated 3 years ago
- Code for paper: AdvKnn: Adversarial Attacks On K-Nearest Neighbor Classifiers With Approximate Gradients☆14Dec 23, 2019Updated 6 years ago
- ☆18Jun 26, 2023Updated 2 years ago
- This is the C++ and Matlab implementation of the CVIU 2019 paper 'L2 Divergence for robust colour transfer'☆17Feb 22, 2019Updated 7 years ago
- ☆17Jul 30, 2024Updated last year
- ☆13Feb 18, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Fine grained Empathy Direction Detection☆16Dec 11, 2020Updated 5 years ago
- Viewer for text datasets in formats like HuggingFace, JSONL, etc.☆15Feb 25, 2025Updated last year
- Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"☆17Oct 29, 2024Updated last year
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 7 months ago
- ☆10Aug 13, 2020Updated 5 years ago
- Morgan A. Schmitz., Matthieu Heitz, Nicolas Bonneel, Fred Ngole, David Coeurjolly, Marco Cuturi, Gabriel Peyré, and Jean-Luc Starck. "Was…☆20Oct 18, 2019Updated 6 years ago
- Embed media in a 2D scatter plot.☆16Oct 1, 2020Updated 5 years ago