Multitask learning of a BERT backbone. Makes it easy to train a BERT model with state-of-the-art methods such as PCGrad, Gradient Vaccine, PALs, scheduling, class-imbalance handling, and many optimizations.
☆20 · Oct 8, 2023 · Updated 2 years ago
Alternatives and similar repositories for BERT-Multitask-learning
Users that are interested in BERT-Multitask-learning are comparing it to the libraries listed below.
- A simple project training 3 separate NLP tasks simultaneously using Multitask-Learning ☆23 · Jun 12, 2023 · Updated 2 years ago
- python codes for iDNA-ABF: multi-scale deep biological language learning model for the accurate and interpretable prediction of DNA methy… ☆15 · May 6, 2024 · Updated last year
- PyTorch implements `Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning` paper. ☆14 · Aug 19, 2022 · Updated 3 years ago
- Code to build models that effectively predict promoter-driven gene expression ☆11 · May 15, 2025 · Updated 10 months ago
- SHApe simulation with graph KERnels ☆11 · May 17, 2021 · Updated 4 years ago
- Bivariate Shapley is a Shapley-based method of identifying directional feature interactions and feature redundancy ☆20 · May 19, 2025 · Updated 10 months ago
- BoostDiff - Inference of differential networks from gene expression data ☆13 · Jan 11, 2024 · Updated 2 years ago
- All my experiments with the various transformers and various transformer frameworks available ☆14 · Apr 30, 2021 · Updated 4 years ago
- TyDiP Multilingual Politeness dataset and code ☆12 · Oct 15, 2023 · Updated 2 years ago
- A curated list of community detection research papers with implementations. ☆13 · Jan 2, 2020 · Updated 6 years ago
- Frequent subgraph mining using FFSM algorithm, C++ ☆11 · Jan 15, 2018 · Updated 8 years ago
- ☆13 · Aug 28, 2018 · Updated 7 years ago
- ☆16 · Mar 13, 2026 · Updated last week
- Seth's Github Pages ☆10 · Sep 27, 2021 · Updated 4 years ago
- Sentence VAE using the Transformer encoder-decoder architecture. ☆12 · Nov 30, 2024 · Updated last year
- Implements several Markov chain Monte Carlo (MCMC) algorithms for the latent Dirichlet allocation (LDA) model ☆11 · Feb 11, 2020 · Updated 6 years ago
- A documentation for FAIR GPT, a virtual RDM consultant ☆15 · Oct 10, 2024 · Updated last year
- ☆20 · Updated this week
- ☆14 · Jan 13, 2023 · Updated 3 years ago
- VisBERT: Demo web app for "How Does BERT Answer Questions?" ☆11 · Jul 22, 2023 · Updated 2 years ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models" ☆18 · Oct 1, 2024 · Updated last year
- A transformer model to predict pathogenic mutations ☆12 · Jun 25, 2025 · Updated 9 months ago
- ☆12 · Jul 22, 2025 · Updated 8 months ago
- ☆13 · Jul 11, 2018 · Updated 7 years ago
- Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21). ☆11 · Apr 13, 2021 · Updated 4 years ago
- Evaluating RNA structure prediction using diverse thermodynamic prediction tasks and high-throughput datasets. ☆17 · Jun 10, 2022 · Updated 3 years ago
- C implementation of algorithms to find the Density-Friendly graph decomposition ☆12 · Apr 6, 2020 · Updated 5 years ago
- Named entity recognition ☆13 · Jul 28, 2020 · Updated 5 years ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod… ☆14 · Apr 28, 2023 · Updated 2 years ago
- Research code for "Towards multi-task learning of speech and speaker recognition" at https://arxiv.org/pdf/2302.12773.pdf ☆12 · Dec 2, 2024 · Updated last year
- Solution of CCL2021 Track1 Subtask1 NER ☆23 · Mar 29, 2022 · Updated 3 years ago
- A Python framework for generating multilayer networks with planted mesoscale structure. ☆18 · Jun 4, 2023 · Updated 2 years ago
- Library for multilayer network tensor factorization ☆18 · Jan 27, 2021 · Updated 5 years ago
- Mobile Artificial Intelligence Projects, published by Packt ☆11 · Jan 30, 2023 · Updated 3 years ago
- ☆15 · Oct 19, 2020 · Updated 5 years ago
- User-friendly extensions to the Disease Ontology ☆20 · Sep 9, 2016 · Updated 9 years ago
- ☆13 · Dec 6, 2018 · Updated 7 years ago
- Weighted Training for Cross-Task Learning ☆15 · Feb 12, 2023 · Updated 3 years ago
- Joint multi-task emotion deep neural model for emotion classification in multigenre. ☆14 · May 10, 2024 · Updated last year