Easy modernBERT fine-tuning and multi-task learning
☆65Mar 13, 2026Updated 2 months ago
Alternatives and similar repositories for tasknet
Users that are interested in tasknet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆195Jul 9, 2025Updated 11 months ago
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…☆101Jul 14, 2022Updated 3 years ago
- ☆11Feb 29, 2024Updated 2 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Multitask-learning of a BERT backbone. Allows to easily train a BERT model with state-of-the-art method such as PCGrad, Gradient Vaccine,…☆20Oct 8, 2023Updated 2 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- ☆15Dec 15, 2025Updated 5 months ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi …☆35May 24, 2024Updated 2 years ago
- ☆15Oct 19, 2020Updated 5 years ago
- Code for paper 'Data-Efficient FineTuning'☆28May 24, 2023Updated 3 years ago
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- ☆15Apr 8, 2022Updated 4 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of the GLOM model for text☆11Mar 4, 2021Updated 5 years ago
- ☆45Oct 14, 2021Updated 4 years ago
- The trainer for HF to record losses of different tasks and objectives.☆54Mar 12, 2025Updated last year
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆67Dec 12, 2024Updated last year
- ☆13Jul 6, 2021Updated 4 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 11 months ago
- ☆13Nov 19, 2022Updated 3 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- Python module for registering and instantiating classes by name☆13Dec 11, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 9 months ago
- Efficient few-shot learning with cross-encoders.☆66Feb 16, 2024Updated 2 years ago
- GOPHI: an AMR-to-English Verbalizer☆11Feb 5, 2020Updated 6 years ago
- AdaptKeyBERT: keyword/keyphrase extraction with zero-shot and few-shot semi-supervised domain adaptation☆26Sep 22, 2024Updated last year
- ☆15Apr 9, 2019Updated 7 years ago
- This is an AI model using SAM and Grounding DINO to segment objects in a floor plan and effectively remove them in order to get a clean a…☆15Mar 30, 2024Updated 2 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15May 3, 2023Updated 3 years ago
- ☆12Dec 30, 2020Updated 5 years ago
- ADAG: Transluce's MLP neuron-level circuit tracing library☆28Apr 10, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Automatically detect errors in annotated corpora.☆48Sep 8, 2023Updated 2 years ago
- ☆10Jun 11, 2019Updated 7 years ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆73Aug 6, 2025Updated 10 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Nov 4, 2022Updated 3 years ago
- ☆11Feb 9, 2024Updated 2 years ago
- a Haskell library that implements (Projective) Discourse Representation Theory (DRT)☆27Sep 15, 2022Updated 3 years ago