Easy ModernBERT fine-tuning and multi-task learning
☆65 Mar 13, 2026 Updated last month
Alternatives and similar repositories for tasknet
Users that are interested in tasknet are comparing it to the libraries listed below.
- Datasets collection and preprocessing framework for NLP extreme multitask learning ☆195 Jul 9, 2025 Updated 9 months ago
- Multi-task modelling extensions for huggingface transformers ☆21 Mar 3, 2023 Updated 3 years ago
- Discourse Based Evaluation of Language Understanding ☆21 Jan 28, 2023 Updated 3 years ago
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa… ☆101 Jul 14, 2022 Updated 3 years ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder ☆10 Mar 16, 2023 Updated 3 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models ☆12 Jul 1, 2023 Updated 2 years ago
- ☆15 Dec 15, 2025 Updated 4 months ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆35 May 24, 2024 Updated last year
- Code for paper 'Data-Efficient FineTuning' ☆28 May 24, 2023 Updated 2 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020 ☆63 Apr 30, 2024 Updated 2 years ago
- Implementation of the GLOM model for text ☆11 Mar 4, 2021 Updated 5 years ago
- ☆45 Oct 14, 2021 Updated 4 years ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd… ☆65 Dec 12, 2024 Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆23 Jun 30, 2025 Updated 10 months ago
- Logical inference system based on event semantics and degree semantics in formal semantics ☆10 Jan 22, 2023 Updated 3 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated) ☆22 Oct 17, 2022 Updated 3 years ago
- Python module for registering and instantiating classes by name ☆13 Dec 11, 2019 Updated 6 years ago
- Streamlit UI to remove duplicate or near duplicate images ☆12 Mar 25, 2023 Updated 3 years ago
- GOPHI: an AMR-to-English Verbalizer ☆11 Feb 5, 2020 Updated 6 years ago
- Ultra-minimal autoregressive diffusion model for image generation ☆21 Dec 26, 2025 Updated 4 months ago
- ☆15 Apr 9, 2019 Updated 7 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning ☆61 May 31, 2023 Updated 2 years ago
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning ☆19 Oct 6, 2025 Updated 6 months ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings ☆15 May 3, 2023 Updated 3 years ago
- ☆12 Dec 30, 2020 Updated 5 years ago
- Automatically detect errors in annotated corpora. ☆48 Sep 8, 2023 Updated 2 years ago
- State-of-the-art paired encoder and decoder models (17M-1B params) ☆69 Aug 6, 2025 Updated 8 months ago
- ☆15 Jun 19, 2025 Updated 10 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ☆12 Dec 24, 2022 Updated 3 years ago
- ☆11 Feb 9, 2024 Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization ☆157 Nov 4, 2022 Updated 3 years ago
- A Discourse-Level Named Entity Recognition and Relation Extraction Dataset for Chinese Literature Text ☆12 May 14, 2018 Updated 7 years ago
- A Haskell library that implements (Projective) Discourse Representation Theory (DRT) ☆27 Sep 15, 2022 Updated 3 years ago
- ☆24 Jun 12, 2023 Updated 2 years ago
- TPTP python library and benchmarking service ☆13 Oct 2, 2019 Updated 6 years ago
- Dataset of Burmese proverbs ☆11 Jun 26, 2017 Updated 8 years ago
- Experiments for efforts to train a new and improved t5 ☆76 Apr 15, 2024 Updated 2 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3 ☆23 May 20, 2021 Updated 4 years ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆34 Aug 24, 2024 Updated last year