Research framework for low resource text classification that allows the user to experiment with classification models and active learning strategies on a large number of sentence classification datasets, and to simulate real-world scenarios. The framework is easily expandable to new classification models, active learning strategies and datasets.
☆101Mar 9, 2022Updated 4 years ago
Alternatives and similar repositories for low-resource-text-classification-framework
Users that are interested in low-resource-text-classification-framework are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Active Learning for Text Classification in Python☆640Apr 17, 2026Updated 2 weeks ago
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning☆14Jun 3, 2025Updated 11 months ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 7 months ago
- The YASO targeted sentiment analysis dataset, accompanied by evaluation code.☆20Sep 17, 2025Updated 7 months ago
- Quality Controlled Paraphrase Generation (ACL 2022)☆71Sep 17, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Text Classification Dataset for Turkish Language☆10Nov 16, 2021Updated 4 years ago
- code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022☆27May 18, 2022Updated 3 years ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- AutoDeploy is a single configuration deployment library☆40Sep 30, 2021Updated 4 years ago
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024☆30Dec 19, 2024Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Apr 21, 2023Updated 3 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- The Plumber framework for KG completion and structured triples extraction☆23May 23, 2023Updated 2 years ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…☆15Apr 28, 2022Updated 4 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- ☆13Mar 25, 2022Updated 4 years ago
- Active learning in NLP☆14Dec 14, 2022Updated 3 years ago
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers"☆60Jun 12, 2023Updated 2 years ago
- ☆10Oct 2, 2024Updated last year
- [AAAI 2023] This is the code for our paper `Neighborhood-Regularized Self-Training for Learning with Few Labels'.☆12Jan 11, 2023Updated 3 years ago
- Benchmarking algorithms for assessing quality of data labeled by multiple annotators☆34Dec 3, 2025Updated 5 months ago
- Few-shot NLP benchmark for unified, rigorous eval☆93Jul 12, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Active learning☆78Feb 8, 2023Updated 3 years ago
- Code and datasets for the paper "Can Pre-trained Language Models Interpret Similes as Smart as Human?" (ACL 2022)☆14Jan 4, 2023Updated 3 years ago
- A modular active learning framework for Python☆2,348Feb 26, 2024Updated 2 years ago
- ☆14Jul 13, 2025Updated 9 months ago
- Source code for NAACL 2022 paper Weakly Supervised Text Classification using Supervision Signals from a Language Mode☆10Jun 13, 2022Updated 3 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆12Mar 23, 2023Updated 3 years ago
- A codebase for ACL 2023 paper: Mitigating Label Biases for In-context Learning☆10Aug 4, 2023Updated 2 years ago
- PyTorch Library for Active Learning to accompany Human-in-the-Loop Machine Learning book☆987Dec 8, 2022Updated 3 years ago
- ☆60Dec 20, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- BERT baselines for extractive question answering on coqa (https://stanfordnlp.github.io/coqa/)☆10Jan 27, 2020Updated 6 years ago
- Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics☆219Jul 19, 2022Updated 3 years ago
- SimCSE☆15Oct 1, 2022Updated 3 years ago
- ☆11Dec 23, 2021Updated 4 years ago
- ☆25Apr 26, 2023Updated 3 years ago
- Multitask Learning with Pretrained Transformers☆40Mar 20, 2021Updated 5 years ago
- ☆13Feb 7, 2023Updated 3 years ago