Research framework for low resource text classification that allows the user to experiment with classification models and active learning strategies on a large number of sentence classification datasets, and to simulate real-world scenarios. The framework is easily expandable to new classification models, active learning strategies and datasets.
☆101Mar 9, 2022Updated 4 years ago
Alternatives and similar repositories for low-resource-text-classification-framework
Users that are interested in low-resource-text-classification-framework are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of sequential Information Bottleneck (sIB) in Python and in C++☆20Apr 27, 2026Updated 3 weeks ago
- Active Learning for Text Classification in Python☆643May 17, 2026Updated last week
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning☆14Jun 3, 2025Updated 11 months ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 8 months ago
- Quality Controlled Paraphrase Generation (ACL 2022)☆71Sep 17, 2025Updated 8 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for the EMNLP 2021 Paper "Active Learning by Acquiring Contrastive Examples" & the ACL 2022 Paper "On the Importance of Effectively …☆129May 24, 2022Updated 4 years ago
- Distantly Supervised Biomedical Named Entity Recognition with Dictionary Expansion: https://ieeexplore.ieee.org/document/8983212☆13Jun 4, 2020Updated 5 years ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- Data Annotation Tool for Named Entity Recognition using Active Learning and Transfer Learning☆11Aug 20, 2021Updated 4 years ago
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024☆30Dec 19, 2024Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Apr 21, 2023Updated 3 years ago
- ☆50Nov 11, 2024Updated last year
- The Plumber framework for KG completion and structured triples extraction☆24May 23, 2023Updated 3 years ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆10Oct 10, 2022Updated 3 years ago
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers"☆60Jun 12, 2023Updated 2 years ago
- ☆10Oct 2, 2024Updated last year
- [AAAI 2023] This is the code for our paper `Neighborhood-Regularized Self-Training for Learning with Few Labels'.☆12Jan 11, 2023Updated 3 years ago
- Benchmarking algorithms for assessing quality of data labeled by multiple annotators☆34Dec 3, 2025Updated 5 months ago
- Few-shot NLP benchmark for unified, rigorous eval☆93Jul 12, 2022Updated 3 years ago
- Code and datasets for the paper "Can Pre-trained Language Models Interpret Similes as Smart as Human?" (ACL 2022)☆14Jan 4, 2023Updated 3 years ago
- My personal Gollum deployment☆15Oct 8, 2017Updated 8 years ago
- ☆14Jul 13, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆12Mar 23, 2023Updated 3 years ago
- PyTorch Library for Active Learning to accompany Human-in-the-Loop Machine Learning book☆990Dec 8, 2022Updated 3 years ago
- ☆60Dec 20, 2022Updated 3 years ago
- BERT baselines for extractive question answering on coqa (https://stanfordnlp.github.io/coqa/)☆10Jan 27, 2020Updated 6 years ago
- Multitask Learning with Pretrained Transformers☆40Mar 20, 2021Updated 5 years ago
- ☆13Feb 7, 2023Updated 3 years ago
- Every Eval Ever is a shared schema and crowdsourced eval database. It defines a standardized metadata format for storing AI evaluation re…☆67May 18, 2026Updated last week
- [NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark☆227Feb 13, 2024Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆927Sep 2, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆24Sep 4, 2024Updated last year
- An implementation of GrASP (Shnarch et. al., 2017)☆23Aug 29, 2022Updated 3 years ago
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆212Updated this week
- Code for paper "Incorporating Multimodal Information in Open-Domain Web Keyphrase Extraction"☆19Jan 28, 2021Updated 5 years ago
- Code for: "Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models"☆19Feb 2, 2022Updated 4 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆189Aug 17, 2021Updated 4 years ago
- Code accompanying EMNLP 2020 paper "Cold-start Active Learning through Self-supervised Language Modeling".☆40May 25, 2021Updated 4 years ago