Self-training with Weak Supervision (NAACL 2021)
☆163 Jul 24, 2023 Updated 2 years ago
Alternatives and similar repositories for ASTRA
Users who are interested in ASTRA are comparing it to the repositories listed below.
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self… ☆206 Aug 17, 2022 Updated 3 years ago
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021] ☆14 Mar 29, 2021 Updated 4 years ago
- [NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark ☆227 Feb 13, 2024 Updated 2 years ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021) ☆156 Mar 20, 2023 Updated 2 years ago
- SPEAR: Programmatically label and build training data quickly. ☆109 Jun 27, 2024 Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆926 Sep 2, 2024 Updated last year
- ☆59 Apr 24, 2021 Updated 4 years ago
- Advances in Neural Information Processing Systems (NeurIPS 2021) ☆22 Nov 4, 2022 Updated 3 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently… ☆108 Sep 10, 2024 Updated last year
- More interactive weak supervision with FlyingSquid ☆317 Sep 1, 2020 Updated 5 years ago
- Active Learning for Text Classification in Python ☆639 Feb 1, 2026 Updated 3 weeks ago
- Implementation of experiments in paper "Learning from Rules Generalizing Labeled Exemplars" to appear in ICLR2020 (https://openreview.net… ☆50 Feb 28, 2023 Updated 3 years ago
- Few-shot NLP benchmark for unified, rigorous eval ☆93 Jul 12, 2022 Updated 3 years ago
- A curated list of programmatic weak supervision papers and resources ☆190 Mar 1, 2023 Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools ☆144 Jun 26, 2023 Updated 2 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆359 Feb 22, 2022 Updated 4 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 Jan 17, 2023 Updated 3 years ago
- ☆77 May 17, 2023 Updated 2 years ago
- Uncertainty-aware Self-training ☆124 Dec 20, 2023 Updated 2 years ago
- Search Engines with Autoregressive Language models ☆295 Apr 4, 2023 Updated 2 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Model ☆21 Sep 2, 2020 Updated 5 years ago
- Measuring if attention is explanation with ROAR ☆22 Mar 3, 2023 Updated 2 years ago
- CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training ☆32 Jul 20, 2022 Updated 3 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here. ☆28 Oct 17, 2023 Updated 2 years ago
- ☆75 Jul 2, 2021 Updated 4 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆91 Feb 12, 2026 Updated 2 weeks ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch ☆265 Jan 27, 2023 Updated 3 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o… ☆606 Jun 15, 2022 Updated 3 years ago
- Tokenize and clean strings in Python ☆11 Jan 11, 2018 Updated 8 years ago
- SummVis is an interactive visualization tool for text summarization. ☆254 Jun 17, 2022 Updated 3 years ago
- ☆69 May 1, 2025 Updated 9 months ago
- Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021) ☆30 Apr 27, 2022 Updated 3 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆340 Jul 6, 2023 Updated 2 years ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy… ☆13 May 14, 2024 Updated last year
- ☆12 Oct 10, 2021 Updated 4 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆59 Jan 12, 2023 Updated 3 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆157 Dec 20, 2023 Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆41 Jan 5, 2022 Updated 4 years ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations ☆786 May 19, 2024 Updated last year