microsoft/ASTRA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/ASTRA)

microsoft / ASTRA

Self-training with Weak Supervision (NAACL 2021)

☆162

Alternatives and similar repositories for ASTRA

Users that are interested in ASTRA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yueyu1030 / COSINE
View on GitHub
[NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…
☆205Aug 17, 2022Updated 3 years ago
YipingNUS / OptimSeed
View on GitHub
OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]
☆14Mar 29, 2021Updated 5 years ago
autonlab / weasel
View on GitHub
Weakly Supervised End-to-End Learning (NeurIPS 2021)
☆155Mar 20, 2023Updated 3 years ago
JieyuZ2 / wrench
View on GitHub
[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark
☆231Feb 13, 2024Updated 2 years ago
NorskRegnesentral / skweak
View on GitHub
skweak: A software toolkit for weak supervision applied to NLP tasks
☆925Sep 2, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
HazyResearch / flyingsquid
View on GitHub
More interactive weak supervision with FlyingSquid
☆315Sep 1, 2020Updated 5 years ago
ZihanWangKi / XClass
View on GitHub
☆59Apr 24, 2021Updated 5 years ago
decile-team / spear
View on GitHub
SPEAR: Programmatically label and build training data quickly.
☆112Jun 27, 2024Updated 2 years ago
JieyuZ2 / Awesome-Weak-Supervision
View on GitHub
A curated list of programmatic weak supervision papers and resources
☆195Mar 1, 2023Updated 3 years ago
fgranese / DOCTOR
View on GitHub
Advances in Neural Information Processing Systems (NeurIPS 2021)
☆23Nov 4, 2022Updated 3 years ago
jwieting / paraphrastic-representations-at-scale
View on GitHub
☆74Jul 2, 2021Updated 5 years ago
webis-de / small-text
View on GitHub
Active Learning for Text Classification in Python
☆646May 24, 2026Updated last month
knodle / knodle
View on GitHub
A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…
☆108Sep 10, 2024Updated last year
facebookresearch / SentAugment
View on GitHub
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359Feb 22, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
uds-lsv / anea
View on GitHub
☆19Apr 28, 2021Updated 5 years ago
DFKI-NLP / thermostat
View on GitHub
Collection of NLP model explanations and accompanying analysis tools
☆141Jun 26, 2023Updated 3 years ago
yang-zhang / labse-pytorch
View on GitHub
Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Model
☆21Sep 2, 2020Updated 5 years ago
jayetri / DrugEHRQA-A-Question-Answering-Dataset-on-Structured-and-Unstructured-Electronic-Health-Records
View on GitHub
☆10Nov 7, 2022Updated 3 years ago
microsoft / UST
View on GitHub
Uncertainty-aware Self-training
☆124Dec 20, 2023Updated 2 years ago
awasthiabhijeet / Learning-From-Rules
View on GitHub
Implementation of experiments in paper "Learning from Rules Generalizing Labeled Exemplars" to appear in ICLR2020 (https://openreview.net…
☆50Feb 28, 2023Updated 3 years ago
allenai / flex
View on GitHub
Few-shot NLP benchmark for unified, rigorous eval
☆93Jul 12, 2022Updated 4 years ago
infinitylogesh / mutate
View on GitHub
A library to synthesize text datasets using Large Language Models (LLM)
☆152Jan 17, 2023Updated 3 years ago
StefanHeng / ProgGen
View on GitHub
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
☆17Mar 29, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
megagonlabs / ruler
View on GitHub
Data Programming by Demonstration (DPBD) for Document Classification
☆35Jun 17, 2021Updated 5 years ago
UCSC-REAL / SimiFeat
View on GitHub
☆76May 17, 2023Updated 3 years ago
robustness-gym / summvis
View on GitHub
SummVis is an interactive visualization tool for text summarization.
☆253Jun 17, 2022Updated 4 years ago
JonathanRaiman / ciseau
View on GitHub
Tokenize and clean strings in Python
☆11Jan 11, 2018Updated 8 years ago
sebastian-hofstaetter / matchmaker
View on GitHub
Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch
☆265Jan 27, 2023Updated 3 years ago
princeton-nlp / DensePhrases
View on GitHub
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…
☆607Jun 15, 2022Updated 4 years ago
terrierteam / pyterrier_doc2query
View on GitHub
☆39Nov 27, 2025Updated 7 months ago
amzn / amazon-weak-ner-needle
View on GitHub
Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data
☆101Jul 25, 2023Updated 2 years ago
LinxinS97 / NLPBench
View on GitHub
NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models
☆10Oct 27, 2023Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
salesforce / DocNLI
View on GitHub
☆69May 1, 2025Updated last year
yanzhangnlp / BSL
View on GitHub
Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)
☆30Apr 27, 2022Updated 4 years ago
GEM-benchmark / NL-Augmenter
View on GitHub
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
☆786May 19, 2024Updated 2 years ago
qkaren / unsup_gen_for_cms_reasoning
View on GitHub
☆49Jun 12, 2023Updated 3 years ago
jungokasai / beam_with_patience
View on GitHub
☆46Apr 13, 2022Updated 4 years ago
princeton-nlp / MADE
View on GitHub
EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering
☆68Nov 26, 2021Updated 4 years ago
UKPLab / gpl
View on GitHub
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …
☆343Jul 6, 2023Updated 3 years ago