Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance
☆17Sep 23, 2022Updated 3 years ago
Alternatives and similar repositories for cotrain-prompting
Users that are interested in cotrain-prompting are comparing it to the libraries listed below
Sorting:
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35May 24, 2024Updated last year
- Subset selection / data pruning for weak supervision☆16Jun 21, 2023Updated 2 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Mar 6, 2023Updated 3 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits☆21Dec 15, 2022Updated 3 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Jun 30, 2025Updated 8 months ago
- Gantry is a CLI that streamlines running experiments in Beaker☆32Updated this week
- Lite Self-Training☆30Jul 25, 2023Updated 2 years ago
- Code for paper "Prompt-Based Metric Learning for Few-shot NER".☆23Nov 14, 2023Updated 2 years ago
- Medical Similarity Dataset creation from SNOMED☆28Jul 14, 2022Updated 3 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆33Jun 24, 2023Updated 2 years ago
- ☆53Oct 13, 2025Updated 4 months ago
- Code for ACL 2021 paper "Unsupervised Out-of-Domain Detection via Pre-trained Transformers"☆30Aug 20, 2021Updated 4 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆31Apr 28, 2023Updated 2 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Python code to automatically produce a summary of a piece of text.☆12Sep 8, 2016Updated 9 years ago
- ☆16Feb 28, 2026Updated last week
- scrape web content into readable markdown for llms and human readers☆10Feb 19, 2024Updated 2 years ago
- Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference☆45Nov 28, 2022Updated 3 years ago
- using AI model to infer patient phenotypes from identified named entities (instances of biomedical concepts)☆10Jan 13, 2023Updated 3 years ago
- ☆10May 1, 2025Updated 10 months ago
- A library for Partially Homomorphic Encryption in Python☆12May 30, 2017Updated 8 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 3 years ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12May 31, 2024Updated last year
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"☆14Jul 8, 2025Updated 7 months ago
- A library for handling Structural Causal Models and performing interventional and counterfactual inference on them.☆13Jul 3, 2020Updated 5 years ago
- Code and webpages for our study on teaching humans to defer to an AI☆12Nov 6, 2023Updated 2 years ago
- A labeled dataset for domain specific named entity recognition☆13Mar 1, 2022Updated 4 years ago
- Codebase for the paper "Schema-guided User Satisfaction Modeling for Task-oriented Dialogues"☆11Aug 6, 2025Updated 7 months ago
- Hierarchical Story Generation based on (https://arxiv.org/abs/1805.04833)☆13May 6, 2020Updated 5 years ago
- This repository contains the implementation code for paper: Mixup Your Own Pairs☆12Oct 1, 2023Updated 2 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- Generate Software Bill of Materials for R Things☆19Feb 9, 2024Updated 2 years ago
- A file-backed dictionary for Python☆12Aug 15, 2022Updated 3 years ago
- ☆10Mar 2, 2022Updated 4 years ago
- This repository contains code and data for reproducing the experiments of three papers that focus on two subtasks of table annotation: co…☆12Mar 5, 2025Updated last year
- ☆15Jul 21, 2025Updated 7 months ago