hunterlang / weaksup-subset-selection
Subset selection / data pruning for weak supervision
☆15Updated last year
Alternatives and similar repositories for weaksup-subset-selection:
Users that are interested in weaksup-subset-selection are comparing it to the libraries listed below
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance☆17Updated 2 years ago
- BioELECTRA☆50Updated 3 years ago
- Biomedical Entity Linking Benchmark☆12Updated 4 months ago
- ☆51Updated 3 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆67Updated 2 years ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆40Updated last month
- Embedding Recycling for Language models☆38Updated last year
- LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer☆38Updated 4 months ago
- Search the biomedical literature for protein interactions and protein associations☆11Updated last year
- Contrastive neighbor embeddings☆54Updated last month
- CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22)☆13Updated 2 years ago
- My heuristic script for sentence tokenization of mimic notes☆8Updated 7 years ago
- Code for the paper SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts (AKBC 2021). https://openreview.net/forum?id=OF…☆29Updated 3 years ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆42Updated last month
- EMNLP'2021: Can Language Models be Biomedical Knowledge Bases?☆56Updated 2 years ago
- BioM-Transformers: Building Large Biomedical Language Models with BERT, ALBERT and ELECTRA☆37Updated last year
- Using pretrained language models for biomedical knowledge graph completion.☆46Updated 3 years ago
- Bio relation extraction labeled dataset☆45Updated 3 years ago
- An annotated implementation of the Hyena Hierarchy paper☆32Updated last year
- ☆45Updated 3 years ago
- Cross-domain data integration for named entity disambiguation in biomedical text☆11Updated 3 years ago
- Weak supervision methods for extracting real world evidence from EHRs☆33Updated 5 years ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- An SKLearn-style toolbox for estimating and analyzing models, distributions, and functions with context-specific parameters.☆70Updated 2 months ago
- This repository contains the code used for distillation and fine-tuning of compact biomedical transformers that have been introduced in t…☆18Updated last year
- Bioformer: an efficient BERT model for biomedical text mining☆54Updated 2 years ago
- ☆59Updated last year
- Code repository for BEEP (Biomedical Evidence Enhanced Predictions) clinical outcome prediction system☆26Updated last year
- SciFive: a text-text transformer model for biomedical literature☆94Updated 10 months ago
- Multidocument Summarization for Literature Review Shared Task 2022☆29Updated 2 years ago