hunterlang / weaksup-subset-selection
Subset selection / data pruning for weak supervision
☆15 · Updated last year
Related projects
Alternatives and complementary repositories for weaksup-subset-selection
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper) ☆64 · Updated 2 years ago
- LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer ☆35 · Updated 4 months ago
- CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22) ☆13 · Updated 2 years ago
- EMNLP'2021: Can Language Models be Biomedical Knowledge Bases? ☆54 · Updated last year
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance ☆17 · Updated 2 years ago
- Hierarchical Attention Transformers (HAT) ☆45 · Updated 10 months ago
- ☆45 · Updated 2 years ago
- Embedding Recycling for Language models ☆38 · Updated last year
- BioM-Transformers: Building Large Biomedical Language Models with BERT, ALBERT and ELECTRA ☆34 · Updated 9 months ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023 ☆19 · Updated last year
- ☆19 · Updated 2 years ago
- Search the biomedical literature for protein interactions and protein associations ☆11 · Updated 11 months ago
- Lightweight implementations of generative label models for weakly supervised machine learning ☆18 · Updated 7 months ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers ☆47 · Updated last year
- An annotated implementation of the Hyena Hierarchy paper ☆31 · Updated last year
- My heuristic script for sentence tokenization of mimic notes ☆8 · Updated 7 years ago
- Biomedical Entity Linking Benchmark ☆10 · Updated last week
- PyTorch implementation for MRL ☆18 · Updated 9 months ago
- A fast implementation of T5/UL2 in PyTorch using Flash Attention ☆71 · Updated last month
- Repository collecting resources and best practices to improve experimental rigour in deep learning research. ☆27 · Updated last year
- Learning to Split for Automatic Bias Detection ☆47 · Updated last year
- Code for the paper SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts (AKBC 2021). https://openreview.net/forum?id=OF… ☆25 · Updated 3 years ago
- LTG-Bert ☆29 · Updated 10 months ago
- ☆18 · Updated 3 years ago
- JAX/Flax implementation of the Hyena Hierarchy ☆31 · Updated last year
- Using pretrained language models for biomedical knowledge graph completion. ☆46 · Updated 3 years ago
- ☆48 · Updated 2 years ago
- Efficient Conformal Prediction via Cascaded Inference with Expanded Admission ☆20 · Updated 3 years ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists" ☆46 · Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data; it should also work with any Hugging Face text dataset. ☆92 · Updated last year