cleanlab / multiannotator-benchmarks
Benchmarking algorithms for assessing quality of data labeled by multiple annotators
☆32 Updated 2 years ago
Alternatives and similar repositories for multiannotator-benchmarks:
Users who are interested in multiannotator-benchmarks are comparing it to the libraries listed below.
- Ranking of fine-tuned HF models as base models. ☆35 Updated last year
- Embedding Recycling for Language models ☆38 Updated last year
- Advances in Neural Information Processing Systems (NeurIPS 2021) ☆22 Updated 2 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations ☆14 Updated 2 years ago
- ☆8 Updated 9 months ago
- ☆44 Updated 5 months ago
- ☆31 Updated 2 years ago
- ☆29 Updated last year
- Google Research ☆46 Updated 2 years ago
- Parkar and Kim et al.'s paper on "Can LLMs Select Important Instructions to Annotate?" ☆12 Updated 9 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated last year
- ☆11 Updated 4 months ago
- Using short models to classify long texts ☆21 Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 Updated last year
- [CVPR'23 Highlight] Heterogeneous Continual Learning. ☆16 Updated last year
- ☆24 Updated last year
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022). ☆24 Updated 2 years ago
- Repository for the paper "Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning" ☆36 Updated last year
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare… ☆18 Updated last year
- Finding semantically meaningful and accurate prompts. ☆46 Updated last year
- ☆13 Updated 7 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 Updated this week
- A library for squeakily cleaning and filtering language datasets. ☆47 Updated last year
- ☆28 Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆23 Updated 3 months ago
- Training and Inference Notebooks for the RedPajama (OpenLlama) models ☆18 Updated last year
- Code for paper: "Privately generating tabular data using language models". ☆15 Updated last year
- efficient query encoding for dense retrieval ☆11 Updated 8 months ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 Updated 2 years ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree of classifier calibration. ☆13 Updated last year