autonlab / aqua
AQuA: A Benchmarking Tool for Label Quality Assessment
☆21Updated last year
Alternatives and similar repositories for aqua:
Users that are interested in aqua are comparing it to the libraries listed below
- Official repo of Progressive Data Expansion: data, code and evaluation☆28Updated last year
- Testing Language Models for Memorization of Tabular Datasets.☆33Updated last month
- Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift☆34Updated last year
- In-context Example Selection with Influences☆15Updated last year
- ☆34Updated last year
- Personal implementation of ASIF by Antonio Norelli☆25Updated 10 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆26Updated last year
- A weak supervision framework for (partial) labeling functions☆16Updated 8 months ago
- ☆15Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆42Updated last month
- ☆23Updated 4 months ago
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.☆53Updated last year
- ☆28Updated last year
- Research on Tabular Foundation Models☆44Updated 3 months ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆15Updated 2 years ago
- Tasks for describing differences between text distributions.☆16Updated 8 months ago
- ☆10Updated 6 months ago
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆35Updated 5 months ago
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)☆18Updated last year
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆25Updated 5 months ago
- This is an official repository for "Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources" (…☆14Updated last year
- Code repository for the paper "Mission: Impossible Language Models."☆50Updated this week
- Recycling diverse models☆44Updated 2 years ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆12Updated 3 months ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Updated last year
- Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring 2025). LLM architectures, training paradigms (pre- and post-t…☆30Updated this week
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆29Updated 2 months ago
- Code for paper: Are Large Language Models Post Hoc Explainers?☆31Updated 8 months ago
- How do transformer LMs encode relations?☆46Updated last year
- MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023)☆32Updated last year