Measuring data importance over ML pipelines using the Shapley value.
☆45 · Aug 26, 2025 · Updated 10 months ago
Alternatives and similar repositories for datascope
Users who are interested in datascope are comparing it to the libraries listed below.
- [CVPR 2021] Scalability vs. Utility: Do We Have to Sacrifice One for the Other in Data Importance Quantification? ☆34 · Dec 26, 2020 · Updated 5 years ago
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023) ☆21 · Jul 26, 2023 · Updated 2 years ago
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023) ☆101 · Feb 4, 2025 · Updated last year
- Official Repo for "Efficient task-specific data valuation for nearest neighbor algorithms" ☆27 · Feb 21, 2020 · Updated 6 years ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral) ☆44 · Nov 10, 2022 · Updated 3 years ago
- Data Shapley: Equitable Valuation of Data for Machine Learning ☆298 · May 1, 2024 · Updated 2 years ago
- Now it is exported as an official example ☆13 · Jan 24, 2018 · Updated 8 years ago
- Collection of available data sources for cognitively-inspired NLP ☆38 · Jul 14, 2020 · Updated 5 years ago
- Functions for easily making publication-quality figures with matplotlib. ☆19 · Jan 20, 2024 · Updated 2 years ago
- Distributional Shapley: A Distributional Framework for Data Valuation ☆32 · May 1, 2024 · Updated 2 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023). ☆55 · Jun 5, 2024 · Updated 2 years ago
- Implementation of FedBary ☆17 · Mar 24, 2025 · Updated last year
- Resources for recent AI systems (deployment concerns, cost and accessibility). -- closed ☆12 · May 29, 2021 · Updated 5 years ago
- Implementation of Geolocated Articles Processing and Poverty Mapping - [KDD19] ☆19 · Apr 24, 2021 · Updated 5 years ago
- pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation ☆146 · Apr 13, 2026 · Updated 2 months ago
- AQuA: A Benchmarking Tool for Label Quality Assessment, NeurIPS'23 D&B ☆23 · Oct 17, 2023 · Updated 2 years ago
- Domain Adaptation ☆23 · Nov 27, 2021 · Updated 4 years ago
- Advanced Machine Learning Course ☆13 · Nov 16, 2024 · Updated last year
- A Data-Centric library providing a unified interface for state-of-the-art methods for hardness characterisation of data points. ☆26 · Mar 6, 2025 · Updated last year
- ☆13 · Oct 5, 2020 · Updated 5 years ago
- a (work-in-progress) grammatical WYSIWYG text editor ☆12 · Sep 13, 2018 · Updated 7 years ago
- 🦀 A Rust implementation of a RoBERTa classification model for the SNLI dataset ☆13 · Sep 13, 2021 · Updated 4 years ago
- LaTeX Template for Fudan University School of Computer Science 2024 ☆12 · May 21, 2024 · Updated 2 years ago
- ☆11 · Oct 8, 2021 · Updated 4 years ago
- Shanghai real estate records: an analysis of the strengths and weaknesses of each residential community, including location, subway access, and school district ☆10 · Apr 7, 2019 · Updated 7 years ago
- This repository contains the artifacts accompanied by the paper "Fair Preprocessing" ☆13 · Jul 20, 2021 · Updated 4 years ago
- 💱 A curated list of data valuation (DV) to design your next data marketplace ☆143 · Feb 20, 2025 · Updated last year
- ☆10 · Sep 14, 2022 · Updated 3 years ago
- ☆13 · Oct 3, 2024 · Updated last year
- Performant, composable online learning ☆16 · Feb 22, 2021 · Updated 5 years ago
- ☆23 · Nov 1, 2022 · Updated 3 years ago
- Moyuk turns TypeScript functions into web apps in a few seconds. ☆14 · Jun 1, 2025 · Updated last year
- ☆15 · Nov 3, 2022 · Updated 3 years ago
- An Interactive Tool for Scalable and Reproducible Error Analysis. ☆109 · Jul 22, 2021 · Updated 4 years ago
- ☆10 · Jun 19, 2023 · Updated 3 years ago
- The official implementation of "The Shapley Value of Classifiers in Ensemble Games" (CIKM 2021). ☆226 · Jan 1, 2026 · Updated 6 months ago
- a distributed computation platform for running Python and Bash computation tasks on multiple nodes ☆12 · Mar 19, 2025 · Updated last year
- ☆10 · Jun 13, 2021 · Updated 5 years ago
- go-active-learning is a command-line annotation tool for binary classification problems, written in Go. ☆15 · Apr 3, 2021 · Updated 5 years ago