Measuring data importance over ML pipelines using the Shapley value.
☆45Aug 26, 2025Updated 6 months ago
Alternatives and similar repositories for datascope
Users that are interested in datascope are comparing it to the libraries listed below
Sorting:
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆43Nov 10, 2022Updated 3 years ago
- PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuatio…☆27Jan 21, 2022Updated 4 years ago
- Scalable data valuation using optimal transport (ICLR 2025)☆13Jul 15, 2025Updated 7 months ago
- Automatic Differentiation for Gradient Boosted Decision Trees.☆13May 17, 2022Updated 3 years ago
- Functions for easily making publication-quality figures with matplotlib.☆19Jan 20, 2024Updated 2 years ago
- Alpha version of our data-centric visual benchmark for training data selection☆16Aug 28, 2023Updated 2 years ago
- Data Shapley: Equitable Valuation of Data for Machine Learning☆288May 1, 2024Updated last year
- Implementation of Geolocated Articles Processing and Poverty Mapping - [KDD19]☆19Apr 24, 2021Updated 4 years ago
- The smart-match module contains functions for calculating strings/sets similarity.☆14Feb 22, 2024Updated 2 years ago
- ☆23Nov 1, 2022Updated 3 years ago
- Graph Agglomerative Clustering Library☆23Jul 4, 2019Updated 6 years ago
- Source code and data for the paper "SALT: Sales Autocompletion Linked Business Tables Dataset"☆33Jul 9, 2025Updated 7 months ago
- Autoencoder network for imputing missing values☆27May 13, 2019Updated 6 years ago
- Implementation of Deep Soft-K means☆29Apr 28, 2021Updated 4 years ago
- GEDFN: Graph-Embedded Deep Feedforward Network☆23Apr 23, 2019Updated 6 years ago
- Distributional Shapley: A Distributional Framework for Data Valuation☆30May 1, 2024Updated last year
- ☆11Dec 1, 2023Updated 2 years ago
- Advanced Machine Learning Course☆13Nov 16, 2024Updated last year
- pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation☆142Feb 11, 2026Updated 2 weeks ago
- Official implementation of FedGAT: Generative Autoregressive Transformers for Model-Agnostic Federated MRI Reconstruction (https://arxiv.…☆20May 22, 2025Updated 9 months ago
- Two-stream remote sensing model for water quality mapping: 2SeaColor☆10Feb 2, 2021Updated 5 years ago
- Demo repository to lambda-fy your dbt runs☆11Sep 7, 2023Updated 2 years ago
- Image Annotator Web-app using Plotly-Dash, MongoDB, and MLflow☆14Nov 22, 2022Updated 3 years ago
- Painless and simple development environments using Vagrant☆11Jun 9, 2015Updated 10 years ago
- The impact of text pre-processing methods on the performance of deep learning models for the toxic comments classification☆10Jan 12, 2021Updated 5 years ago
- Framework for geospatial research CNN development.☆11Apr 10, 2021Updated 4 years ago
- Data visualization workshop☆11May 12, 2020Updated 5 years ago
- ☆11Feb 18, 2022Updated 4 years ago
- ☆10Dec 14, 2020Updated 5 years ago
- SODEN: A Scalable Continuous-Time Survival Model through Ordinary Differential Equation Networks☆14Mar 2, 2023Updated 3 years ago
- ☆11Jan 25, 2021Updated 5 years ago
- Maps Medicare LDS claims data to the Tuva Input Layer so you can easily run the Tuva Project.☆12Dec 15, 2025Updated 2 months ago
- Parse Bluetooth Low Energy peripheral advertising packets☆14Nov 7, 2021Updated 4 years ago
- Pytorch Implementation of Lightweight and robust representation of economic scales from satellite imagery.☆13Nov 21, 2019Updated 6 years ago
- ☆12Jan 5, 2026Updated last month
- A data analysis package for high-dimensional, multi-block data.☆12Jan 13, 2022Updated 4 years ago
- ☆12May 14, 2025Updated 9 months ago
- A system used to request absentee ballots in the state of Virginia. 📬☆10Sep 14, 2023Updated 2 years ago
- Online Comment Toxicity Analysis using averaging the Classifiers and used both char level as well as word level n-grams.☆10Mar 31, 2018Updated 7 years ago