ing-bank/popmon

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ing-bank/popmon)

ing-bank / popmon

Monitor the stability of a Pandas or Spark dataframe ⚙︎

☆512

Alternatives and similar repositories for popmon

Users that are interested in popmon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ilanfri / TorchCox
View on GitHub
A validated, reasonably fast, and easily extensible implementation of a Cox model in PyTorch
☆13Feb 4, 2021Updated 5 years ago
pycodehash / pycodehash
View on GitHub
PyCodeHash is a generic data and code hashing library that facilitates downstream caching.
☆13Jan 26, 2026Updated 6 months ago
dwarszawski / kafka-connect-mlflow
View on GitHub
Dedicated Kafka Connector to track changes in MLflow Model Registry
☆10Jan 8, 2021Updated 5 years ago
stephanecollot / sparkmon
View on GitHub
Spark Monitoring
☆14Feb 28, 2023Updated 3 years ago
ing-bank / ordeq
View on GitHub
Ordeq simplifies IO and modularizes pipeline logic.
☆41Dec 19, 2025Updated 7 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Data-Centric-AI-Community / fg-data-profiling
View on GitHub
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
☆13,653Apr 22, 2026Updated 3 months ago
dylan-profiler / visions
View on GitHub
Type System for Data Analysis in Python
☆222May 27, 2026Updated 2 months ago
ing-bank / probatus
View on GitHub
SHAP-based validation for linear and tree-based models. Applied to binary, multiclass and regression problems.
☆154Apr 19, 2025Updated last year
ing-bank / spark-matcher
View on GitHub
Record matching and entity resolution at scale in Spark
☆36Oct 31, 2023Updated 2 years ago
histogrammar / histogrammar-python
View on GitHub
Python implementation of Histogrammar, a package for creating histograms with Numpy, Pandas and Spark.
☆36Sep 2, 2025Updated 10 months ago
ing-bank / EntityMatchingModel
View on GitHub
Entity Matching Model solves the problem of matching company names between two possibly very large datasets.
☆99May 18, 2026Updated 2 months ago
koaning / scikit-lego
View on GitHub
Extra blocks for scikit-learn pipelines.
☆1,409Updated this week
fivetran / great_expectations
View on GitHub
Always know what to expect from your data.
☆11,680Updated this week
feature-engine / feature_engine
View on GitHub
Feature engineering and selection open-source Python library compatible with sklearn.
☆2,264Updated this week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
RUrlus / ModelMetricUncertainty
View on GitHub
Python package for Model Metric Uncertainty estimation
☆17Updated this week
unionai-oss / pandera
View on GitHub
A light-weight, flexible, and expressive statistical data testing library
☆4,413Jul 18, 2026Updated last week
NannyML / nannyml
View on GitHub
nannyml: post-deployment data science in python
☆2,147Jul 12, 2025Updated last year
whylabs / whylogs
View on GitHub
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model perf…
☆2,829Jan 10, 2025Updated last year
kedro-org / kedro
View on GitHub
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…
☆10,938Updated this week
skrub-data / skrub
View on GitHub
Machine learning with dataframes
☆1,643Updated this week
fbdesignpro / sweetviz
View on GitHub
Visualize and compare datasets, target values and associations, with one line of code.
☆3,121Apr 11, 2026Updated 3 months ago
re-data / re-data
View on GitHub
re_data - fix data issues before your users & CEO would discover them 😊
☆1,566Apr 30, 2024Updated 2 years ago
SeldonIO / alibi-detect
View on GitHub
Algorithms for outlier, adversarial and drift detection
☆2,541Dec 11, 2025Updated 7 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
hi-primus / optimus
View on GitHub
Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
☆1,536Dec 2, 2024Updated last year
sktime / sktime
View on GitHub
A unified framework for machine learning with time series
☆9,889Updated this week
ing-bank / sparse_dot_topn
View on GitHub
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
☆424Updated this week
fugue-project / fugue
View on GitHub
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…
☆2,170May 19, 2026Updated 2 months ago
orchest / orchest
View on GitHub
Build data pipelines, the easy way 🛠️
☆4,135Jun 6, 2023Updated 3 years ago
getindata / kedro-airflow-k8s
View on GitHub
Kedro Plugin to support running pipelines on Kubernetes using Airflow.
☆27Mar 11, 2025Updated last year
brickfrog / kedro-pandas-profiling
View on GitHub
A simple wrapper to use Pandas Profiling easily in Kedro
☆17Apr 12, 2021Updated 5 years ago
alteryx / evalml
View on GitHub
EvalML is an AutoML library written in python.
☆852Jan 14, 2026Updated 6 months ago
fivetran / great_expectations_action
View on GitHub
A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.
☆84May 10, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sfu-db / dataprep
View on GitHub
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
☆2,246Jun 27, 2024Updated 2 years ago
getindata / kedro-kubeflow
View on GitHub
Kedro Plugin to support running workflows on Kubeflow Pipelines
☆57May 29, 2026Updated 2 months ago
dylan-profiler / compressio
View on GitHub
Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…
☆30Nov 10, 2022Updated 3 years ago
AutoViML / featurewiz
View on GitHub
Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshad…
☆683Feb 19, 2025Updated last year
MAIF / eurybia
View on GitHub
⚓ Eurybia monitors model drift over time and securizes model deployment with data validation
☆224Mar 23, 2026Updated 4 months ago
oegedijk / explainerdashboard
View on GitHub
Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
☆2,503Feb 11, 2026Updated 5 months ago
deepchecks / deepchecks
View on GitHub
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…
☆4,041Dec 28, 2025Updated 7 months ago