Clearbox-AI / StructuredDataProfiling
A Python library to check for data quality and automatically generate data tests.
☆42Updated last year
Alternatives and similar repositories for StructuredDataProfiling:
Users that are interested in StructuredDataProfiling are comparing it to the libraries listed below
- A Python library to perform NER on structured data and generate PII with Faker☆29Updated 9 months ago
- An open-source Python library for the assessment of utility and privacy performance of any tabular synthetic dataset.☆22Updated 2 weeks ago
- Python Biella Group basic template for a modern generic python application☆12Updated 9 months ago
- Modern Data Engineering Project☆11Updated 2 years ago
- Clearbox AI's all-in-one solution for generation and evaluation of synthetic tabular and time-series data.☆40Updated 2 weeks ago
- An agnostic wrapper for the most common ML frameworks.☆14Updated 3 years ago
- ⚓ Eurybia monitors model drift over time and securizes model deployment with data validation☆206Updated 5 months ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆499Updated 2 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 6 months ago
- Repository con materiale delle lezioni e degli argomenti affrontati☆32Updated last week
- A Python library to test your SQL models using mocked input data☆44Updated 11 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆108Updated 3 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆183Updated this week
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 7 months ago
- Arcan public trial docker files.☆10Updated 5 months ago
- ☆35Updated 9 months ago
- ☆26Updated 2 years ago
- Frouros: an open-source Python library for drift detection in machine learning systems.☆210Updated last month
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.☆130Updated last year
- Pandas Chaining Ninja is a CheatSheet-like repo that helps you to writing pandas code in a more readable and maintainable way using the "…☆85Updated 3 months ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆72Updated 3 months ago
- Assessing whether data from database complies with reference information.☆42Updated 2 weeks ago
- Google Search for The Cheshire Cat AI☆11Updated last year
- SQLMesh example projects☆26Updated 4 months ago
- A tool to automatically infer columns data types in .csv files☆35Updated 2 years ago
- Tutorial for implementing data validation in data science pipelines☆33Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆49Updated 4 months ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Updated 2 months ago
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆82Updated this week
- A project to kickstart your ML development☆30Updated 7 months ago