Clearbox-AI / StructuredDataProfiling
A Python library to check for data quality and automatically generate data tests.
☆42 Updated last year
Alternatives and similar repositories for StructuredDataProfiling:
Users who are interested in StructuredDataProfiling are comparing it to the libraries listed below.
- A Python library to perform NER on structured data and generate PII with Faker ☆29 Updated 10 months ago
- An open-source Python library for the assessment of utility and privacy performance of any tabular synthetic dataset. ☆23 Updated last month
- Python Biella Group basic template for a modern generic Python application ☆12 Updated last week
- Clearbox AI's all-in-one solution for generation and evaluation of synthetic tabular and time-series data. ☆42 Updated last week
- An agnostic wrapper for the most common ML frameworks. ☆14 Updated 3 years ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer. ☆130 Updated last year
- ⚓ Eurybia monitors model drift over time and secures model deployment with data validation ☆207 Updated 6 months ago
- Modern Data Engineering Project ☆11 Updated 2 years ago
- Tutorial for implementing data validation in data science pipelines ☆33 Updated 2 years ago
- A simple and easy-to-use Data Quality (DQ) tool built with Python. ☆50 Updated last year
- Explore and compare 1K+ accurate decision trees in your browser! ☆160 Updated last year
- Monitor the stability of a Pandas or Spark dataframe ⚙︎ ☆500 Updated 3 months ago
- Doubt your data, find bad labels. ☆511 Updated 9 months ago
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more! ☆68 Updated 7 months ago
- A tool to automatically infer column data types in .csv files ☆35 Updated 2 years ago
- Set up a Cost-Effective Modern Data Stack for a Charity ☆19 Updated 3 weeks ago
- First-party plugins maintained by the Kedro team. ☆99 Updated last week
- A copier template for Kedro projects ☆9 Updated 7 months ago
- Identify bias and measure fairness of your data ☆90 Updated 3 weeks ago
- A Python library to test your SQL models using mocked input data ☆44 Updated last year
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆53 Updated 7 months ago
- Tutorials on creating a reproducible and maintainable data science project ☆143 Updated 2 years ago
- ACV is a Python library that provides explanations for any machine learning model or data. It gives local rule-based explanations for any… ☆100 Updated 2 years ago
- A project to kickstart your ML development ☆30 Updated 8 months ago
- Notebooks that support blog posts and tech talks on Dask / Coiled. ☆47 Updated last month
- ANJANA is a Python library for anonymizing sensitive data ☆30 Updated this week
- IbisML is a library for building scalable ML pipelines using Ibis. ☆108 Updated 4 months ago
- Cookiecutter template for Python packages ☆104 Updated 2 months ago
- Data Quality assessment with one line of code ☆438 Updated 2 weeks ago
- ☆26 Updated 2 years ago