Clearbox-AI / StructuredDataProfilingLinks
A Python library to check for data quality and automatically generate data tests.
☆42Updated 2 years ago
Alternatives and similar repositories for StructuredDataProfiling
Users that are interested in StructuredDataProfiling are comparing it to the libraries listed below
Sorting:
- A Python library to perform NER on structured data and generate PII with Faker☆30Updated last year
- Tutorial for implementing data validation in data science pipelines☆33Updated 3 years ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆510Updated 3 weeks ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆234Updated 3 months ago
- Start a data science project with modern tools☆204Updated 2 years ago
- The refactoring tutorial I wrote for PyConDE 2022. You can also work through the exercises on your own.☆18Updated last year
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Updated 10 months ago
- Explore and compare 1K+ accurate decision trees in your browser!☆169Updated last year
- First-party plugins maintained by the Kedro team.☆112Updated this week
- Data Quality assessment with one line of code☆452Updated last month
- Feedzai's theme for Altair charts.☆15Updated 4 months ago
- Frouros: an open-source Python library for drift detection in machine learning systems.☆246Updated last week
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.☆135Updated 2 years ago
- Python package for Gower distance☆82Updated last year
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging)☆229Updated last week
- ☆26Updated 3 years ago
- ☆12Updated 3 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 3 years ago
- pipreqs with jupyter notebook support☆71Updated 2 years ago
- openclean - Data Cleaning and data profiling library for Python☆83Updated 4 years ago
- eds-scikit is a Python library providing tools to process and analyse OMOP data☆44Updated last year
- Python Biella Group basic template for a modern generic python application☆12Updated 9 months ago
- Clearbox AI's all-in-one solution for generation and evaluation of synthetic tabular and time-series data.☆44Updated 4 months ago
- Editing machine learning models to reflect human knowledge and values☆128Updated 2 years ago
- manipulate pandas dataframes from the comfort of your browser☆174Updated 4 years ago
- A python library for hierarchical classification compatible with scikit-learn☆136Updated 10 months ago
- 🌹 Cookiecutter template featuring the modern and extensible Python project manager hatch☆81Updated last year
- Type System for Data Analysis in Python☆216Updated 11 months ago
- A Python package to build predictive linear and logistic regression models focused on performance and interpretation☆30Updated last year
- ☆73Updated 3 months ago