Clearbox-AI / StructuredDataProfilingLinks
A Python library to check for data quality and automatically generate data tests.
☆42Updated 2 years ago
Alternatives and similar repositories for StructuredDataProfiling
Users that are interested in StructuredDataProfiling are comparing it to the libraries listed below
Sorting:
- A Python library to perform NER on structured data and generate PII with Faker☆30Updated last year
 - eds-scikit is a Python library providing tools to process and analyse OMOP data☆43Updated 10 months ago
 - Tutorial for implementing data validation in data science pipelines☆33Updated 3 years ago
 - Monitor the stability of a Pandas or Spark dataframe ⚙︎☆507Updated last month
 - 🌳 WALD Stack Demo 🏎️☆32Updated last year
 - Python Biella Group basic template for a modern generic python application☆12Updated 6 months ago
 - Cohort extractor tool which can generate dummy data, or real data against OpenSAFELY-compliant research databases☆38Updated 4 months ago
 - Data Quality assessment with one line of code☆452Updated 3 weeks ago
 - An open-source Python library for the assessment of utility and privacy performance of any tabular synthetic dataset.☆23Updated 4 months ago
 - Set up a Cost-Effective Modern Data Stack for a Charity☆19Updated 7 months ago
 - Start a data science project with modern tools☆202Updated 2 years ago
 - Type System for Data Analysis in Python☆213Updated 9 months ago
 - Possibly the fastest DataFrame-agnostic quality check library in town.☆223Updated last week
 - Clearbox AI's all-in-one solution for generation and evaluation of synthetic tabular and time-series data.☆44Updated last month
 - Feedzai's theme for Altair charts.☆15Updated 2 months ago
 - Explore and compare 1K+ accurate decision trees in your browser!☆169Updated last year
 - Modular, fast NLP framework, compatible with Pytorch and spaCy, offering tailored support for French clinical notes.☆141Updated last week
 - Simple & Easy-to-use python modules to perform Quick Exploratory Data Analysis for any structured dataset!☆105Updated 2 years ago
 - Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated last year
 - ⚓ Eurybia monitors model drift over time and securizes model deployment with data validation☆214Updated last year
 - A Python package to build predictive linear and logistic regression models focused on performance and interpretation☆30Updated last year
 - ☆26Updated 3 years ago
 - First-party plugins maintained by the Kedro team.☆106Updated this week
 - Swiple enables you to easily observe, understand, validate and improve the quality of your data☆84Updated this week
 - ☆271Updated last year
 - An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis.☆12Updated 8 months ago
 - ☆14Updated 7 months ago
 - Plugins, extensions, case studies, articles, and video tutorials for Kedro☆89Updated 10 months ago
 - Synthetic data generation by a Variational AutoEncoder with Differential Privacy assessed using Synthetic Data Vault metrics☆46Updated 2 years ago
 - manipulate pandas dataframes from the comfort of your browser☆174Updated 4 years ago