mostly-ai / mostlyai-qa
Synthetic Data Quality Assurance
★31 · Updated this week
Alternatives and similar repositories for mostlyai-qa:
Users who are interested in mostlyai-qa are comparing it to the libraries listed below.
- Synthetic Data SDK ✨ ★371 · Updated this week
- Metrics to evaluate quality and efficacy of synthetic datasets. ★228 · Updated this week
- Benchmarking synthetic data generation methods. ★271 · Updated 2 weeks ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning. ★625 · Updated 2 weeks ago
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging) ★210 · Updated 2 weeks ago
- A novel approach for synthesizing tabular data using pretrained large language models ★305 · Updated 5 months ago
- Synthetic data generation for tabular data ★2,566 · Updated this week
- A Unified Framework for Quantifying Privacy Risk in Synthetic Data according to the GDPR ★82 · Updated 3 weeks ago
- ★264 · Updated 11 months ago
- A project to kickstart your ML development ★30 · Updated 7 months ago
- Generative adversarial training for generating synthetic tabular data. ★285 · Updated 2 years ago
- The core repository to support participants through the UN PET Lab Hackathon 2022. Registration at: https://petlab.officialstatistic… ★19 · Updated 2 years ago
- Conditional GAN for generating synthetic tabular data. ★1,362 · Updated 2 weeks ago
- Evaluate real and synthetic datasets against each other ★86 · Updated 3 months ago
- A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels f… ★505 · Updated this week
- SHAP-based validation for linear and tree-based models. Applied to binary, multiclass and regression problems. ★136 · Updated this week
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer. ★130 · Updated last year
- A library of Reversible Data Transforms ★124 · Updated this week
- Official GitHub for CTAB-GAN+ ★74 · Updated 10 months ago
- Food for thoughts around data contracts ★25 · Updated 3 weeks ago
- ARXaaS is an "Anonymization as a Service" project built on top of the ARX library ★21 · Updated last year
- Polars Cookbook, Published by Packt ★319 · Updated 5 months ago
- pyCANON is a Python library and CLI to assess the values of the parameters associated with the most common privacy-preserving techniques. ★35 · Updated this week
- Official git for "CTAB-GAN: Effective Table Data Synthesizing" ★83 · Updated last year
- Templates for your Kedro projects. ★72 · Updated this week
- A kedro plugin that streamlines the integration between Kedro projects and third-party applications, making it easier for you to develop… ★38 · Updated last month
- First-party plugins maintained by the Kedro team. ★98 · Updated this week
- Monitor the stability of a Pandas or Spark dataframe ★498 · Updated 2 months ago
- Kickstart your MLOps initiative with a flexible, robust, and productive Python package. ★1,206 · Updated last week
- Examples of data science projects created with Kedro. ★172 · Updated last year