capitalone / synthetic-data
Generating complex, nonlinear datasets appropriate for use with deep learning/black box models which 'need' nonlinearity
☆44Updated 11 months ago
Alternatives and similar repositories for synthetic-data
Users that are interested in synthetic-data are comparing it to the libraries listed below
Sorting:
- GAM (Global Attribution Mapping) explains the landscape of neural network predictions across subpopulations☆33Updated 3 weeks ago
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!☆132Updated this week
- Abstractions for feature engineering on large graphs of tabular data.☆21Updated this week
- Deploy production-grade Metaflow cloud infrastructure on AWS☆66Updated 2 weeks ago
- Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.☆28Updated 2 months ago
- A Kedro plugin that provides pandas dropin replacements for the pandas datasets (e.g modin and cuDF)☆12Updated 4 years ago
- stratx is a library for A Stratification Approach to Partial Dependence for Codependent Variables☆66Updated last year
- Record matching and entity resolution at scale in Spark☆34Updated last year
- ☆26Updated 4 years ago
- An abstraction layer for parameter tuning☆35Updated 8 months ago
- Projects developed by Domino's R&D team☆76Updated 3 years ago
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated 2 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆54Updated 8 months ago
- Chatbot for BI☆37Updated 2 years ago
- A Causal AI package for causal graphs.☆57Updated last month
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Forecasting for knowable future events using Bayesian informative priors (forecasting with judgmental-adjustment).☆59Updated 3 years ago
- Demo repository to lambda-fy your dbt runs☆11Updated last year
- Assessing whether data from database complies with reference information.☆42Updated last week
- openclean - Data Cleaning and data profiling library for Python☆79Updated 3 years ago
- Helper code to interact with Rasgo via our SDK, PyRasgo☆40Updated 2 years ago
- real-time data + ML pipeline☆54Updated this week
- Tutorial for PyData London 2019 on AB Test by cluster☆13Updated 5 years ago
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.☆42Updated 2 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- A library to find and visualise the most interesting slices in multidimensional data☆108Updated last month
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- First-party plugins maintained by the Kedro team.☆100Updated this week
- mercury-graph is a Python library that offers graph analytics capabilities with a technology-agnostic API.☆30Updated last month
- Repository for the ML Technology Readiness Levels framework☆37Updated 9 months ago