mostly-ai / mostlyai
Synthetic Data SDK β¨
β351Updated this week
Alternatives and similar repositories for mostlyai:
Users that are interested in mostlyai are comparing it to the libraries listed below
- Synthetic Data Quality Assurance πβ30Updated last week
- Synthetic Data Engine πβ49Updated last week
- Metrics to evaluate quality and efficacy of synthetic datasets.β228Updated this week
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.β619Updated last week
- An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomerβ217Updated last week
- ANJANA is a Python library for anonymizing sensitive dataβ29Updated 2 weeks ago
- Frouros: an open-source Python library for drift detection in machine learning systems.β210Updated last month
- A library of Reversible Data Transformsβ124Updated this week
- Benchmarking synthetic data generation methods.β271Updated last week
- Kedro Plugin to support running workflows on Kubeflow Pipelinesβ53Updated 6 months ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.β130Updated last year
- A curated list of awesome synthetic data tools (open source and commercial).β162Updated last year
- A template to kick-start your Python project β¨πβ51Updated 3 months ago
- A project to kickstart your ML developmentβ30Updated 7 months ago
- A kedro plugin that streamlines the integration between Kedro projects and third-party applications, making it easier for you to developβ¦β38Updated last month
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profilβ¦β74Updated 10 months ago
- A Python library to perform NER on structured data and generate PII with Fakerβ29Updated 9 months ago
- A novel approach for synthesizing tabular data using pretrained large language modelsβ304Updated 4 months ago
- 𦫠MLOps for (online) machine learningβ87Updated last year
- Fiddler Auditor is a tool to evaluate language models.β177Updated last year
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasetsβ65Updated 2 years ago
- β Eurybia monitors model drift over time and securizes model deployment with data validationβ206Updated 5 months ago
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.β223Updated 2 weeks ago
- Binder to the cosmograph visual analytics for big graphsβ105Updated 2 weeks ago
- Joining the modern data stack with the modern ML stackβ196Updated last year
- A series of Terraform based recipes to provision popular MLOps stacks on the cloud.β254Updated 5 months ago
- Free Open-source ML observability course for data scientists and ML engineers. Learn how to monitor and debug your ML models in productioβ¦β79Updated last year
- Template repo for kickstarting recipes for regression use caseβ54Updated 3 months ago
- Start building and deploying Python packages and Docker images for MLOps tasks.β398Updated 3 weeks ago
- MLOps Cookiecutter Template: A Base Project Structure for Secure Production ML Engineeringβ40Updated 4 months ago