Synthetic data generators for tabular and time-series data
☆1,612Mar 2, 2026Updated this week
Alternatives and similar repositories for ydata-synthetic
Users that are interested in ydata-synthetic are comparing it to the libraries listed below
Sorting:
- Synthetic data generation for tabular data☆3,434Updated this week
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.☆672Jun 24, 2025Updated 8 months ago
- Codebase for Time-series Generative Adversarial Networks (TimeGAN) - NeurIPS 2019☆1,036Feb 5, 2026Updated last month
- Tutorials for YData's Fabric platform☆35May 12, 2025Updated 9 months ago
- Conditional GAN for generating synthetic tabular data.☆1,525Feb 22, 2026Updated last week
- Data Quality assessment with one line of code☆453Updated this week
- We well know GANs for success in the realistic image generation. However, they can be applied in tabular data generation. We will review …☆564Jun 24, 2025Updated 8 months ago
- Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖☆345Feb 10, 2026Updated 3 weeks ago
- [IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions☆311Nov 3, 2023Updated 2 years ago
- Fabric SDK to interact with the Fabric platform☆22Updated this week
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.☆640Feb 11, 2026Updated 3 weeks ago
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,399Updated this week
- Benchmarking synthetic data generation methods.☆300Updated this week
- Synthetic Data SDK ✨☆747Jan 13, 2026Updated last month
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasets☆69Feb 22, 2023Updated 3 years ago
- This repository is a non-official implementation of TimeGAN (Yoon et al., NIPS2019) using PyTorch.☆84Jul 26, 2022Updated 3 years ago
- Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Fro…☆7,272Feb 27, 2026Updated last week
- Make your dataset talk to you. The AI assistant for data preparation.☆11Jan 12, 2024Updated 2 years ago
- Feature engineering and selection open-source Python library compatible with sklearn.☆2,204Feb 25, 2026Updated last week
- MTSS-GAN: Multivariate Time Series Simulation with Generative Adversarial Networks (by @firmai)☆93Sep 29, 2020Updated 5 years ago
- Metrics to evaluate quality and efficacy of synthetic datasets.☆256Feb 26, 2026Updated last week
- ☆135Jul 16, 2024Updated last year
- ☆274Apr 3, 2024Updated last year
- 🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models☆3,149Feb 6, 2026Updated last month
- Scalable and user friendly neural forecasting algorithms.☆3,991Updated this week
- nannyml: post-deployment data science in python☆2,126Jul 12, 2025Updated 7 months ago
- EvalML is an AutoML library written in python.☆845Jan 14, 2026Updated last month
- Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data …☆11,346Jan 13, 2026Updated last month
- TimeVAE implementation in keras/tensorflow☆172Mar 30, 2025Updated 11 months ago
- Generation and evaluation of synthetic time series datasets (also, augmentations, visualizations, a collection of popular datasets) NeurI…☆217Nov 8, 2025Updated 3 months ago
- TTS-GAN: A Transformer-based Time-Series Generative Adversarial Network☆285Mar 10, 2024Updated last year
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…☆3,982Dec 28, 2025Updated 2 months ago
- 🌊 Online machine learning in Python☆5,735Updated this week
- Experimenting with generating synthetic data using ydata-synthetic