A curated list of resources dedicated to synthetic data
☆141 · Jul 29, 2022 · Updated 3 years ago
Alternatives and similar repositories for awesome-synthetic-data
Users who are interested in awesome-synthetic-data compare it to the libraries listed below.
- Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs. ☆28 · Mar 5, 2025 · Updated last year
- A curated list of awesome synthetic data tools (open source and commercial). ☆244 · Jan 11, 2024 · Updated 2 years ago
- Proof of concept code from Gretel.ai and Illumina using generative neural networks to create synthetic versions of mouse genotype and phe… ☆33 · Jan 19, 2022 · Updated 4 years ago
- Generative models to automatically anonymize data to meet GDPR & CCPA standards. ☆31 · Jan 24, 2023 · Updated 3 years ago
- Use FastCUT with public map images and location data from a few cities to generate realistic synthetic location data for any city in the … ☆24 · Feb 8, 2022 · Updated 4 years ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning. ☆672 · Jun 24, 2025 · Updated 8 months ago
- A curated list of awesome resources for creating synthetic data ☆45 · Feb 16, 2022 · Updated 4 years ago
- Python toolbox for multi-omics data mapping and analysis ☆26 · Apr 13, 2023 · Updated 2 years ago
- Synthetic data generation by a Variational AutoEncoder with Differential Privacy assessed using Synthetic Data Vault metrics ☆46 · Apr 4, 2023 · Updated 2 years ago
- study pytorch ☆15 · Oct 14, 2018 · Updated 7 years ago
- Chat with Time-Series Data in PostgreSQL using LlamaIndex and Timescale Vector ☆12 · Mar 24, 2024 · Updated last year
- [APSIPA ASC 2023] The official code of paper, "FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Au… ☆16 · Mar 7, 2024 · Updated 2 years ago
- Introduction to Econometrics at the University of Oregon (EC421) during Spring quarter, 2020. Taught by Ed Rubin ☆14 · Jan 27, 2022 · Updated 4 years ago
- ☆16 · May 12, 2023 · Updated 2 years ago
- JAX-RS Plugin for Grails ☆50 · Apr 3, 2016 · Updated 9 years ago
- ☆17 · Jul 15, 2024 · Updated last year
- Repository for the results of my master thesis, about the generation and evaluation of synthetic data using GANs ☆45 · Jun 21, 2023 · Updated 2 years ago
- Microarray Analysis Pipeline in Python ☆19 · Aug 1, 2019 · Updated 6 years ago
- Dual Adversarial Autoencoder for Generating Set-valued Sequences ☆19 · Jan 15, 2021 · Updated 5 years ago
- Kirsche, connecting your references. ☆14 · Aug 20, 2024 · Updated last year
- ☆43 · Dec 7, 2022 · Updated 3 years ago
- Implementation of Quantum Ising Born Machine ☆19 · May 1, 2020 · Updated 5 years ago
- Some basic algorithms and data structures in "Data Structure and Algorithm Analysis in C" by Mark Allen Weiss implementation in Python. ☆15 · Feb 1, 2021 · Updated 5 years ago
- Crowdsourced data for open domain relation classification from sentences ☆20 · Oct 26, 2018 · Updated 7 years ago
- Black for Python docstrings and reStructuredText (rst). ☆18 · Apr 7, 2023 · Updated 2 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆29 · Jul 7, 2022 · Updated 3 years ago
- ☆51 · May 23, 2023 · Updated 2 years ago
- Content for a talk on "The wonderful world of data quality tools in Python" ☆18 · May 5, 2021 · Updated 4 years ago
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation. ☆640 · Feb 11, 2026 · Updated 3 weeks ago
- A pipeline for generating and evaluating synthetic data generation models. Currently using SynthVAE to demonstrate functionality. Read mo… ☆26 · Jul 11, 2022 · Updated 3 years ago
- End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam ☆28 · Mar 25, 2024 · Updated last year
- PyTrial: A Comprehensive Platform for Artificial Intelligence for Drug Development ☆122 · Jan 8, 2024 · Updated 2 years ago
- Synthetic data generation for tabular data ☆3,434 · Updated this week
- A deep-learning framework for multi-omics integration ☆32 · Jul 6, 2023 · Updated 2 years ago
- Framework for image and flow cytometry analysis ☆32 · Jan 3, 2026 · Updated 2 months ago
- Tofu is a Python tool for generating synthetic UK Biobank data. ☆68 · Jul 25, 2023 · Updated 2 years ago
- A Data-Centric library providing a unified interface for state-of-the-art methods for hardness characterisation of data points. ☆26 · Mar 6, 2025 · Updated last year
- GitHub Repo for the UChicago, Spring 2021 course *Are We Doomed? Confronting the End of the World* ☆12 · Mar 30, 2021 · Updated 4 years ago
- Metrics to evaluate quality and efficacy of synthetic datasets. ☆256 · Updated this week