A pipeline for generating and evaluating synthetic data generation models. Currently using SynthVAE to demonstrate functionality. Read more about the project here: https://nhsx.github.io/skunkworks/synthetic-data-pipeline
☆26 · Jul 11, 2022 · Updated 3 years ago
Alternatives and similar repositories for skunkworks-synthetic-data
Users that are interested in skunkworks-synthetic-data are comparing it to the libraries listed below.
- quickly diff kedro history ☆10 · Jul 14, 2025 · Updated 8 months ago
- ☆25 · Apr 5, 2023 · Updated 2 years ago
- kedro plugin to automatically construct pipelines using pytest style pattern matching ☆22 · Sep 4, 2025 · Updated 6 months ago
- Pipelines for generating large volumes of anonymous artificial data that share some of the characteristics of real NHS data ☆35 · May 26, 2023 · Updated 2 years ago
- A simple wrapper to use Pandas Profiling easily in Kedro ☆17 · Apr 12, 2021 · Updated 4 years ago
- Synthetic data generation by a Variational AutoEncoder with Differential Privacy assessed using Synthetic Data Vault metrics ☆46 · Apr 4, 2023 · Updated 2 years ago
- kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools. ☆28 · Jan 6, 2023 · Updated 3 years ago
- HPC Python Workshop at RSECon22 ☆14 · Oct 17, 2022 · Updated 3 years ago
- ☆17 · Sep 20, 2023 · Updated 2 years ago
- Templates for your Kedro projects. ☆83 · Mar 16, 2026 · Updated last week
- Kedro Wings automatically creates catalog entries to simplify Kedro pipeline writing. See the video here: https://www.youtube.com/watch?v… ☆21 · Dec 26, 2022 · Updated 3 years ago
- Digital Research Toolkit for Linguists course materials ☆12 · Jul 23, 2025 · Updated 8 months ago
- DJ Checkup is a security scanner for Django sites. ☆32 · Dec 1, 2025 · Updated 3 months ago
- OpenAPI definitions and data specs for the FDP CDM ☆22 · Mar 3, 2026 · Updated 3 weeks ago
- This library intends to provide simplicity when accessing and parsing data from Power of 10, a UK-based site that provides a comprehensiv… ☆12 · Feb 12, 2024 · Updated 2 years ago
- A simple CLI command that initialises a Kedro project from an existing Python package ☆11 · Aug 23, 2024 · Updated last year
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background. ☆36 · Apr 13, 2022 · Updated 3 years ago
- Dual Adversarial Autoencoder for Generating Set-valued Sequences ☆19 · Jan 15, 2021 · Updated 5 years ago
- A Python module for redaction of personally identifiable information (PII) in clinical free-text. It builds on Presidio and is extremely … ☆18 · Nov 13, 2025 · Updated 4 months ago
- Convert GTFS feeds to realistic, routable NetworkX graph. ☆11 · Dec 14, 2023 · Updated 2 years ago
- Python package to calculate comorbidity scores including Charlson Comorbidity Score and Elixhauser Score and their weighted variants. ☆19 · Jan 24, 2026 · Updated 2 months ago
- Trust and specialty projections of waiting lists, for all trusts in England, updated each month ☆10 · May 16, 2023 · Updated 2 years ago
- Repository for the bus transit spatial decomposition package created by researchers from the MIT JTL-Transit Lab ☆12 · Mar 6, 2023 · Updated 3 years ago
- ☆10 · Aug 3, 2021 · Updated 4 years ago
- Find the Pole of Inaccessibility (Visual Center) of a Polygon ☆19 · Mar 2, 2026 · Updated 3 weeks ago
- Demo slides for the clean Quarto revealjs theme ☆21 · Jan 16, 2026 · Updated 2 months ago
- Livelike: Vivid Synthetic Populations ☆15 · Mar 19, 2026 · Updated last week
- Data science project development best practices and state of the art open-source tooling forged into a set of solved ML use cases to serv… ☆20 · Jun 28, 2023 · Updated 2 years ago
- A Python library to convert transit data from TransXchange into GTFS format. ☆12 · Oct 25, 2021 · Updated 4 years ago
- A kedro-plugin to serve Kedro Pipelines as API ☆13 · Jun 25, 2023 · Updated 2 years ago
- This covers how to load Microsoft Sharepoint documents into a document format that we can use downstream. ☆31 · May 5, 2024 · Updated last year
- Cycling Potential Hackathon repo ☆13 · Sep 25, 2020 · Updated 5 years ago
- Streamlined Data Mapping to OMOP ☆21 · Updated this week
- Full synthetic population for Melbourne's 4+ million residents ☆12 · Dec 21, 2023 · Updated 2 years ago
- Hands on workshop "Refactor your Jupyter notebooks into maintainable data science code with Kedro" ☆17 · Jan 22, 2025 · Updated last year
- This is a project extending the solution to the kaggle-connectx problem statement. Here I have made the frontend UI for the same and adde… ☆10 · Mar 8, 2021 · Updated 5 years ago
- Simulation for Planning and Understanding Railways ☆13 · Nov 10, 2025 · Updated 4 months ago
- ☆13 · May 20, 2022 · Updated 3 years ago
- Processing and performing financial calculations on HFT data ☆10 · Dec 19, 2018 · Updated 7 years ago