huda-lab / synnerLinks
Generating Realistic Synthetic Data
☆40Updated last year
Alternatives and similar repositories for synner
Users that are interested in synner are comparing it to the libraries listed below
Sorting:
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆25Updated 3 years ago
- plait.py - a fake data modeler☆435Updated 6 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆63Updated last week
- Public blueprints for data use cases☆85Updated 2 months ago
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Updated last month
- openclean - Data Cleaning and data profiling library for Python☆82Updated 4 years ago
- Python package that adds IntelligentGraph capabilities to RDFLib RDF graph package☆55Updated last year
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated last year
- MLOps simplified. One-stop AI delivery platform, all the features you need.☆105Updated this week
- Data Lineage Tracing Library☆23Updated 3 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆39Updated 4 years ago
- PyPi module for Graphlet AI Knowledge Graph Factory☆33Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆125Updated 4 years ago
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- A repository of all code and resources of my published blog articles.☆34Updated 2 months ago
- The complete graph data science platform☆140Updated 9 months ago
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆48Updated 6 years ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆65Updated last month
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆45Updated 4 months ago
- Data Mesh Architecture☆84Updated last month
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆38Updated 3 years ago
- Go from graph data to a secure and interactive visual graph app in 15 minutes. Batteries-included self-hosting of graph data apps with St…☆237Updated this week
- Config files for setting up Multitenant Kubeflow on AWS with spot instances☆10Updated 5 years ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆18Updated 3 weeks ago
- Sample solution to automate tedious regulatory compliance processes using multi-agent systems☆19Updated 7 months ago
- This project is created to promote and advocate the use of FOSS machine learning.☆47Updated 6 months ago
- Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.☆85Updated last month
- Batteries included toolkit for data engineering.☆36Updated 10 months ago
- ☆24Updated 2 years ago