huda-lab / synner
Generating Realistic Synthetic Data
☆34Updated last year
Alternatives and similar repositories for synner:
Users that are interested in synner are comparing it to the libraries listed below
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆21Updated 2 years ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆32Updated 2 years ago
- A course on systems thinking for technologists by Dan McCreary☆17Updated 5 months ago
- Batteries included toolkit for data engineering.☆33Updated 2 months ago
- ☆37Updated last month
- Open-source repository for Semantic Modeling Language (SML)☆61Updated last week
- Added functionality to the cml python package☆14Updated 5 months ago
- Python package that adds IntelligentGraph capabilities to RDFLib RDF graph package☆55Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆56Updated 3 months ago
- plait.py - a fake data modeler☆434Updated 6 years ago
- GraphRag vs Embeddings☆13Updated 8 months ago
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆23Updated last year
- Data Catalog for Databases and Data Warehouses☆33Updated last year
- openclean - Data Cleaning and data profiling library for Python☆74Updated 3 years ago
- Learn Kubeflow with Arrikto☆15Updated 3 years ago
- A framework of open-source technologies to design real-time machine learning systems☆28Updated 2 years ago
- Demonstration of how to perform continuous model monitoring on CML using Model Metrics and Evidently.ai dashboards☆12Updated 3 months ago
- Crowdsourced cypher statement evaluation☆33Updated last year
- Record matching and entity resolution at scale in Spark☆34Updated last year
- PyPi module for Graphlet AI Knowledge Graph Factory☆28Updated last year
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆49Updated last month
- ☆43Updated this week
- Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes.☆14Updated last year
- Data for the Chat With Your Data benchmark.☆133Updated last year
- An organizational AI system to build a suite of AI assistants leveraging ontologies as a unifying field that connect data, AI models, wor…☆56Updated this week
- This project is created to promote and advocate the use of FOSS machine learning.☆43Updated last month
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- ☆25Updated last week
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆16Updated last week