huda-lab / synner
Generating Realistic Synthetic Data
☆31Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for synner
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆19Updated 2 years ago
- A curated list of awesome synthetic data tools (open source and commercial).☆104Updated 9 months ago
- Batteries included toolkit for data engineering.☆32Updated 2 months ago
- plait.py - a fake data modeler☆431Updated 5 years ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- ☆29Updated 10 months ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆44Updated 4 months ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆25Updated 2 years ago
- 📖 A curated list of resources dedicated to synthetic data☆118Updated 2 years ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆17Updated last week
- Datapractices site☆33Updated last year
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆20Updated 8 months ago
- ☆42Updated 2 months ago
- Clone of chatgpt built with Bytewax, Streamlit and NATS☆15Updated last year
- Repo to experiment with Graph RAG strategies using Kùzu☆27Updated last week
- An Example Dremio ARP driven connector that supports SQLLite☆19Updated 7 months ago
- Added functionality to the cml python package☆14Updated last month
- Python package that adds IntelligentGraph capabilities to RDFLib RDF graph package☆53Updated 10 months ago
- Data Lineage Tracing Library☆22Updated 2 years ago
- PyPi module for Graphlet AI Knowledge Graph Factory☆28Updated last year
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆111Updated this week
- Learn Kubeflow with Arrikto☆15Updated 2 years ago
- MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data.☆23Updated 2 weeks ago
- GraphRag vs Embeddings☆13Updated 3 months ago
- ☆17Updated last year
- My speaker profile for events and conferences based on codepo8/presenter-terms☆13Updated last month
- Neo4j Cybersecurity Demo☆16Updated 2 years ago
- ☆23Updated last year
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆53Updated this week
- Graph Storage for Concerto Models☆46Updated last month