apicrafter / metacrafter
Metadata and data identification tool and Python library. Identifies PII, common identifiers, and language-specific identifiers. Fully customizable and flexible rules.
☆44 · Updated 7 months ago
Alternatives and similar repositories for metacrafter:
Users who are interested in metacrafter are comparing it to the libraries listed below.
- List of entity resolution software and resources. ☆57 · Updated last week
- Provide an easy way with Python to protect your data sources by searching its metadata. ☆16 · Updated last week
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆56 · Updated 2 months ago
- PyPi module for Graphlet AI Knowledge Graph Factory ☆28 · Updated last year
- scraping and querying documents for LLMs ☆18 · Updated 2 months ago
- quadipy is a python package to help transform structured data into RDF graph format ☆19 · Updated last year
- Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources ☆17 · Updated last year
- A fully-featured multi-source data pipeline for continuously extracting knowledge from COVID-19 data. ☆21 · Updated 3 years ago
- Batteries included toolkit for data engineering. ☆33 · Updated 2 months ago
- Ibis analytics, with Ibis (and more!) ☆20 · Updated 5 months ago
- Scripts to make specific datasets cleaner and more convenient ☆41 · Updated 2 years ago
- Python package for deduplication/entity resolution using active learning ☆76 · Updated 6 months ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆27 · Updated 2 years ago
- A collection of python utility functions ☆11 · Updated 8 months ago
- ☆21 · Updated 6 months ago
- A small Python module containing quick utility functions for standard ETL processes. ☆34 · Updated last week
- Graph Engine for Exploration and Search ☆40 · Updated last year
- Swiple enables you to easily observe, understand, validate and improve the quality of your data ☆83 · Updated this week
- Prefect integrations for working with Docker ☆43 · Updated 10 months ago
- CLI to create an ER Diagram from DuckDB database files ☆84 · Updated 5 months ago
- ☆35 · Updated last month
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it ☆26 · Updated last year
- Record matching and entity resolution at scale in Spark ☆34 · Updated last year
- portable Python ML-powered data bot ☆23 · Updated 5 months ago
- Cloud-agnostic Python API ☆61 · Updated 8 months ago
- DuckDB Community Extension to prompt LLMs from SQL ☆42 · Updated last month
- Example project demonstrating deployment patterns for real-time streaming workflows with Prefect 2.0 ☆44 · Updated 2 years ago
- Anomstack - Painless open source anomaly detection for your metrics ☆97 · Updated this week
- Dask integration for Snowflake ☆30 · Updated 3 months ago
- Prefect integrations for working with OpenAI. ☆36 · Updated 10 months ago