apicrafter / metacrafter
Metadata and data identification tool and Python library. Identifies PII, common identifiers, and language-specific identifiers. Fully customizable and flexible rules.
☆44 Updated last month
Alternatives and similar repositories for metacrafter
Users who are interested in metacrafter are comparing it to the libraries listed below.
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations). ☆120 Updated 2 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆61 Updated this week
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives. ☆107 Updated this week
- Toolkit for graph-relational data across space and time ☆116 Updated 11 months ago
- Data pipelines from re-usable components ☆107 Updated 2 years ago
- PyPI module for Graphlet AI Knowledge Graph Factory ☆29 Updated 2 years ago
- CLI to create an ER Diagram from DuckDB database files ☆131 Updated 5 months ago
- Swiple enables you to easily observe, understand, validate and improve the quality of your data ☆84 Updated last week
- Python+VueJS application to load, explore, combine, transform and deliver data ☆96 Updated 6 months ago
- DAG based BI-as-code CLI tool. Unlocks a better approach to data visualization that integrates seamlessly into the modern data stack. ☆60 Updated this week
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️ ☆18 Updated this week
- dagster scikit-learn pipeline example. ☆46 Updated 2 years ago
- Opinionated JSON to CSV/XLSX/SQLITE/PARQUET converter. Flattens JSON fast. ☆198 Updated 2 months ago
- List of entity resolution software and resources. ☆84 Updated 6 months ago
- ☆51 Updated this week
- Scripts to make specific datasets cleaner and more convenient ☆42 Updated 2 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ… ☆161 Updated last month
- Ibis analytics, with Ibis (and more!) ☆22 Updated 11 months ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊 ☆79 Updated 11 months ago
- A small Python module containing quick utility functions for standard ETL processes. ☆36 Updated this week
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis. ☆12 Updated 6 months ago
- Declare multi-table rules for SQLAlchemy update logic -- 40X more concise, Python for extensibility. ☆46 Updated 2 weeks ago
- Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources ☆17 Updated last year
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀 ☆104 Updated this week
- Data Tools Subjective List ☆86 Updated 2 years ago
- ☆23 Updated last year
- undatum: a command-line tool for data processing. Brings CSV simplicity to NDJSON, BSON, XML and other data files ☆48 Updated 3 weeks ago
- quadipy is a Python package to help transform structured data into RDF graph format ☆19 Updated 2 years ago
- Personal Finance Project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 Updated last year
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆57 Updated 3 years ago