Metadata and data identification tool and Python library. Identifies PII, common identifiers, and language-specific identifiers. Fully customizable, flexible rules.
☆46Jan 1, 2026Updated 3 months ago
Alternatives and similar repositories for metacrafter
Users that are interested in metacrafter are comparing it to the libraries listed below.
- A CLI for identifying potential Personally Identifiable Information in datasets.☆14Apr 9, 2019Updated 7 years ago
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆49Jun 2, 2019Updated 6 years ago
- Registry of data portals, catalogs, and data repositories, including a description standard for data catalogs and datasets☆51Feb 24, 2026Updated last month
- Example project for consuming an AWS Kinesis stream and saving the data to Amazon Redshift using Apache Spark☆11May 22, 2018Updated 7 years ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆18Apr 10, 2026Updated last week
- A research Python package for detecting, categorizing, and assessing the severity of personally identifiable information (PII)☆98Feb 15, 2026Updated 2 months ago
- Suite of converters to transform MIDI files into RDF and backwards☆16Dec 7, 2022Updated 3 years ago
- Quick and dirty date parsing Python library to parse HTML dates really fast☆22Jan 3, 2026Updated 3 months ago
- A web front-end for SirixDB based on Nuxt.js/Vue.js, D3.js and Typescript☆17Aug 12, 2020Updated 5 years ago
- ☆13Jan 28, 2024Updated 2 years ago
- LLM Assisted Geology Descriptions of Arbitrary Locations = LAGDAL☆14Jun 23, 2024Updated last year
- 🏌️ a fun challenge to compress 1M rows to the smallest possible size☆53Feb 24, 2026Updated last month
- An application that open source projects can use to ensure they include relevant documentation (and not secrets or PII!)☆10Mar 29, 2021Updated 5 years ago
- Generates Markdown documentation from Python module docstrings☆16Aug 28, 2025Updated 7 months ago
- Collection of color palettes for Python☆15Apr 25, 2022Updated 3 years ago
- OWL ontology and SKOS taxonomy for TOGAF 9.2 Content Metamodel☆12Feb 21, 2022Updated 4 years ago
- ☆12Dec 7, 2025Updated 4 months ago
- CLK hash: hash PII for entity matching☆47May 12, 2025Updated 11 months ago
- Repo for paper: Exploring the Power of Graph Neural Networks in Solving Linear Optimization Problems, accepted at AISTATS 2024☆18Oct 17, 2023Updated 2 years ago
- Ontology alignment between Schema.Org, Wikidata, and DBpedia☆11Oct 25, 2017Updated 8 years ago
- Dash Component created from ukrbublik/react-awesome-query-builder☆12Apr 6, 2026Updated last week
- A Dash component library that integrates React Flow functionality into Dash applications☆11Jan 3, 2025Updated last year
- Data Explorer app and components built in React oriented to use with CKAN☆14Sep 11, 2025Updated 7 months ago
- Adaptation postgres adapter for Greenplum☆36Mar 7, 2024Updated 2 years ago
- COM runtime support for SharpGen generated interop code.☆13Sep 9, 2021Updated 4 years ago
- ☆11Nov 11, 2023Updated 2 years ago
- International Address formatter which considers the standard formatting rules of the country☆13Nov 21, 2024Updated last year
- AWS Amplify project to demonstrate Amazon Connect Chat with realtime language detection and translation☆17Updated this week
- ☆15Aug 2, 2024Updated last year
- 🗣 Frictionless Data Forum, especially for "How do I" type questions☆10Mar 1, 2021Updated 5 years ago
- A curated list of awesome resources that feature usages of Semantic Web technologies in business cases (applications)☆21Dec 23, 2019Updated 6 years ago
- Merged into https://github.com/frictionlessdata/frictionlessdata.io☆12Sep 12, 2022Updated 3 years ago
- Easily share pprof formatted profiles from your terminal.☆35Oct 12, 2022Updated 3 years ago
- A comprehensive tool for capturing performance metrics and workload snapshots, and generating in-depth comparison reports for Amazon Auro…☆20Apr 9, 2026Updated last week
- Lambda Chaos Engineering without changing code☆13Jan 8, 2025Updated last year
- A project to kickstart your ML development☆31Aug 20, 2024Updated last year
- Copy data from Azure Blob Storage to Amazon S3 using code. View Azure costs using Amazon QuickSight☆16Mar 5, 2026Updated last month
- Next generation compute platform for the post-modern data stack☆34Updated this week
- Search for PII in Python☆30Jan 29, 2024Updated 2 years ago