Library for identification, anonymization and de-anonymization of PII data
☆22Dec 26, 2022Updated 3 years ago
Alternatives and similar repositories for anonymizer
Users that are interested in anonymizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆49Jun 2, 2019Updated 6 years ago
- Vector Embedding Server in under 100 lines of code☆22Mar 1, 2024Updated 2 years ago
- Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources☆18Jan 1, 2026Updated 4 months ago
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Jan 7, 2026Updated 3 months ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…☆14Dec 25, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Ansible scripts for deploying Kafka on EC2☆10Oct 7, 2016Updated 9 years ago
- Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub☆340Jan 5, 2024Updated 2 years ago
- Python SDK for PII detection and redaction in text and images, combining regex + NLP pipelines for production privacy workflows.☆54Updated this week
- My HackerRank Solutions : https://www.hackerrank.com/RohanKhude☆12Jul 13, 2016Updated 9 years ago
- ☆20Apr 27, 2012Updated 14 years ago
- ☆22Mar 15, 2024Updated 2 years ago
- ☆12Mar 15, 2022Updated 4 years ago
- NICTA Named Entity Recogniser is a rule based Named Entity Recogniser which extracts named entities from text such as Organisation, Locat…☆16Apr 15, 2023Updated 3 years ago
- ☆15Jan 17, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Algorithms and Data Structures implemented in Java☆12Jul 28, 2019Updated 6 years ago
- Data Quality Monitoring Tool☆15Dec 5, 2017Updated 8 years ago
- ☆24Jan 10, 2024Updated 2 years ago
- This project describes how to write full ETL data pipeline using spark.☆15Oct 15, 2022Updated 3 years ago
- 🎭 Sentiment Analysis with Neural Networks☆10Dec 4, 2016Updated 9 years ago
- File Watcher 核心库:轻量级Java库☆30Sep 20, 2018Updated 7 years ago
- Create a data mart using Azure Data Factory as ELT / ETL, Azure Synapse as database and Power BI as visualization tool.☆20Apr 20, 2022Updated 4 years ago
- Template to deploy Synapse Analytics using best practices to deliver a proof of concept.☆21Mar 3, 2023Updated 3 years ago
- This is a comprehensive guide on how you can automate your feature engineering process.☆11Jun 25, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Competitive Programming Solutions - Majorly in Java. Timely update for space and time efficiency☆19Mar 5, 2023Updated 3 years ago
- Master complex big data processing, stream analytics, and machine learning with Apache Spark☆18Jan 30, 2023Updated 3 years ago
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago
- ☆25Apr 9, 2025Updated last year
- ☆11Nov 23, 2017Updated 8 years ago
- Structured Streaming is a reference application showing how to easily integrate structured streaming Apache Spark Structured Streaming, …☆13Nov 17, 2018Updated 7 years ago
- High performance HBase / Spark SQL engine☆28Jul 7, 2022Updated 3 years ago
- ☆12Sep 17, 2019Updated 6 years ago
- Robust de-identification of medical notes using transformer architectures☆59Jun 27, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark.☆13Jun 6, 2019Updated 6 years ago
- Incremental Machine Leaning by example - Detecting suspicious activity in real time with Zeek data streams, River and JA3 hashes☆17Aug 10, 2022Updated 3 years ago
- Elasticsearch Open Data Demo☆15Dec 26, 2022Updated 3 years ago
- insight data engineering fellow project☆16Nov 14, 2016Updated 9 years ago
- Machine Learning with the Elastic Stack, Published by Packt☆17Jan 30, 2023Updated 3 years ago
- ☆46Feb 9, 2022Updated 4 years ago
- Code of "Visualizing and Understanding Object Detecor"☆20Jun 24, 2021Updated 4 years ago