thoughtworks-datakind/anonymizer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thoughtworks-datakind/anonymizer)

thoughtworks-datakind / anonymizer

Library for identification, anonymization and de-anonymization of PII data

☆22

Alternatives and similar repositories for anonymizer

Users that are interested in anonymizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

edwardcooper / piidetect
View on GitHub
A package to build an end-to-end pipeline for detecting personally identifiable information from text.
☆50Jun 2, 2019Updated 7 years ago
struct-chat / embedding
View on GitHub
Vector Embedding Server in under 100 lines of code
☆22Mar 1, 2024Updated 2 years ago
PovertyAction / PII_detection
View on GitHub
Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…
☆47Jan 7, 2026Updated 6 months ago
Hamza88-coder / Real-Time-Recruitment-System-with-AI-and-Data-Analytics
View on GitHub
Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…
☆14Dec 25, 2024Updated last year
IoTSEstudy / IoTbugschallenges
View on GitHub
Replication Package of "IoT Bugs and Development Challenges" study
☆12Feb 20, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Poogles / piiregex
View on GitHub
Search for PII in Python
☆29Jan 29, 2024Updated 2 years ago
milinda / KafkaOnEC2
View on GitHub
Ansible scripts for deploying Kafka on EC2
☆10Oct 7, 2016Updated 9 years ago
rohankhudedev / HackerRank
View on GitHub
My HackerRank Solutions : https://www.hackerrank.com/RohanKhude
☆12Jul 13, 2016Updated 10 years ago
sidneyocirqueira / azure-synapse-analytics
View on GitHub
☆12Mar 15, 2022Updated 4 years ago
fischJan / CiRA
View on GitHub
System behavior is often expressed by causal relations in requirements (e.g. if event 1 then event 2). Automatically extracting this embe…
☆13Oct 24, 2021Updated 4 years ago
apennisi / mctracker
View on GitHub
A multi camera tracker based on homography and costs.
☆18May 16, 2020Updated 6 years ago
Apress / pro-spark-streaming
View on GitHub
Source code for 'Pro Spark Streaming' by Zubair Nabi
☆11Mar 27, 2017Updated 9 years ago
hpclab / efficient-query-expansion
View on GitHub
Official repository of "Efficient and Effective Query Expansion for Web Search", Short Paper @ CIKM 2018
☆15Nov 17, 2019Updated 6 years ago
Cascading / cascading.samples
View on GitHub
Sample applications using Cascading
☆20Jun 7, 2015Updated 11 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
tianyi-zhang / ExampleStack-ICSE-Artifact
View on GitHub
☆16Jul 8, 2024Updated 2 years ago
Azure / DW-with-Synapse-Data-Factory-Power-BI
View on GitHub
Create a data mart using Azure Data Factory as ELT / ETL, Azure Synapse as database and Power BI as visualization tool.
☆19Apr 20, 2022Updated 4 years ago
PacktPublishing / Learning-Apache-Spark-2
View on GitHub
Code repository for Learning Apache Spark 2, published by Packt
☆21Jan 30, 2023Updated 3 years ago
brianmwangy / Beginner-Guide-to-Automated-Feature-Engineering-With-Deep-Feature-Synthesis.
View on GitHub
This is a comprehensive guide on how you can automate your feature engineering process.
☆11Jun 25, 2018Updated 8 years ago
WenzheLiu / filewatcher
View on GitHub
File Watcher 核心库：轻量级Java库
☆30Sep 20, 2018Updated 7 years ago
dream-lab / goffish_v3
View on GitHub
Latest version of GoFFish Distributed Graph Processing Platforms
☆12Apr 30, 2018Updated 8 years ago
mvillis / measure-mate
View on GitHub
Simple tool to track maturity assessments
☆13Jul 8, 2023Updated 3 years ago
griesyuli / semanticMSE
View on GitHub
Implementation query expansion in semantic meta-search engine. The resulting expansion system is called Wiki-MetaSemantik.
☆11Feb 10, 2019Updated 7 years ago
ashwinitonge / deepprivate
View on GitHub
☆12Sep 17, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
avensolutions / spark-sql-etl-framework
View on GitHub
Multi-stage, config driven, SQL based ETL framework using PySpark
☆26Sep 16, 2019Updated 6 years ago
obi-ml-public / ehr_deidentification
View on GitHub
Robust de-identification of medical notes using transformer architectures
☆63Jun 27, 2022Updated 4 years ago
Sripaad / ai4privacy
View on GitHub
☆23Mar 15, 2024Updated 2 years ago
adaltas / spark-streaming-pyspark
View on GitHub
Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark.
☆13Jun 6, 2019Updated 7 years ago
da2so / Interpretable-Explanations-of-Black-Boxes-by-Meaningful-Perturbation
View on GitHub
Interpretable Explanations of Black Boxes by Meaningful Perturbation Pytorch
☆12Aug 30, 2024Updated last year
DivLoic / xke-ratatouille
View on GitHub
Poison pills and Kafka Streams demo
☆10Jul 25, 2020Updated 6 years ago
tmpsrcrepo / benchmark_minhash_lsh
View on GitHub
insight data engineering fellow project
☆16Nov 14, 2016Updated 9 years ago
intersectional-fairness / isf
View on GitHub
Intersectional Fairness (ISF) is a bias detection and mitigation technology for intersectional bias, which combinations of multiple prote…
☆20Apr 23, 2026Updated 3 months ago
tmalaska / Spark.TableStatsExample
View on GitHub
Simple Spark example of generating table stats for use of data quality checks
☆27Apr 28, 2017Updated 9 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Caoang327 / vis_det
View on GitHub
Code of "Visualizing and Understanding Object Detecor"
☆20Jun 24, 2021Updated 5 years ago
nathankerr / hadoopGIS
View on GitHub
Parallel GIS processing using Hadoop
☆29Oct 21, 2009Updated 16 years ago
gia-uh / automl-survey
View on GitHub
An (in-progress) AutoML survey focusing on practical systems.
☆16Oct 5, 2021Updated 4 years ago
eBay / GZinga
View on GitHub
☆46Feb 9, 2022Updated 4 years ago
dataiku / dss-plugin-timeseries-forecast
View on GitHub
Dataiku DSS plugin to automate time series forecasting with Deep Learning and statistical models 📈
☆17Apr 14, 2023Updated 3 years ago
boguss1225 / ObjectDetectionGUI
View on GitHub
Easy to use, Good looking, Highly utilized, and Light Tensorflow Tool, PygIDE (Python graphical integrated development environment).
☆13Nov 23, 2022Updated 3 years ago
jfrazee / nifi-provenance-reporting-bundle
View on GitHub
NiFi provenance reporting tasks
☆14Sep 21, 2023Updated 2 years ago