Library for identification, anonymization and de-anonymization of PII data
☆22Dec 26, 2022Updated 3 years ago
Alternatives and similar repositories for anonymizer
Users that are interested in anonymizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆49Jun 2, 2019Updated 6 years ago
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Jan 7, 2026Updated 3 months ago
- Search for PII in Python☆31Jan 29, 2024Updated 2 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…☆14Dec 25, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆19Oct 28, 2018Updated 7 years ago
- Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub☆338Jan 5, 2024Updated 2 years ago
- My HackerRank Solutions : https://www.hackerrank.com/RohanKhude☆12Jul 13, 2016Updated 9 years ago
- Data, Code and Results from the ICSE 2019 accepted paper: Analysis and Detection of Information Types of Open Source Software Issue Discu…☆12Feb 25, 2019Updated 7 years ago
- ☆12Mar 15, 2022Updated 4 years ago
- NICTA Named Entity Recogniser is a rule based Named Entity Recogniser which extracts named entities from text such as Organisation, Locat…☆16Apr 15, 2023Updated 2 years ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Jan 21, 2021Updated 5 years ago
- Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP☆97Aug 13, 2024Updated last year
- Algorithms and Data Structures implemented in Java☆12Jul 28, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Data Quality Monitoring Tool☆15Dec 5, 2017Updated 8 years ago
- Floodgate - Pipelines as Code solution for Spinnaker☆16Apr 7, 2021Updated 5 years ago
- A FinServ microservice DevOps blueprint to kickstart a successful software development workflow on Google Cloud Plataform and Github Thi…☆10Jul 11, 2022Updated 3 years ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Jul 19, 2023Updated 2 years ago
- Tensor-based Spectral LDA on Spark☆18Jun 5, 2018Updated 7 years ago
- This project describes how to write full ETL data pipeline using spark.☆15Oct 15, 2022Updated 3 years ago
- Udacity self driving car engineer integration project: focus on building ROS nodes to implement core functionality of the autonomous vehi…☆19Jan 10, 2018Updated 8 years ago
- Kafka Connect connector for receiving data and writing data to Splunk.☆25Nov 7, 2017Updated 8 years ago
- Making high-accuracy and visually-interpretable decision tree-based models for semantic segmentation http://segnbdt.aaalv.in☆11Oct 12, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- File Watcher 核心库:轻量级Java库☆29Sep 20, 2018Updated 7 years ago
- Create a data mart using Azure Data Factory as ELT / ETL, Azure Synapse as database and Power BI as visualization tool.☆20Apr 20, 2022Updated 3 years ago
- ☆12Mar 31, 2023Updated 3 years ago
- sample oozie workflows☆17Jun 13, 2017Updated 8 years ago
- Internal dashboard summarising the release pipeline for components of the GOV.UK web content management system.☆24Updated this week
- Flatten/Explode JSON objects☆21Feb 5, 2026Updated 2 months ago
- Master complex big data processing, stream analytics, and machine learning with Apache Spark☆18Jan 30, 2023Updated 3 years ago
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago
- Developer workflow tooling for jenkins, jira, reviewboard and git☆23Jan 31, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Structured Streaming is a reference application showing how to easily integrate structured streaming Apache Spark Structured Streaming, …☆13Nov 17, 2018Updated 7 years ago
- High performance HBase / Spark SQL engine☆28Jul 7, 2022Updated 3 years ago
- Robust de-identification of medical notes using transformer architectures☆59Jun 27, 2022Updated 3 years ago
- Evaluate the performance of several state-of-the-art deep learning techniques on various text classification datasets. This project is pa…☆27Jul 3, 2021Updated 4 years ago
- Interpretable Explanations of Black Boxes by Meaningful Perturbation Pytorch☆12Aug 30, 2024Updated last year
- Access data from RTE API☆14Oct 3, 2022Updated 3 years ago
- Elasticsearch Open Data Demo☆15Dec 26, 2022Updated 3 years ago