elastic / anonymize-it
a general utility for anonymizing data
☆125 · Updated 2 months ago
Alternatives and similar repositories for anonymize-it
Users who are interested in anonymize-it are comparing it to the libraries listed below.
- A Workflow for Data Scientists to bring Jupyter Notebook Visualizations to Kibana Dashboards ☆45 · Updated 2 years ago
- Clean personally identifiable information from dirty dirty text. ☆413 · Updated last year
- Airflow plugin to transfer arbitrary files between operators ☆78 · Updated 6 years ago
- Ansible role to deploy and configure Airflow ☆41 · Updated last month
- Ansible role to install Apache Airflow ☆84 · Updated 6 months ago
- Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub ☆321 · Updated last year
- ☆21 · Updated last year
- A tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch ☆401 · Updated 3 years ago
- Load a CSV (or TSV) file into an Elasticsearch instance ☆62 · Updated 2 years ago
- Open source Flotilla ☆195 · Updated 2 weeks ago
- Sensitive Data Management: Data Discovery and Anonymization toolkit ☆155 · Updated last week
- Helpers & syntactic sugar for PySpark. ☆62 · Updated 2 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆61 · Updated this week
- A cookiecutter template for an elasticsearch ingest processor plugin ☆47 · Updated 3 years ago
- ☆75 · Updated 5 months ago
- Graphistry admin docs: launch, configure, use, & debug ☆28 · Updated last month
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in… ☆254 · Updated last month
- A Tool for Complex and Scalable Data Access Policy Enforcement ☆96 · Updated 4 years ago
- Now included in rigour ☆151 · Updated 2 weeks ago
- Tool for removing duplicate documents from Elasticsearch ☆54 · Updated 3 months ago
- The sane way of building a data layer in Airflow ☆24 · Updated 5 years ago
- Easy way to get structured stuff into Elasticsearch (CSV, MSSQL, API) ☆88 · Updated 5 years ago
- Chatlytics is a data query and visualization platform for chat! ☆13 · Updated 8 years ago
- A python bot framework for slack ☆22 · Updated last year
- plait.py - a fake data modeler ☆436 · Updated 6 years ago
- Puppet module to provision Airbnb's Airflow ☆19 · Updated 3 years ago
- A component that tries to avoid downloading duplicate content ☆27 · Updated 7 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research. ☆86 · Updated 7 years ago
- Send summary messages of your Luigi jobs to Slack ☆46 · Updated 6 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow. ☆96 · Updated 4 years ago