elastic / anonymize-it
a general utility for anonymizing data
☆121Updated last month
Related projects: ⓘ
- Use dask to fetch data from Elasticsearch in parallel by sending the request to each shard separatelly.☆20Updated 3 years ago
- Ansible role to deploy and configure Airflow☆41Updated this week
- A Workflow for Data Scientists to bring Jupyter Notebook Visualizations to Kibana Dashboards☆43Updated last year
- plait.py - a fake data modeler☆430Updated 5 years ago
- Print an Elasticsearch inverted index as a CSV table or JSON object.☆12Updated 6 months ago
- Apache NiFi Custom Processor for working with Stanford CoreNLP for Sentiment Analysis in Java 8☆11Updated 6 years ago
- Rally track for simulating event-based data use-cases☆33Updated 2 months ago
- Peek into Elasticsearch clusters☆23Updated 7 months ago
- Entity resolution for Elasticsearch.☆156Updated last month
- Library for identification, anonymization and de-anonymization of PII data☆22Updated last year
- PST extraction and analytic pipeline☆37Updated 6 years ago
- Open Distro Kibana Notebooks☆21Updated 2 years ago
- Ansible role to install Apache Airflow☆82Updated last year
- A cookiecutter template for an elasticsearch ingest processor plugin☆47Updated 2 years ago
- Sensitive Data Management: Data Discovery and Anonymization toolkit☆144Updated last month
- Streaming web crawler with WebSocket API☆44Updated last year
- A component that tries to avoid downloading duplicate content☆27Updated 6 years ago
- Python package providing a simple interface to manipulate Elasticsearch queries and aggregations☆11Updated 2 years ago
- Load a CSV (or TSV) file into an Elasticsearch instance☆62Updated last year
- Snorkel - Bootstrap your Data Science☆23Updated 6 years ago
- A repository for personal information data patterns and detection for EU member states. These will be useful to understand how to best de…☆12Updated 6 years ago
- Sensitive Data Discovery tool☆31Updated 5 years ago
- Tool for removing duplicate documents from Elasticsearch☆54Updated 11 months ago
- Python Driver for Apache Drill.☆59Updated last year
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆17Updated this week
- Trying to generate name synonyms from wikidata☆33Updated 4 years ago
- Slack notifications for the Luigi workflow manager☆46Updated 3 years ago
- Jupyter notebook version of elasticsearch definitive guide with all examples in Python (DSL and client)☆35Updated 7 years ago
- T4 is now in production as Quilt 3☆64Updated 5 years ago
- Quickly analyze and explore email with advanced analytics and visualization.☆55Updated 2 years ago