elastic / anonymize-it
a general utility for anonymizing data
☆123Updated 5 months ago
Alternatives and similar repositories for anonymize-it:
Users that are interested in anonymize-it are comparing it to the libraries listed below
- Send summary messages of your Luigi jobs to Slack☆46Updated 5 years ago
- Ansible role to deploy and configure Airflow☆41Updated last month
- Python package providing a simple interface to manipulate Elasticsearch queries and aggregations☆11Updated last month
- Use dask to fetch data from Elasticsearch in parallel by sending the request to each shard separatelly.☆20Updated 4 years ago
- A Workflow for Data Scientists to bring Jupyter Notebook Visualizations to Kibana Dashboards☆44Updated 2 years ago
- This project is created to promote and advocate the use of FOSS machine learning.☆44Updated 4 months ago
- Sensitive Data Management: Data Discovery and Anonymization toolkit☆148Updated 3 months ago
- Trying to generate name synonyms from wikidata☆33Updated 4 years ago
- Entity resolution for Elasticsearch.☆158Updated this week
- PST extraction and analytic pipeline☆37Updated 6 years ago
- Python Driver for Apache Drill.☆58Updated last year
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆43Updated 5 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆147Updated last week
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- Hidden alignment conditional random field for classifying string pairs.☆25Updated 3 months ago
- Kibana Milestones Visualization☆90Updated last year
- Kibana visualization that provides controls for setting and animating time ranges.☆125Updated 4 years ago
- Real-time performance monitoring of an Elasticsearch cluster from the command line☆78Updated 3 years ago
- This is a data pipeline for Twitter (ETL) using the elastic stack Elasticsearch, Logstash and Kibana (version 6.1)☆58Updated 6 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago
- Postgraas is a super simple PostgreSQL-as-a-service☆29Updated 4 years ago
- transformpy is a Python 2/3 module for doing transforms on "streams" of data☆29Updated 7 years ago
- Algorithms for "schema matching"☆25Updated 8 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- A Topic Modeling toolbox☆92Updated 8 years ago
- Pure Python wrapper to the Yajl C Library☆82Updated last month
- Legacy instrumentation for your Python apps with Honeycomb.☆33Updated 4 months ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 4 years ago
- Sensitive Data Discovery tool☆32Updated 6 years ago