elastic / anonymize-it
a general utility for anonymizing data
☆122Updated 9 months ago
Alternatives and similar repositories for anonymize-it
Users that are interested in anonymize-it are comparing it to the libraries listed below
Sorting:
- A Workflow for Data Scientists to bring Jupyter Notebook Visualizations to Kibana Dashboards☆45Updated 2 years ago
- Ansible role to deploy and configure Airflow☆41Updated this week
- A component that tries to avoid downloading duplicate content☆27Updated 6 years ago
- Use dask to fetch data from Elasticsearch in parallel by sending the request to each shard separatelly.☆20Updated 4 years ago
- Search relevance evaluation toolkit☆32Updated 2 years ago
- Trying to generate name synonyms from wikidata☆32Updated 4 years ago
- A cookiecutter template for an elasticsearch ingest processor plugin☆47Updated 2 years ago
- Entity resolution for Elasticsearch.☆159Updated 4 months ago
- Now included in rigour☆151Updated last week
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- A classifier for detecting soft 404 pages☆57Updated last year
- This is a data pipeline for Twitter (ETL) using the elastic stack Elasticsearch, Logstash and Kibana (version 6.1)☆58Updated 7 years ago
- Search relevance evaluation toolkit☆73Updated 3 years ago
- Real-time performance monitoring of an Elasticsearch cluster from the command line☆78Updated 3 years ago
- Python package providing a simple interface to manipulate Elasticsearch queries and aggregations☆11Updated 5 months ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated this week
- Python Driver for Apache Drill.☆59Updated 2 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 4 years ago
- Algorithms for "schema matching"☆26Updated 8 years ago
- Commons of stupid, simple Python micro functions. Pull requests very welcome.☆19Updated last month
- A Python implementation of our efficient Bloom filter library.☆29Updated 5 years ago
- Skinfer is a tool for inferring and merging JSON schemas☆139Updated last year
- Load a CSV (or TSV) file into an Elasticsearch instance☆61Updated 2 years ago
- An elasticsearch site plugin for identifying risky IPs or subnets in web logs☆46Updated 9 years ago
- Restrict crawl and scraping scope using matchers.☆25Updated 8 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research.☆87Updated 6 years ago
- Tag Cloud Plugin for Kibana 4☆69Updated 8 years ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- This Kibana plugin allows any data visualizations from Elastic Search and other data sources using Vega grammar. You can even create a vi…☆135Updated 6 years ago