deric / es-dedupe
Tool for removing duplicate documents from Elasticsearch
☆54 · Updated last year
Related projects
Alternatives and complementary repositories for es-dedupe
- A cookiecutter template for an elasticsearch ingest processor plugin ☆47 · Updated 2 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations ☆40 · Updated 6 months ago
- Load a CSV (or TSV) file into an Elasticsearch instance ☆62 · Updated 2 years ago
- An Elasticsearch plugin for rescoring based on Redis keys ☆29 · Updated 3 years ago
- A data pipeline (ETL) for Twitter using the Elastic Stack: Elasticsearch, Logstash and Kibana (version 6.1) ☆58 · Updated 6 years ago
- docker scrapyd scrapy boot2docker crawler - a spider Python application that can be "Dockerized". ☆42 · Updated 9 years ago
- ElasticBeat to download and index tweets of specified screen names ☆32 · Updated 8 years ago
- Chorus, now for Elasticsearch! ☆14 · Updated 5 months ago
- AngularJS diagnostic search services for Solr, Elasticsearch and OpenSearch ☆25 · Updated 5 months ago
- A general utility for anonymizing data ☆123 · Updated 3 months ago
- A bundle of useful Elasticsearch plugins ☆110 · Updated 7 months ago
- Development setups for Elasticsearch and Kibana with docker-compose ☆152 · Updated 8 months ago
- Simple tool to import CSV into Elasticsearch ☆87 · Updated 6 years ago
- Entity resolution for Elasticsearch. ☆158 · Updated 4 months ago
- Dockerfile for Redis Sentinel ☆68 · Updated 5 years ago
- An Elasticsearch Plugin that notifies about changes to indices ☆92 · Updated 8 years ago
- A curated list of Awesome Apache Solr links and resources. ☆106 · Updated 3 years ago
- Ingest processor doing language detection for fields ☆71 · Updated 2 years ago
- Framework for building Commerce Search Solutions around open source search technology like Elasticsearch ☆41 · Updated last week
- Radar visualization for Kibana ☆35 · Updated last year
- Demonstration project: fast data analytics platform with ClickHouse, Apache Kafka and ksqlDB ☆21 · Updated 4 years ago
- Querqy for Elasticsearch ☆45 · Updated this week
- Stanford CoreNLP NER addon for Apache Tika's NamedEntityParser ☆13 · Updated 2 years ago
- Module that extracts structured information from a rendered HTML site and outputs JSON. HTML to JSON. ☆69 · Updated 3 years ago
- Curated synonym files and Helpers for Elasticsearch Synonym Token Filter ☆64 · Updated last year
- Datasets for exploringelasticsearch.com ☆54 · Updated 10 years ago
- Elasticsearch Metricbeat example configuration to monitor Host and Services with docker ☆83 · Updated 6 years ago
- An example of how to implement a custom similarity (overlap similarity) for Elasticsearch ☆41 · Updated 8 years ago
- Search relevance evaluation toolkit ☆30 · Updated 2 years ago
- Elasticsearch diff tool. ☆71 · Updated 2 years ago