deric / es-dedupeLinks
Tool for removing duplicate documents from Elasticsearch
☆54Updated 8 months ago
Alternatives and similar repositories for es-dedupe
Users that are interested in es-dedupe are comparing it to the libraries listed below
Sorting:
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40Updated last year
- A cookiecutter template for an elasticsearch ingest processor plugin☆47Updated 3 years ago
- An elasticsearch plugin to create hierarchical aggregations☆52Updated 2 months ago
- This is a data pipeline for Twitter (ETL) using the elastic stack Elasticsearch, Logstash and Kibana (version 6.1)☆59Updated 7 years ago
- A dataset migration tool from MongoDB to Elasticsearch and vice versa.☆138Updated 4 years ago
- Entity resolution for Elasticsearch.☆166Updated last month
- docker scrapyd scrapy boot2docker crawler - a spider Python application that can be "Dockerized".☆42Updated 10 years ago
- An Elasticsearch plugin to return query results as either PDF,HTML or CSV.☆48Updated 7 years ago
- Kibana-API is an extension to Kibana that lets you tap in to the dashboard management board from your app and change the visualizations d…☆123Updated 3 years ago
- A tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch☆402Updated 3 years ago
- Radar visualization for Kibana☆35Updated 2 years ago
- A bundle of useful Elasticsearch plugins☆112Updated last year
- Bulk indexing command line tool for elasticsearch.☆286Updated last month
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services☆28Updated last month
- Simple tool to import CSV into ElasticSearch☆87Updated 7 years ago
- a scaleable and efficient crawelr with docker cluster , crawl million pages in 2 hours with a single machine☆97Updated last year
- a pure javascript frontend for ElasticSearch search indices.☆80Updated 7 years ago
- ☆16Updated 9 years ago
- A curated list of Awesome Apache Solr links and resources.☆110Updated 4 years ago
- Traptor -- A distributed Twitter feed☆26Updated 3 years ago
- Chorus, now for Elasticsearch!☆16Updated last year
- Use Watson Natural Language Understanding and Watson Knowledge Studio to fingerprint personal data from unstructured documents☆55Updated 4 years ago
- "Stop worrying about Elasticsearch analyzers", my therapist says☆154Updated 4 years ago
- Searchkit starter app. Based off create-react-app☆120Updated 2 years ago
- Apache Tika Server as a Docker Image☆174Updated 3 years ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- A javascript shell for elasticsearch☆106Updated 10 years ago
- a general utility for anonymizing data☆126Updated this week
- A machine learning plugin for Elasticsearch providing aggregations to compute multiple linear regression on search results in real-time f…☆66Updated 7 years ago
- Elasticsearch cluster with Docker☆45Updated 9 years ago