alexander-marquardt / deduplicate-elasticsearchLinks
Remove duplicate documents from Elasticsearch
☆45Updated 2 years ago
Alternatives and similar repositories for deduplicate-elasticsearch
Users that are interested in deduplicate-elasticsearch are comparing it to the libraries listed below
Sorting:
- python implementation of jordansissel's grok regular expression library☆282Updated 2 years ago
- A Workflow for Data Scientists to bring Jupyter Notebook Visualizations to Kibana Dashboards☆45Updated 2 years ago
- Reading and writing pandas DataFrames in Elasticsearch☆25Updated 4 years ago
- Python library that reads JSON files of any size.☆196Updated 2 years ago
- A tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch☆401Updated 3 years ago
- samples-python-flask☆103Updated last year
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆120Updated last year
- Example of how to handle background processes with Flask, Redis Queue, and Docker☆223Updated 2 years ago
- Flask extension that ties boto3 connectors to the application context☆35Updated 4 years ago
- A tool that parses emails by enhancing the Python standard library, extracting all details into a comprehensive object.☆428Updated 2 months ago
- moved: https://git.unturf.com/python/nested-lookup/☆209Updated 3 years ago
- Library for scraping websites or apis at any scale☆54Updated last year
- Use dask to fetch data from Elasticsearch in parallel by sending the request to each shard separatelly.☆20Updated 5 years ago
- Example app that logs in with Google using Flask-Dance☆37Updated 3 years ago
- Running a flask application over HTTPS with traefik and Let's Encrypt☆36Updated 7 years ago
- 🏗️ Create APIs from CSV files within seconds, using fastapi☆79Updated 4 years ago
- Celery extension which allows to orchestrate 100/1000/10000 tasks combined into a complex workflow☆102Updated 2 years ago
- An Elasticsearch client exposing DataFrame API☆284Updated 2 years ago
- A simple Python 3.5+ multitasking library.☆35Updated 5 years ago
- A simple application demonstrating direct uploads to S3 using Python☆131Updated 9 years ago
- AWS integration for python logging handlers(S3, Kinesis)☆72Updated 3 years ago
- Slackify: Lightweight framework to quickly develop modern Slack bots 🚀☆121Updated 5 years ago
- A schema analyser for MongoDB, written in Python.☆79Updated last week
- Flask Dashboard - Modular Admin Design | AppSeed☆59Updated 2 years ago
- Simple, easy-to-use throttler for asyncio.☆127Updated 3 years ago
- Example app to be deployed to AWS as an API Gateway / Lambda Stack☆141Updated 4 years ago
- Nested JSON to CSV Converter☆290Updated 3 years ago
- Boilerplate for running Nginx + Gunicorn + Flask + Let's Encrypt (https) with auto renewals on Docker.☆183Updated 7 months ago
- An example program that scrapes data from AllRecipes.com and store in Elasticsearch☆99Updated 7 years ago
- Simple tool to import CSV into ElasticSearch☆87Updated 7 years ago