avlahop / dask-elk
Use dask to fetch data from Elasticsearch in parallel by sending the request to each shard separately.
☆20 · Updated 5 years ago
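The per-shard idea in the description can be sketched as follows. This is a minimal illustration, not dask-elk's actual API: it relies only on Elasticsearch's `preference=_shards:<n>` search parameter, swaps in a standard-library thread pool for dask, and the names `shard_requests`, `fetch_all`, and the `search` callable are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def shard_requests(index, query, num_shards):
    """Build one search request per shard, pinned via the `preference` parameter."""
    return [
        {"index": index, "body": {"query": query}, "preference": f"_shards:{shard}"}
        for shard in range(num_shards)
    ]


def fetch_all(requests, search, max_workers=4):
    """Call `search(**request)` for every request in parallel and merge the hits.

    `search` is an assumed callable (for example a client's search method)
    that returns an Elasticsearch-style response dict.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map keeps the responses in the same order as the requests.
        responses = list(pool.map(lambda req: search(**req), requests))
    return [hit for resp in responses for hit in resp["hits"]["hits"]]
```

Because each request targets a single shard, the requests do not overlap, and the merged list covers the whole index once every shard has been queried.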
Alternatives and similar repositories for dask-elk
Users who are interested in dask-elk compare it to the libraries listed below.
- An Elasticsearch client exposing DataFrame API ☆284 · Updated 2 years ago
- Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch ☆691 · Updated 2 months ago
- A lucene query parser generating ElasticSearch queries and more ! ☆199 · Updated 10 months ago
- A pipeline abstraction for Python ☆168 · Updated 4 years ago
- RedisGraph python client ☆190 · Updated 2 years ago
- A DBAPI and SQLAlchemy dialect for Elasticsearch ☆120 · Updated last year
- Python package providing a simple interface to manipulate Elasticsearch queries and aggregations ☆11 · Updated last year
- a general utility for anonymizing data ☆127 · Updated this week
- python implementation of jordansissel's grok regular expression library ☆282 · Updated 2 years ago
- Probabilistic data structures in python http://pyprobables.readthedocs.io/en/latest/index.html ☆122 · Updated last month
- This is a pytest plugin that enables you to test your code that relies on a running Elasticsearch search engine. It allows you to specify… ☆68 · Updated 2 weeks ago
- Stream Processing Made Easy ☆43 · Updated 3 years ago
- Clean personally identifiable information from dirty dirty text. ☆416 · Updated 2 years ago
- A Cython implementation of the affine gap string distance ☆57 · Updated 2 years ago
- 🐍 A CPython extension for the Hyperscan regular expression matching library. ☆189 · Updated last month
- Python wrapper for RE2 ☆106 · Updated 2 months ago
- Example of using Faust with Docker ☆23 · Updated 6 years ago
- Concurrent appendable key-value storage ☆107 · Updated last year
- Python Driver for Apache Drill. ☆61 · Updated 2 years ago
- A pandas.DataFrame-based ORM. ☆85 · Updated 3 years ago
- Simple, easy-to-use throttler for asyncio. ☆127 · Updated 3 years ago
- A Python 3.5 rewrite of the TinkerPop 3 OGM Goblin ☆90 · Updated 7 years ago
- A module for getting data into python from large data sources ☆176 · Updated last year
- Python stream processing for humans ☆189 · Updated last month
- Async IPython Magic for Asynchronous Notebook Cell Execution ☆22 · Updated 3 years ago
- Backend for elasticsearch-py based on python's asyncio module. ☆278 · Updated 3 years ago
- Price and currency parsing utility ☆27 · Updated 2 years ago
- A SQLAlchemy like ORM implementation using python-arango as the backend library ☆148 · Updated 2 years ago
- persistent caching to memory, disk, or database ☆278 · Updated this week
- A simple fuzzy matching set for python strings ☆230 · Updated last year