avlahop / dask-elkLinks
Use dask to fetch data from Elasticsearch in parallel by sending the request to each shard separatelly.
☆20Updated 5 years ago
Alternatives and similar repositories for dask-elk
Users that are interested in dask-elk are comparing it to the libraries listed below
Sorting:
- An Elasticsearch client exposing DataFrame API☆284Updated 2 years ago
- A lucene query parser generating ElasticSearch queries and more !☆200Updated last week
- Python package providing a simple interface to manipulate Elasticsearch queries and aggregations☆11Updated last year
- RedisGraph python client☆190Updated 2 years ago
- Python Driver for Apache Drill.☆61Updated 2 years ago
- This is a pytest plugin that enables you to test your code that relies on a running Elasticsearch search engine. It allows you to specify…☆68Updated this week
- Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch☆693Updated this week
- A pipeline abstraction for Python☆168Updated 4 years ago
- Simple, easy-to-use throttler for asyncio.☆127Updated 3 years ago
- SQL on dataframes - pandas and dask☆64Updated 7 years ago
- Async IPython Magic for Asynchronous Notebook Cell Execution☆22Updated 3 years ago
- python implementation of jordansissel's grok regular expression library☆282Updated 2 years ago
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆120Updated 2 years ago
- A Cython implementation of the affine gap string distance☆57Updated 3 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆30Updated 3 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆270Updated last year
- Containerized distributed programming framework for Python☆53Updated 2 years ago
- Python wrapper for RE2☆106Updated 3 months ago
- Python Cypher Querybuilder☆178Updated 2 years ago
- A consistent table management library in python☆160Updated 2 years ago
- Example of using Faust with Docker☆23Updated 6 years ago
- A validation library for Pandas data frames using user-friendly schemas☆193Updated 2 years ago
- Transport classes and utilities shared among Python Elastic client libraries☆23Updated 3 weeks ago
- persistent caching to memory, disk, or database☆278Updated last week
- Battle-tested Apache Storm Multi-Lang implementation for Python☆70Updated 6 months ago
- Tools for test driven data-wrangling and data validation.☆295Updated 4 years ago
- Stream Processing Made Easy☆42Updated 3 years ago
- LRU cache for Python. Use Redis as backend. Provides a dictionary-like object as well as a method decorator. pip install redis-lru☆43Updated 3 years ago
- Concurrent appendable key-value storage☆107Updated last year
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Updated 4 years ago