fuyb1992 / es_pandasLinks
Read, write and update large scale pandas DataFrame with Elasticsearch
☆35Updated 6 months ago
Alternatives and similar repositories for es_pandas
Users that are interested in es_pandas are comparing it to the libraries listed below
Sorting:
- An Elasticsearch client exposing DataFrame API☆284Updated 2 years ago
- Docker images for dask☆241Updated 2 weeks ago
- As a believer of learning through examples, I have decided to put my own examples of Gremlin queries inside Jupyter Notebooks for people …☆32Updated 5 years ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 3 years ago
- Pylint plugin for static code analysis on Airflow code☆95Updated 4 years ago
- python automatic data quality check toolkit☆283Updated 4 years ago
- Shared-memory Python object namespace with Apache Plasma. Built because of Plotly Dash, useful anywhere.☆83Updated 4 months ago
- Joblib Apache Spark Backend☆247Updated 2 months ago
- Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch☆677Updated 3 weeks ago
- Apache Avro <-> pandas DataFrame☆137Updated 10 months ago
- PostgreSQL offline and online stores for Feast☆32Updated 3 years ago
- Use dask to fetch data from Elasticsearch in parallel by sending the request to each shard separatelly.☆20Updated 4 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆201Updated last month
- pandabase links DataFrames to SQL databases using primary keys.☆21Updated 4 years ago
- ☆127Updated 4 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Python module for Apache ORC file format☆66Updated 3 months ago
- ☆30Updated 3 years ago
- python implementation of jordansissel's grok regular expression library☆280Updated last year
- Airflow plugin to export dag and task based metrics to Prometheus.☆252Updated last week
- A web frontend for scheduling Jupyter notebook reports☆253Updated 6 months ago
- Simple YAML configuration file parser☆80Updated 11 months ago
- HandySpark - bringing pandas-like capabilities to Spark dataframes☆193Updated 6 years ago
- Python module for interacting with geohashes☆166Updated 2 weeks ago
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆115Updated last year
- Airflow Backfill UI based plugin for existing / new Airflow environment☆65Updated 4 years ago
- triggering a DAG run multiple times☆88Updated last year
- Helm charts for Dask☆96Updated 2 weeks ago
- material-ui components for Dash☆196Updated 6 months ago
- Presto and Minio on Docker Infrastructure☆42Updated 6 years ago