fuyb1992 / es_pandasLinks
Read, write and update large scale pandas DataFrame with Elasticsearch
☆35Updated last year
Alternatives and similar repositories for es_pandas
Users that are interested in es_pandas are comparing it to the libraries listed below
Sorting:
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆120Updated last year
- Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch☆691Updated 2 months ago
- An Elasticsearch client exposing DataFrame API☆284Updated 2 years ago
- pandabase links DataFrames to SQL databases using primary keys.☆21Updated 5 years ago
- Simple YAML configuration file parser☆78Updated last year
- Docker images for dask☆244Updated 3 weeks ago
- DataFlows is a simple, intuitive lightweight framework for building data processing flows in python.☆222Updated 7 months ago
- JayDeBeApi module allows you to connect from Python code to databases using Java JDBC. It provides a Python DB-API v2.0 to that database.☆382Updated last year
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 4 years ago
- Flatten JSON in Python☆553Updated 2 years ago
- A high-performance Python Kafka client. Efficiently from Kafka to Pandas and back.☆41Updated 6 years ago
- Jupyter Notebooks in S3 - Jupyter Contents Manager implementation☆256Updated 3 weeks ago
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆196Updated 2 years ago
- Fast Avro for Python☆692Updated 2 weeks ago
- Pandas interface for Clickhouse database☆240Updated 5 years ago
- python automatic data quality check toolkit☆279Updated 5 years ago
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated 2 years ago
- The Prefect API and backend☆248Updated 2 years ago
- Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases☆234Updated 3 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 6 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆202Updated 2 weeks ago
- material-ui components for Dash☆196Updated last year
- JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook☆92Updated 3 years ago
- PostgreSQL offline and online stores for Feast☆32Updated 3 years ago
- Jupyter extensions for SWAN☆58Updated last month
- ☆19Updated 2 years ago
- Presto and Minio on Docker Infrastructure☆43Updated 7 years ago
- Joblib Apache Spark Backend☆249Updated 9 months ago
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.☆267Updated 9 months ago
- Apache Avro <-> pandas DataFrame☆138Updated 4 months ago