fuyb1992 / es_pandas
Read, write, and update large-scale pandas DataFrames with Elasticsearch
☆35 Updated 7 months ago
Alternatives and similar repositories for es_pandas
Users who are interested in es_pandas are comparing it to the libraries listed below.
- Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch ☆683 Updated 3 weeks ago
- A DBAPI and SQLAlchemy dialect for Elasticsearch ☆116 Updated last year
- An Elasticsearch client exposing DataFrame API ☆284 Updated 2 years ago
- Docker images for Dask ☆242 Updated this week
- Joblib Apache Spark Backend ☆249 Updated 3 months ago
- Apache Avro <-> pandas DataFrame ☆138 Updated 11 months ago
- PostgreSQL offline and online stores for Feast ☆32 Updated 3 years ago
- Jupyter Notebooks in S3 - Jupyter Contents Manager implementation ☆255 Updated this week
- Simple YAML configuration file parser ☆80 Updated last year
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters. ☆70 Updated 3 years ago
- Airflow Backfill UI based plugin for existing / new Airflow environment ☆65 Updated 4 years ago
- JayDeBeApi module allows you to connect from Python code to databases using Java JDBC. It provides a Python DB-API v2.0 to that database. ☆375 Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆126 Updated 3 years ago
- Flatten JSON in Python ☆547 Updated last year
- Python automatic data quality check toolkit ☆283 Updated 4 years ago
- An efficient Python implementation of the Apriori algorithm. ☆335 Updated last month
- A simple guide to understand Prefect and make it work with your own docker-compose configuration. ☆163 Updated last year
- Distributed SQL Engine in Python using Dask ☆406 Updated 10 months ago
- Fast iterative local development and testing of Apache Airflow workflows ☆202 Updated 2 months ago
- Repository for Docker Image of Apache-Superset. [Docker Image: https://hub.docker.com/r/abhioncbr/docker-superset] ☆103 Updated 4 years ago
- Native Kubernetes integration for Dask ☆323 Updated last week
- Python implementation of the Parquet columnar file format. ☆836 Updated 3 months ago
- JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook ☆92 Updated 2 years ago
- Tool to automate data quality checks on data pipelines ☆254 Updated 2 years ago
- HandySpark - bringing pandas-like capabilities to Spark dataframes ☆196 Updated 6 years ago
- The Prefect API and backend ☆243 Updated last year
- A Python connector for Druid ☆517 Updated 3 weeks ago
- Fast Avro for Python ☆676 Updated last week
- Some abstractions to make creating UIs easier in Dash ☆65 Updated 6 years ago
- DDL parse and convert to BigQuery JSON schema and DDL statements ☆88 Updated last year