fuyb1992 / es_pandas
Read, write and update large-scale pandas DataFrames with Elasticsearch
☆35 Updated 10 months ago
Alternatives and similar repositories for es_pandas
Users who are interested in es_pandas are comparing it to the libraries listed below.
- A DBAPI and SQLAlchemy dialect for Elasticsearch ☆120 Updated last year
- Joblib Apache Spark Backend ☆249 Updated 6 months ago
- Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch ☆683 Updated this week
- Docker images for dask ☆242 Updated last month
- An Elasticsearch client exposing DataFrame API ☆284 Updated 2 years ago
- Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases ☆233 Updated 3 years ago
- Fast iterative local development and testing of Apache Airflow workflows ☆201 Updated 2 months ago
- A simple guide to understand Prefect and make it work with your own docker-compose configuration. ☆161 Updated last year
- pandabase links DataFrames to SQL databases using primary keys. ☆21 Updated 5 years ago
- SQL upsert using pandas DataFrames for PostgreSQL, SQlite and MySQL with extra features ☆231 Updated last year
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet ☆197 Updated 2 years ago
- Prometheus Exporter for Airflow ☆161 Updated last year
- python automatic data quality check toolkit ☆282 Updated 5 years ago
- Airflow Backfill UI based plugin for existing / new Airflow environment ☆65 Updated 4 years ago
- PostgreSQL offline and online stores for Feast ☆32 Updated 3 years ago
- Pylint plugin for static code analysis on Airflow code ☆96 Updated 5 years ago
- Monitor Apache Spark from Jupyter Notebook ☆172 Updated 3 years ago
- Airflow plugin to export dag and task based metrics to Prometheus. ☆262 Updated this week
- Simple YAML configuration file parser ☆79 Updated last year
- Shared-memory Python object namespace with Apache Plasma. Built because of Plotly Dash, useful anywhere. ☆83 Updated 9 months ago
- Fast, resilient and reproducible data analysis with cached SQL queries ☆30 Updated 2 years ago
- Apache Avro <-> pandas DataFrame ☆138 Updated last month
- VerticaPy is a Python library that exposes sci-kit like functionality to conduct data science projects on data stored in Vertica, thus ta… ☆225 Updated last week
- Jupyter Notebooks in S3 - Jupyter Contents Manager implementation ☆256 Updated last month
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes. ☆267 Updated 6 months ago
- The Prefect API and backend ☆246 Updated 2 years ago
- A web frontend for scheduling Jupyter notebook reports ☆254 Updated 10 months ago
- 🚎 Notebook sharing hub ☆500 Updated 2 years ago
- python implementation of the parquet columnar file format. ☆863 Updated 2 weeks ago
- An extendable Docker image for Airbnb's Superset platform, previously known as Caravel. ☆114 Updated 3 years ago