fuyb1992 / es_pandasLinks
Read, write and update large scale pandas DataFrame with Elasticsearch
☆35Updated last year
Alternatives and similar repositories for es_pandas
Users that are interested in es_pandas are comparing it to the libraries listed below
Sorting:
- An Elasticsearch client exposing DataFrame API☆284Updated 2 years ago
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆120Updated last year
- Docker images for dask☆244Updated last month
- Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet☆197Updated 2 years ago
- SQL upsert using pandas DataFrames for PostgreSQL, SQlite and MySQL with extra features☆231Updated 2 years ago
- A simple guide to understand Prefect and make it work with your own docker-compose configuration.☆160Updated last year
- Fast iterative local development and testing of Apache Airflow workflows☆202Updated 3 months ago
- Airflow Backfill UI based plugin for existing / new Airflow environment☆64Updated 4 years ago
- Simple, easy-to-use throttler for asyncio.☆126Updated 3 years ago
- manipulate pandas dataframes from the comfort of your browser☆174Updated 4 years ago
- Official repository for pygrametl - ETL programming in Python☆299Updated 2 months ago
- Asynchronous actions for PySpark☆47Updated 4 years ago
- Pylint plugin for static code analysis on Airflow code☆96Updated 5 years ago
- Apache Avro <-> pandas DataFrame☆138Updated 3 months ago
- The Prefect API and backend☆247Updated 2 years ago
- Fast Avro for Python☆689Updated last month
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 6 years ago
- Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases☆231Updated 3 years ago
- DDL parase and Convert to BigQuery JSON schema and DDL statements☆86Updated 2 years ago
- A python wrapper for the KSQL REST API.☆158Updated 2 years ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- Python Rest Client to interact against Schema Registry confluent server☆179Updated 2 weeks ago
- python automatic data quality check toolkit☆282Updated 5 years ago
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated 2 years ago
- Jupyter Notebooks in S3 - Jupyter Contents Manager implementation☆256Updated this week
- MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.☆112Updated last week
- Airflow plugin to export dag and task based metrics to Prometheus.☆269Updated last month
- Prometheus Exporter for Airflow☆161Updated last year
- Simple YAML configuration file parser☆78Updated last year
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.☆267Updated 8 months ago