fuyb1992 / es_pandas
Read, write and update large scale pandas DataFrame with Elasticsearch
☆35Updated 2 months ago
Alternatives and similar repositories for es_pandas:
Users that are interested in es_pandas are comparing it to the libraries listed below
- Use dask to fetch data from Elasticsearch in parallel by sending the request to each shard separatelly.☆20Updated 4 years ago
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated last year
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆109Updated last year
- ☆30Updated 3 years ago
- An Elasticsearch client exposing DataFrame API☆285Updated last year
- python automatic data quality check toolkit☆284Updated 4 years ago
- simple, flexible, offline capable, cloud storage with a Python path-like interface☆173Updated 7 months ago
- The Python implementation of JavaScript Library RoughViz to create interactive sketchy charts☆93Updated 9 months ago
- Joblib Apache Spark Backend☆244Updated 5 months ago
- A simple guide to understand Prefect and make it work with your own docker-compose configuration.☆160Updated 8 months ago
- MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.☆95Updated this week
- Docker images for dask☆234Updated last week
- Simple, lightweight, extensible DAG framework for Python with a Kubeflow-like API☆74Updated 11 months ago
- Type System for Data Analysis in Python☆210Updated 5 months ago
- A high-performance Python Kafka client. Efficiently from Kafka to Pandas and back.☆40Updated 5 years ago
- AdminLTE3 Dash components☆87Updated 2 years ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆71Updated 3 years ago
- Apache Avro <-> pandas DataFrame☆136Updated 6 months ago
- A validation library for Pandas data frames using user-friendly schemas☆190Updated last year
- Streaming reactive and dataflow graphs in Python☆444Updated 2 months ago
- Extend pandas to_sql function to perform multi-threaded, concurrent "insert or update" command in memory☆84Updated 10 months ago
- MLflow-tracking server example with Minio and H2O☆18Updated 5 years ago
- Pylint plugin for static code analysis on Airflow code☆91Updated 4 years ago
- Repository to maintain infrastructure to automate Data Workflows☆34Updated 3 years ago
- dagster scikit-learn pipeline example.☆44Updated last year
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆83Updated this week
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)☆14Updated 4 months ago
- O!My Models (omymodels) is a library to generate Pydantic, Dataclasses, GinoORM Models, SqlAlchemy ORM, SqlAlchemy Core Table, Models fr…☆180Updated 4 months ago
- Example project using Cython and Poetry to build obfuscated .whl files for PyPI distribution☆17Updated 4 years ago
- A schema analyser for MongoDB, written in Python.☆75Updated 2 years ago