coiled / dask-mongo
☆19Updated last year
Alternatives and similar repositories for dask-mongo:
Users that are interested in dask-mongo are comparing it to the libraries listed below
- ✨ A Pydantic to PySpark schema library☆65Updated this week
- Repository to maintain infrastructure to automate Data Workflows☆34Updated 3 years ago
- ☆26Updated 10 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated 10 months ago
- Python Rest Client to interact against Schema Registry confluent server☆171Updated this week
- An extension to add Prefect flow visualizations into you Sphinx documentation.☆13Updated 2 years ago
- Generate avro schemas from python dataclasses, Pydantic models and Faust Records. Code generation from avro schemas. Serialize/Deserializ…☆222Updated last week
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Updated last year
- Docker images for dask☆234Updated last week
- Fugue collections for Prefect 2.0☆38Updated last year
- PySpark schema generator☆41Updated last year
- kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools.☆27Updated 2 years ago
- Helm charts for Dask☆92Updated this week
- MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.☆95Updated this week
- Dask integration for Snowflake☆30Updated 2 months ago
- Tools for making Prefect work better for typical data science workflows☆19Updated 2 years ago
- Prefect API Authentication/Authorization Proxy for on-premises deployments☆36Updated last month
- A Cookiecutter template for creating Faust projects quickly.☆70Updated 2 years ago
- SQLAlchemy dialect for Turbodbc☆23Updated 8 months ago
- Pandas helper functions☆30Updated last year
- Black for Databricks notebooks☆45Updated 2 weeks ago
- s3path is a pathlib extension for AWS S3 Service☆213Updated 2 months ago
- A Delta Lake reader for Dask☆48Updated 3 months ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆71Updated 3 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆138Updated last week
- Asynchronous actions for PySpark☆47Updated 3 years ago
- An opinionated implementation of exclusively using airflow DockerOperators for all Operators☆18Updated 2 years ago
- pytest plugin to run the tests with support of pyspark☆84Updated 10 months ago
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 3 years ago
- Kedro plugin to support running workflows on Microsoft Azure ML Pipelines☆37Updated 5 months ago