coiled / dask-mongo
☆19Updated 2 years ago
Alternatives and similar repositories for dask-mongo
Users who are interested in dask-mongo are comparing it to the libraries listed below.
- s3path is a pathlib extension for AWS S3 Service☆226Updated 6 months ago
- Black for Databricks notebooks☆47Updated 7 months ago
- Docker images for dask☆244Updated 3 weeks ago
- Pythonic file-system interface for Google Cloud Storage☆381Updated this week
- Pandas helper functions☆31Updated 2 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more...☆145Updated 2 months ago
- Repository to maintain infrastructure to automate Data Workflows☆35Updated 4 years ago
- Python Rest Client to interact against Schema Registry confluent server☆179Updated last month
- pytest plugin to run the tests with support of pyspark☆87Updated 7 months ago
- Typed wrappers over pandas DataFrames with schema validation☆102Updated 2 years ago
- MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.☆113Updated last week
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 4 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 3 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated 2 months ago
- Opinionated helpers for creating py.test fixtures for Docker integration and smoke testing environments☆97Updated 9 months ago
- Pylint plugin for static code analysis on Airflow code☆96Updated 5 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆202Updated 2 weeks ago
- Fast Avro for Python☆692Updated 2 weeks ago
- A data modelling layer built on top of polars and pydantic☆197Updated 2 years ago
- kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools.☆28Updated 3 years ago
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆84Updated this week
- PySpark schema generator☆43Updated 2 years ago
- Kedro Plugin to support running pipelines on Kubernetes using Airflow.☆27Updated 10 months ago
- This library can convert a pydantic class to an avro schema or generate python code from an avro schema.☆82Updated 2 months ago
- Generate avro schemas from python dataclasses, Pydantic models and Faust Records. Code generation from avro schemas. Serialize/Deserializ…☆248Updated this week
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Updated 2 years ago
- A simple guide to understand Prefect and make it work with your own docker-compose configuration.☆161Updated last year
- SQLAlchemy dialect for BigQuery☆488Updated 3 weeks ago
- ✨ A Pydantic to PySpark schema library☆116Updated last week
- Kedro plugin to support running workflows on Microsoft Azure ML Pipelines☆39Updated last month