coiled / dask-bigquery
☆46Updated 9 months ago
Alternatives and similar repositories for dask-bigquery:
Users who are interested in dask-bigquery are comparing it to the libraries listed below.
- Dask integration for Snowflake☆30Updated 5 months ago
- A Delta Lake reader for Dask☆49Updated 7 months ago
- Experimental MLflow plugin for Google Cloud Vertex AI☆37Updated last year
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 8 months ago
- Extension dtypes for pandas corresponding to GoogleSQL data types such as DATE, TIME, and JSON.☆30Updated this week
- BigQuery backend for Ibis☆19Updated 2 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Demo repository to lambda-fy your dbt runs☆11Updated last year
- A Python library bakeoff for medium-sized datasets☆24Updated last year
- Dagster scikit-learn pipeline example.☆44Updated 2 years ago
- A library to use `modal` as a backend for `joblib`.☆28Updated 3 months ago
- Notebooks that support blog posts and tech talks on Dask / Coiled.☆47Updated 2 months ago
- ☆30Updated 3 years ago
- An extension to add Prefect flow visualizations into your Sphinx documentation.☆13Updated 3 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Fake Pandas / PySpark DataFrame creator☆46Updated last year
- Cluster tools for running Dask on Databricks☆13Updated 11 months ago
- An abstraction layer for parameter tuning☆35Updated 8 months ago
- ☆34Updated last month
- Primrose modeling framework for simple production models☆32Updated last year
- Running Python Code in BigQuery UDFs☆24Updated 4 years ago
- A Kedro plugin that provides pandas drop-in replacements for the pandas datasets (e.g. modin and cuDF)☆12Updated 4 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- A lightweight tool to fetch tables from BigQuery as pandas DataFrames very quickly, using the BigQuery Storage API combined with multiprocessing☆27Updated last year
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Updated last year
- Kedro Plugin to support running pipelines on Kubernetes using Airflow.☆28Updated last month
- A place to provide Coiled feedback☆18Updated 2 months ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆28Updated 2 years ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆77Updated 2 months ago
- Intake examples☆33Updated last year