coiled / dask-bigquery
☆48Updated last year
Alternatives and similar repositories for dask-bigquery
Users that are interested in dask-bigquery are comparing it to the libraries listed below
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated 2 months ago
- Dask integration for Snowflake☆30Updated 5 months ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more...☆145Updated 2 months ago
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!☆139Updated 2 weeks ago
- A Delta Lake reader for Dask☆53Updated 5 months ago
- A lightweight tool to fetch tables from BigQuery as pandas DataFrames very quickly, using the BigQuery Storage API combined with multiprocessing☆27Updated 2 years ago
- Unified Distributed Execution☆57Updated last year
- Apache Avro <-> pandas DataFrame☆138Updated 4 months ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 4 years ago
- A web frontend for scheduling Jupyter notebook reports☆254Updated last year
- Primrose modeling framework for simple production models☆33Updated last year
- Build your feature store with macros right within your dbt repository☆39Updated 3 years ago
- ☆40Updated last month
- Coming soon☆62Updated 2 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆56Updated 6 months ago
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆36Updated 3 years ago
- dagster scikit-learn pipeline example.☆46Updated 2 years ago
- Python stream processing for analytics☆41Updated 3 weeks ago
- JupyterHub extension for ContainDS Dashboards☆201Updated last year
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆84Updated 3 months ago
- Distributed persistent Task Queue running on Dask☆38Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆119Updated 5 months ago
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks☆21Updated 3 years ago
- Notebooks that support blog posts and tech talks on Dask / Coiled.☆48Updated 10 months ago
- RFC document, tooling and other content related to the dataframe API standard☆107Updated last year
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Updated last month
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆86Updated last year
- Experimental MLflow plugin for Google Cloud Vertex AI☆38Updated 7 months ago
- DataFrame support for scikit-learn.☆63Updated 3 months ago
- Fake Pandas / PySpark DataFrame creator☆48Updated last year