TristanBilot / bqfetchLinks
A lightweight tool to fetch tables from BigQuery as pandas DataFrame very fast using BigQuery Storage API combined with multiprocessing
☆27Updated 2 years ago
Alternatives and similar repositories for bqfetch
Users that are interested in bqfetch are comparing it to the libraries listed below
Sorting:
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated 2 months ago
- Build your feature store with macros right within your dbt repository☆39Updated 3 years ago
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market☆58Updated 3 years ago
- Experimental MLflow plugin for Google Cloud Vertex AI☆38Updated 7 months ago
- ☆48Updated last year
- Templates for your Kedro projects.☆82Updated this week
- Cost Efficient Data Pipelines with DuckDB☆60Updated 7 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆119Updated 5 months ago
- Sample configuration to deploy a modern data platform.☆89Updated 4 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆56Updated 6 months ago
- Demo on how to use Prefect with Docker☆27Updated 3 years ago
- ☆47Updated last year
- The easiest way to integrate Kedro and Great Expectations☆54Updated 3 years ago
- Sample projects using Ploomber.☆86Updated last year
- Demo repository to lambda-fy your dbt runs☆11Updated 2 years ago
- ☆42Updated last month
- BigQuery DataFrames (also known as BigFrames)☆278Updated this week
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Updated 2 years ago
- Contribute to dlt verified sources 🔥☆102Updated last month
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.☆135Updated 2 years ago
- Great Expectations Airflow operator☆169Updated last month
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- 🌳 WALD Stack Demo 🏎️☆33Updated last year
- A tool to deploy a mostly serverless MLflow tracking server on a GCP project with one command☆72Updated 7 months ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆94Updated last year
- Supporting materials/code examples for my course in data engineering for machine learning.☆39Updated 3 years ago
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆90Updated last month
- An LLM-powered chatbot with the added context of the dbt knowledge base.☆39Updated last year
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- Linear regression in SQL using dbt☆75Updated 2 weeks ago