mehd-io / duckdb-pyspark-demo
Demo of DuckDB's Spark API implementation: the same PySpark code, but with DuckDB under the hood
☆14Updated last year
Alternatives and similar repositories for duckdb-pyspark-demo:
Users who are interested in duckdb-pyspark-demo are comparing it to the repositories listed below
- Cost Efficient Data Pipelines with DuckDB☆52Updated 8 months ago
- Demo on how to use Prefect with Docker☆25Updated 2 years ago
- Jupyter Cell / Line Magics for DuckDB☆48Updated 2 months ago
- ☆18Updated 9 months ago
- rust-for-data☆45Updated last year
- A FastMCP tool to search and retrieve Polars API documentation.☆46Updated last week
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 2 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- A Beginner's Guide to DuckDB's Python Client☆41Updated 6 months ago
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆96Updated this week
- Repo for orienting dbt users to the Dagster asset framework☆54Updated 2 years ago
- Fake Pandas / PySpark DataFrame creator☆46Updated last year
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆32Updated 3 months ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S…☆17Updated 9 months ago
- A monorepo of many Rill example projects☆36Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆49Updated 5 months ago
- The Modern Data Stack in a Python package☆49Updated last year
- ☆38Updated 10 months ago
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them☆26Updated last year
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market☆58Updated 2 years ago
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆32Updated 5 months ago
- Demo of DuckDB with dashboarding tools: Evidence, Streamlit and Rill☆16Updated last year
- Dask integration for Snowflake☆30Updated 5 months ago
- ☆16Updated last year
- Building 3D Trusted Data Pipelines With Dagster, dbt, and DuckDB☆20Updated last year
- ☆11Updated 5 months ago
- Linear regression in SQL using dbt☆70Updated 3 months ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆76Updated 2 months ago
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from…☆33Updated 3 months ago