Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
☆187Mar 1, 2026Updated last week
Alternatives and similar repositories for starlake
Users that are interested in starlake are comparing it to the libraries listed below
Sorting:
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆68Jan 26, 2026Updated last month
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆36Nov 13, 2024Updated last year
- Boiling Insights - From raw S3 data to charts in seconds☆22Dec 12, 2024Updated last year
- Turning PySpark Into a Universal DataFrame API☆493Updated this week
- Natural Language Processing Project☆11Jul 6, 2021Updated 4 years ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆124Feb 5, 2025Updated last year
- ☆48Jul 25, 2024Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Mar 25, 2024Updated last year
- dbt plugin for Palm CLI☆20Mar 20, 2024Updated last year
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆2,928Mar 2, 2026Updated last week
- ☆30Dec 4, 2024Updated last year
- A DataOps framework for building a lakehouse.☆56Updated this week
- ☆47Jul 4, 2024Updated last year
- learning-by-doing data model built with dbt-core☆15Dec 13, 2025Updated 2 months ago
- Tools for analyzing timestamped position data particularly vessel position reports from AIS.☆16Jun 3, 2024Updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- The smallest DuckDB SQL orchestrator on Earth.☆337Nov 22, 2025Updated 3 months ago
- BigQuery test kit is a framework written in python that allows you to be more confident in your SQL and check that they are ready to prod…☆55Feb 5, 2024Updated 2 years ago
- 🏃♀️ Minimalist SQL orchestrator☆307Updated this week
- Example projects built on MotherDuck☆43Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆55Oct 13, 2025Updated 4 months ago
- Orchestrate Modal and OpenAI workloads with Dagster☆13Dec 11, 2024Updated last year
- Climate Resilience☆15Mar 11, 2025Updated 11 months ago
- ☆11Jul 20, 2023Updated 2 years ago
- Trino Iceberg Metadata Insights via Streamlit☆15Apr 9, 2025Updated 11 months ago
- Dagster Labs' open-source data platform, built with Dagster.☆441Updated this week
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆274Jan 29, 2026Updated last month
- ☆36Mar 2, 2026Updated last week
- Repo for orienting dbt users to the Dagster asset framework☆56Oct 24, 2022Updated 3 years ago
- A low-overhead sampling profiler for PySpark, that outputs Flame Graphs☆16Dec 17, 2020Updated 5 years ago
- Demonstrating the concept of Google PubSub, a messaging queue service in Google, thru streaming fake financial data thru PubSub and query…☆15Aug 29, 2017Updated 8 years ago
- Terraform scripts for provisioning nodes in the Hetzner cloud☆20Oct 6, 2024Updated last year
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆73Feb 14, 2026Updated 3 weeks ago
- Use DuckDB within Excel with the xlDuckDb addin☆115Dec 5, 2025Updated 3 months ago
- Load dbt artifacts uploaded to GCS to BigQuery in order to track historical dbt results☆17Oct 26, 2023Updated 2 years ago
- Centralized whale instance using github actions, sourcing metadata from bigquery-public-data.☆18Jun 15, 2024Updated last year
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆46Updated this week
- APISpec plugin to import OpenAPI specifications from a file☆19Jun 24, 2022Updated 3 years ago
- Explore conversational landscapes with AI - built with Next.js, LangChain, Memgraph, and Orbit☆21Nov 20, 2023Updated 2 years ago