Declarative, text-based tool for data analysts and engineers to extract, load, transform, and orchestrate their data pipelines.
☆194Apr 12, 2026Updated this week
Alternatives and similar repositories for starlake
Users who are interested in starlake are comparing it to the libraries listed below.
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆73Mar 28, 2026Updated 3 weeks ago
- ☆19Nov 22, 2024Updated last year
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆35Nov 13, 2024Updated last year
- Boiling Insights - From raw S3 data to charts in seconds☆23Dec 12, 2024Updated last year
- BigQuery test kit is a framework written in Python that allows you to be more confident in your SQL and check that they are ready to prod…☆55Feb 5, 2024Updated 2 years ago
- Create data pipeline with sqlmesh orchestrated by dagster☆29Oct 27, 2025Updated 5 months ago
- Turning PySpark Into a Universal DataFrame API☆501Updated this week
- A DataOps framework for building a lakehouse.☆56Updated this week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆125Feb 5, 2025Updated last year
- ☆30Dec 4, 2024Updated last year
- The smallest DuckDB SQL orchestrator on Earth.☆343Nov 22, 2025Updated 4 months ago
- Example projects built on MotherDuck☆48Updated this week
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆3,029Apr 8, 2026Updated last week
- A configuration-driven framework for building Dagster pipelines that enables teams to create and manage data workflows using YAML/JSON in…☆38Nov 13, 2024Updated last year
- Repo for orienting dbt users to the Dagster asset framework☆56Oct 24, 2022Updated 3 years ago
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆277Jan 29, 2026Updated 2 months ago
- 🏃♀️ Minimalist SQL orchestrator☆320Updated this week
- DuckDB Driver for SQLTools☆28Sep 27, 2025Updated 6 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆60Apr 5, 2026Updated last week
- How to run DBT on AWS Fargate☆13Oct 15, 2019Updated 6 years ago
- dbt plugin for Palm CLI☆20Mar 20, 2024Updated 2 years ago
- The universal metrics layer. Compatible with 15+ formats: Cube, MetricFlow, LookML, Omni, BSL, LDM, Cortex, Malloy, OSI, SML, TML, Hex, R…☆84Updated this week
- Load dbt artifacts uploaded to GCS to BigQuery in order to track historical dbt results☆17Oct 26, 2023Updated 2 years ago
- ☆11Jul 20, 2023Updated 2 years ago
- Climate Resilience☆15Mar 11, 2025Updated last year
- Use DuckDB within Excel with the xlDuckDb addin☆126Apr 10, 2026Updated last week
- Dagster Labs' open-source data platform, built with Dagster.☆454Apr 10, 2026Updated last week
- ☆13Aug 27, 2021Updated 4 years ago
- BigTesty is a framework for creating integration tests with BigQuery on real, short-lived infrastructure.☆67Jul 18, 2024Updated last year
- Demonstrates Google PubSub, Google's messaging queue service, by streaming fake financial data through PubSub and query…☆15Aug 29, 2017Updated 8 years ago
- A collection of awesome resources for Evidence☆29Dec 11, 2024Updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- ☆49Jul 25, 2024Updated last year
- Trino Iceberg Metadata Insights via Streamlit☆16Apr 9, 2025Updated last year
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Dec 13, 2017Updated 8 years ago
- The fastest business intelligence tool for humans and agents.☆2,555Updated this week
- A low-overhead sampling profiler for PySpark, that outputs Flame Graphs☆16Dec 17, 2020Updated 5 years ago
- Malloy is a modern open source language for describing data relationships and transformations.☆2,450Updated this week
- Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.☆839Updated this week