Declarative, text-based tool for data analysts and engineers to extract, load, transform, and orchestrate their data pipelines.
☆191Mar 22, 2026Updated last week
Alternatives and similar repositories for starlake
Users that are interested in starlake are comparing it to the libraries listed below.
- ☆19Nov 22, 2024Updated last year
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆36Nov 13, 2024Updated last year
- Boiling Insights - From raw S3 data to charts in seconds☆23Dec 12, 2024Updated last year
- BigQuery test kit is a framework written in python that allows you to be more confident in your SQL and check that they are ready to prod…☆55Feb 5, 2024Updated 2 years ago
- Create data pipeline with sqlmesh orchestrated by dagster☆28Oct 27, 2025Updated 5 months ago
- Turning PySpark Into a Universal DataFrame API☆497Updated this week
- A DataOps framework for building a lakehouse.☆56Updated this week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆124Feb 5, 2025Updated last year
- ☆30Dec 4, 2024Updated last year
- The smallest DuckDB SQL orchestrator on Earth.☆342Nov 22, 2025Updated 4 months ago
- Example projects built on MotherDuck☆46Updated this week
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆2,975Mar 19, 2026Updated last week
- A configuration-driven framework for building Dagster pipelines that enables teams to create and manage data workflows using YAML/JSON in…☆38Nov 13, 2024Updated last year
- Repo for orienting dbt users to the Dagster asset framework☆56Oct 24, 2022Updated 3 years ago
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆276Jan 29, 2026Updated 2 months ago
- 🏃‍♀️ Minimalist SQL orchestrator☆314Mar 17, 2026Updated last week
- DuckDB Driver for SQLTools☆28Sep 27, 2025Updated 6 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆59Mar 9, 2026Updated 2 weeks ago
- How to run DBT on AWS Fargate☆13Oct 15, 2019Updated 6 years ago
- The universal metrics layer. Compatible with 15+ formats: Cube, MetricFlow, LookML, Omni, BSL, LDM, Cortex, Malloy, OSI, SML, TML, Hex, R…☆77Feb 25, 2026Updated last month
- Tools for analyzing timestamped position data particularly vessel position reports from AIS.☆16Jun 3, 2024Updated last year
- dbt plugin for Palm CLI☆20Mar 20, 2024Updated 2 years ago
- Orchestrate Modal and OpenAI workloads with Dagster☆13Dec 11, 2024Updated last year
- Load dbt artifacts uploaded to GCS to BigQuery in order to track historical dbt results☆17Oct 26, 2023Updated 2 years ago
- ☆11Jul 20, 2023Updated 2 years ago
- Climate Resilience☆15Mar 11, 2025Updated last year
- Use DuckDB within Excel with the xlDuckDb addin☆126Updated this week
- ☆13Aug 27, 2021Updated 4 years ago
- Dagster Labs' open-source data platform, built with Dagster.☆450Mar 19, 2026Updated last week
- BigTesty is a framework that allows to create Integration Tests with BigQuery on a real and short lived Infrastructure.☆67Jul 18, 2024Updated last year
- Demonstrating the concept of Google PubSub, a messaging queue service in Google, thru streaming fake financial data thru PubSub and query…☆15Aug 29, 2017Updated 8 years ago
- A collection of awesome resources for Evidence☆29Dec 11, 2024Updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- Trino Iceberg Metadata Insights via Streamlit☆16Apr 9, 2025Updated 11 months ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Dec 13, 2017Updated 8 years ago
- The fastest business intelligence tool for humans and agents.☆2,537Updated this week
- Malloy is a modern open source language for describing data relationships and transformations.☆2,429Updated this week
- A low-overhead sampling profiler for PySpark, that outputs Flame Graphs☆16Dec 17, 2020Updated 5 years ago
- Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.☆824Updated this week