Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
☆201 · May 3, 2026 · Updated this week
Alternatives and similar repositories for starlake
Users that are interested in starlake are comparing it to the libraries listed below.
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type… ☆73 · Apr 20, 2026 · Updated 2 weeks ago
- ☆19 · Nov 22, 2024 · Updated last year
- Department of Education (DOE) for New South Wales (AUS) data stack in a box ☆35 · Nov 13, 2024 · Updated last year
- Boiling Insights - From raw S3 data to charts in seconds ☆23 · Dec 12, 2024 · Updated last year
- BigQuery test kit is a framework written in python that allows you to be more confident in your SQL and check that they are ready to prod… ☆55 · Feb 5, 2024 · Updated 2 years ago
- Create data pipeline with sqlmesh orchestrated by dagster ☆29 · Oct 27, 2025 · Updated 6 months ago
- Turning PySpark Into a Universal DataFrame API ☆506 · Apr 21, 2026 · Updated 2 weeks ago
- A DataOps framework for building a lakehouse. ☆57 · Updated this week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆125 · Feb 5, 2025 · Updated last year
- ☆30 · Dec 4, 2024 · Updated last year
- Example projects built on MotherDuck ☆49 · Apr 26, 2026 · Updated last week
- Scalable and efficient data transformation framework - backwards compatible with dbt. ☆3,057 · Apr 29, 2026 · Updated last week
- A configuration-driven framework for building Dagster pipelines that enables teams to create and manage data workflows using YAML/JSON in… ☆38 · Nov 13, 2024 · Updated last year
- Repo for orienting dbt users to the Dagster asset framework ☆56 · Oct 24, 2022 · Updated 3 years ago
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files ☆277 · Jan 29, 2026 · Updated 3 months ago
- 🏃‍♀️ Minimalist SQL orchestrator ☆321 · Apr 28, 2026 · Updated last week
- DuckDB Driver for SQLTools ☆29 · Sep 27, 2025 · Updated 7 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆60 · Apr 5, 2026 · Updated last month
- How to run DBT on AWS Fargate ☆13 · Oct 15, 2019 · Updated 6 years ago
- The universal metrics layer. Compatible with 15+ formats: Cube, MetricFlow, LookML, Omni, BSL, LDM, Cortex, Malloy, OSI, SML, TML, Hex, R… ☆89 · Apr 23, 2026 · Updated 2 weeks ago
- dbt plugin for Palm CLI ☆20 · Mar 20, 2024 · Updated 2 years ago
- Orchestrate Modal and OpenAI workloads with Dagster ☆13 · Dec 11, 2024 · Updated last year
- Load dbt artifacts uploaded to GCS to BigQuery in order to track historical dbt results ☆17 · Oct 26, 2023 · Updated 2 years ago
- ☆11 · Jul 20, 2023 · Updated 2 years ago
- Climate Resilience ☆15 · Mar 11, 2025 · Updated last year
- Use DuckDB within Excel with the xlDuckDb addin ☆131 · Apr 23, 2026 · Updated 2 weeks ago
- ☆13 · Aug 27, 2021 · Updated 4 years ago
- Dagster Labs' open-source data platform, built with Dagster. ☆458 · Updated this week
- BigTesty is a framework that allows to create Integration Tests with BigQuery on a real and short lived Infrastructure. ☆67 · Jul 18, 2024 · Updated last year
- Demonstrating the concept of Google PubSub, a messaging queue service in Google, thru streaming fake financial data thru PubSub and query… ☆15 · Aug 29, 2017 · Updated 8 years ago
- A collection of awesome resources for Evidence ☆29 · Dec 11, 2024 · Updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included. ☆10 · May 12, 2023 · Updated 2 years ago
- Trino Iceberg Metadata Insights via Streamlit ☆16 · Apr 9, 2025 · Updated last year
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different. ☆28 · Dec 13, 2017 · Updated 8 years ago
- The fastest business intelligence tool for humans and agents. ☆2,610 · Updated this week
- Malloy is a modern open source language for describing data relationships and transformations. ☆2,460 · Updated this week
- ☆47 · Jul 4, 2024 · Updated last year
- Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database. ☆849 · Updated this week
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases. ☆56 · Feb 13, 2022 · Updated 4 years ago