Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
☆202May 24, 2026Updated this week
Alternatives and similar repositories for starlake
Users that are interested in starlake are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆73Apr 20, 2026Updated last month
- ☆20Nov 22, 2024Updated last year
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆36Nov 13, 2024Updated last year
- Boiling Insights - From raw S3 data to charts in seconds☆23Dec 12, 2024Updated last year
- BigQuery test kit is a framework written in python that allows you to be more confident in your SQL and check that they are ready to prod…☆55Feb 5, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Create data pipeline with sqlmesh orchestrated by dagster☆30Oct 27, 2025Updated 7 months ago
- Turning PySpark Into a Universal DataFrame API☆507May 20, 2026Updated last week
- A DataOps framework for building a lakehouse.☆57May 22, 2026Updated last week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆125Feb 5, 2025Updated last year
- ☆30Dec 4, 2024Updated last year
- The smallest DuckDB SQL orchestrator on Earth.☆345Nov 22, 2025Updated 6 months ago
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆3,092Updated this week
- A configuration-driven framework for building Dagster pipelines that enables teams to create and manage data workflows using YAML/JSON in…☆37Nov 13, 2024Updated last year
- Example projects built on MotherDuck☆52May 19, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆276Updated this week
- Repo for orienting dbt users to the Dagster asset framework☆56Oct 24, 2022Updated 3 years ago
- 🏃♀️ Minimalist SQL orchestrator☆324Updated this week
- DuckDB Driver for SQLTools☆29Sep 27, 2025Updated 8 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆60Apr 5, 2026Updated last month
- How to run DBT on AWS Fargate☆13Oct 15, 2019Updated 6 years ago
- Tools for analyzing timestamped position data particularly vessel position reports from AIS.☆16Jun 3, 2024Updated last year
- The universal metrics layer. Compatible with 15+ formats: Cube, MetricFlow, LookML, Omni, BSL, LDM, Cortex, Malloy, OSI, SML, TML, Hex, R…☆94Updated this week
- dbt plugin for Palm CLI☆20Mar 20, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Orchestrate Modal and OpenAI workloads with Dagster☆13Dec 11, 2024Updated last year
- Load dbt artifacts uploaded to GCS to BigQuery in order to track historical dbt results☆17Oct 26, 2023Updated 2 years ago
- ☆11Jul 20, 2023Updated 2 years ago
- Use DuckDB within Excel with the xlDuckDb addin☆137Apr 23, 2026Updated last month
- ☆13Aug 27, 2021Updated 4 years ago
- Dagster Labs' open-source data platform, built with Dagster.☆460May 22, 2026Updated last week
- Demonstrating the concept of Google PubSub, a messaging queue service in Google, thru streaming fake financial data thru PubSub and query…☆15Aug 29, 2017Updated 8 years ago
- A collection of awesome resources for Evidence☆29Dec 11, 2024Updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆49Jul 25, 2024Updated last year
- The fastest business intelligence tool for humans and agents.☆2,634Updated this week
- Trino Iceberg Metadata Insights via Streamlit☆16Apr 9, 2025Updated last year
- Sample configuration to deploy a modern data platform.☆89Dec 28, 2021Updated 4 years ago
- A low-overhead sampling profiler for PySpark, that outputs Flame Graphs☆16Dec 17, 2020Updated 5 years ago
- Malloy is a modern open source language for describing data relationships and transformations.☆2,472May 21, 2026Updated last week
- ☆47Jul 4, 2024Updated last year