A framework to manage data, continuously
☆33Jan 20, 2025Updated last year
Alternatives and similar repositories for cdf
Users that are interested in cdf are comparing it to the libraries listed below
Sorting:
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu…☆59Mar 29, 2023Updated 2 years ago
- ☆38Aug 29, 2025Updated 6 months ago
- target-bigquery is a Singer target for BigQuery. It supports storage write, GCS, streaming, and batch load methods. Built with the Melta…☆35Jul 1, 2025Updated 8 months ago
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from…☆38Jan 23, 2025Updated last year
- Contribute to dlt verified sources 🔥☆106Dec 9, 2025Updated 2 months ago
- Utility functions for dbt projects running on Athena☆12Mar 25, 2025Updated 11 months ago
- ☆11Aug 21, 2025Updated 6 months ago
- Personal project for setting up an open source data warehouse.☆32Jul 11, 2025Updated 7 months ago
- ☆17Nov 25, 2024Updated last year
- Redshift Python library for user agent detection (browsers, devices, etc) and parsing via UDFs☆10May 27, 2020Updated 5 years ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆124Feb 5, 2025Updated last year
- Template for getting started with Hybrid Dagster Cloud☆14Sep 19, 2025Updated 5 months ago
- A leightweight UI for Lakekeeper☆16Updated this week
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆274Jan 29, 2026Updated last month
- A collection of utilities and tools for teams and organizations using dbt☆15Nov 24, 2023Updated 2 years ago
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆40May 11, 2025Updated 9 months ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆46Updated this week
- ☆20Jun 28, 2023Updated 2 years ago
- demo examples how to load data from different sources to different destinations☆28Jan 29, 2026Updated last month
- ☆22Jul 18, 2024Updated last year
- ☆23Feb 14, 2025Updated last year
- A Singer tap that wraps Airbyte sources allowing them to be consumed by Singer targets☆26Mar 20, 2025Updated 11 months ago
- 📦 Serverless and local-first Open Data Platform☆308Jan 22, 2026Updated last month
- ☆21Aug 8, 2024Updated last year
- titan: a package manager for Snowflake DB☆23Oct 3, 2022Updated 3 years ago
- ☆48Dec 23, 2020Updated 5 years ago
- A dbt-core plugin to weave together multi-project dbt-core deployments☆191Jan 24, 2026Updated last month
- A dbt artifacts parser in python☆111Updated this week
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects.☆19Nov 28, 2023Updated 2 years ago
- ELT With Airflow Helper - Classes and functions to make apache airflow life easier☆12Updated this week
- ☆30Dec 4, 2024Updated last year
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆25Mar 3, 2024Updated 2 years ago
- Automatically creates dbt exposures from your BI tools. It currently supports Tableau (connecting to Snowflake).☆62Jan 25, 2024Updated 2 years ago
- Demo Project for Open Source MDS☆170Aug 27, 2025Updated 6 months ago
- fst: flow state tool | smooth where you want it, friction where you need it when data engineering☆33Jun 13, 2023Updated 2 years ago
- ☆59Jul 3, 2024Updated last year
- Nicely modeled data built on the Github Archive.☆69Jan 23, 2026Updated last month
- Python library for working with ThoughtSpot Modeling Language (TML) files programmatically☆10Oct 10, 2025Updated 4 months ago
- Provides automated YAML management and a streamlit workbench. Designed to optimize dev workflows.☆611Feb 5, 2026Updated 3 weeks ago