A framework to manage data, continuously
☆35Jan 20, 2025Updated last year
Alternatives and similar repositories for cdf
Users who are interested in cdf are comparing it to the libraries listed below.
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu…☆60Mar 29, 2023Updated 3 years ago
- ☆38Aug 29, 2025Updated 8 months ago
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from…☆37Jan 23, 2025Updated last year
- ☆17Nov 25, 2024Updated last year
- ☆11Aug 21, 2025Updated 9 months ago
- Official ClickHouse Agentic Data Stack - self-host with ClickHouse, LibreChat, Langfuse, and ClickHouse MCP.☆62May 13, 2026Updated last week
- target-bigquery is a Singer target for BigQuery. It supports storage write, GCS, streaming, and batch load methods. Built with the Melta…☆37Jul 1, 2025Updated 10 months ago
- Contribute to dlt verified sources 🔥☆113Mar 30, 2026Updated last month
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆125Feb 5, 2025Updated last year
- rust-for-data☆53Jul 12, 2023Updated 2 years ago
- Personal project for setting up an open source data warehouse.☆32Jul 11, 2025Updated 10 months ago
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆35Nov 13, 2024Updated last year
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆277Updated this week
- A Singer tap that wraps Airbyte sources allowing them to be consumed by Singer targets☆26Mar 20, 2025Updated last year
- Reusable code for Hive☆16Aug 19, 2014Updated 11 years ago
- Pure functional parser combinator library which supports both applicative and monadic styles of parsing.☆33Sep 20, 2021Updated 4 years ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆47Updated this week
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆40May 11, 2025Updated last year
- Redshift Python library for user agent detection (browsers, devices, etc) and parsing via UDFs☆10May 27, 2020Updated 5 years ago
- Lightweight, open source, locally-hosted Modern Data Stack☆19Apr 7, 2025Updated last year
- A Github Action to run `sdf` CLI in workflows.☆16Nov 21, 2024Updated last year
- titan: a package manager for Snowflake DB☆23Oct 3, 2022Updated 3 years ago
- ☆21Aug 8, 2024Updated last year
- Code for DE101 book at https://de101.startdataengineering.com/☆105Feb 22, 2026Updated 3 months ago
- A collection of utilities and tools for teams and organizations using dbt☆15Nov 24, 2023Updated 2 years ago
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects.☆20Nov 28, 2023Updated 2 years ago
- ELT With Airflow Helper - Classes and functions to make apache airflow life easier☆13Updated this week
- Turning PySpark Into a Universal DataFrame API☆507Updated this week
- Utility functions for dbt projects running on Athena☆12Mar 25, 2025Updated last year
- My playground☆10Feb 27, 2023Updated 3 years ago
- Collection of utilities for working with BigQuery in Apache Beam☆10Nov 13, 2025Updated 6 months ago
- Template for getting started with Hybrid Dagster Cloud☆14Sep 19, 2025Updated 8 months ago
- A lightweight UI for Lakekeeper☆17May 18, 2026Updated last week
- ☆46Jun 10, 2024Updated last year
- Shows a backtrace of your queries☆22May 12, 2018Updated 8 years ago
- A project to create Amazon-style Weekly Business Review reports☆50Mar 6, 2026Updated 2 months ago
- Sample ELT project using Dagster, data load tool and Snowflake☆42Jul 20, 2024Updated last year
- ☆12Jul 28, 2017Updated 8 years ago
- Template for my talks☆10May 1, 2018Updated 8 years ago