CityofPittsburgh / data-rivers
Apache Airflow and Beam ETL scripts for the City of Pittsburgh's data analysis pipelines
☆10 · Updated 4 months ago
Alternatives and similar repositories for data-rivers:
Users who are interested in data-rivers compare it to the repositories listed below.
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from… ☆30 · Updated this week
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning ☆41 · Updated this week
- This extension makes vscode seamlessly work with dbt and bigquery ☆14 · Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆49 · Updated last year
- A Singer.io Target for Snowflake ☆11 · Updated last year
- Full stack data engineering tools and infrastructure set-up ☆47 · Updated 3 years ago
- Utility functions for dbt projects running on Spark ☆31 · Updated last year
- A modern ELT demo using airbyte, dbt, snowflake and dagster ☆25 · Updated 2 years ago
- `tap-rest-api-msdk` is a Singer tap for generic rest-apis, built with the Meltano SDK for Singer Taps. ☆24 · Updated 3 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆62 · Updated 3 months ago
- Analytics Engineering best practices and standards used at Hiflylabs ☆12 · Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow. ☆23 · Updated 9 months ago
- Make dbt great again! Enables end user to extend dbt to his/her needs ☆42 · Updated last month
- Evaluation Matrix for Change Data Capture ☆24 · Updated 5 months ago
- Example orchestration pipeline for Fivetran + dbt managed by Airflow ☆21 · Updated 3 years ago
- An integration for dbt and fzf that allows interactive selection and search of dbt models. ☆67 · Updated last year
- Repo for orienting dbt users to the Dagster asset framework ☆51 · Updated 2 years ago
- ☆31 · Updated 3 weeks ago
- Notes that I should one day turn into a blog or something ... ☆30 · Updated this week
- A framework to manage data, continuously ☆31 · Updated last week
- ☆53 · Updated 6 months ago
- Apache Flink (Pyflink) and Related Projects ☆29 · Updated 7 months ago
- Code for my "Efficient Data Processing in SQL" book. ☆54 · Updated 5 months ago
- Configure and enforce conventions for your dbt project. ☆36 · Updated this week
- All the basics to get a nice containerized dbt development environment ☆57 · Updated 2 years ago
- From the SELECT team, a dbt package to automatically tag dbt-issued queries with informative metadata. ☆44 · Updated 2 months ago
- Run dbt serverless in the Cloud (AWS) ☆41 · Updated 5 years ago
- A collection of utilities and tools for teams and organizations using dbt ☆13 · Updated last year