Sample configuration to deploy a modern data platform.
☆89 · Dec 28, 2021 · Updated 4 years ago
Alternatives and similar repositories for modern_data_platform
Users who are interested in modern_data_platform are comparing it to the libraries listed below
- ☆10 · May 16, 2022 · Updated 3 years ago
- This repo helps bootstrap the infrastructures with a modern data stack on Google Cloud Platform using Terraform. ☆123 · Mar 11, 2022 · Updated 3 years ago
- Playground site for creating/validating data contracts ☆11 · Aug 9, 2025 · Updated 6 months ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and… ☆28 · Mar 20, 2023 · Updated 2 years ago
- New generation opensource data stack ☆75 · May 20, 2022 · Updated 3 years ago
- A sentiment analysis project performed on data collected from Twitter mentioning the two primary contestants in the 2020 US Elections. ☆11 · Nov 1, 2020 · Updated 5 years ago
- Using Plotly to create a heatmap visualization of monthly and hourly data ☆13 · Aug 9, 2021 · Updated 4 years ago
- Python utilities for BigQuery analyses. ☆15 · Dec 10, 2020 · Updated 5 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink ☆14 · Aug 17, 2015 · Updated 10 years ago
- Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quali… ☆13 · Jul 21, 2021 · Updated 4 years ago
- ☆14 · Feb 4, 2022 · Updated 4 years ago
- Snowflake Database, Schema, and Warehouse provisioning with Access Roles & Generating and Provisioning of Functional Roles & Snowflake So… ☆50 · Nov 12, 2024 · Updated last year
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included. ☆10 · May 12, 2023 · Updated 2 years ago
- A platform-agnostic index of Singer.io taps and targets. ☆11 · Jan 29, 2021 · Updated 5 years ago
- Lightweight Streamlit app to test out metrics functionality in dbt ☆10 · Feb 22, 2022 · Updated 4 years ago
- Repo with scripts and automation to help ensure best practices in Google Data Catalog ☆13 · Feb 12, 2022 · Updated 4 years ago
- ☆10 · Sep 30, 2020 · Updated 5 years ago
- ☆12 · Feb 23, 2022 · Updated 4 years ago
- This is the repo for Bruin's Visual Studio Code extension. ☆17 · Updated this week
- A lightweight tool to fetch tables from BigQuery as pandas DataFrame very fast using BigQuery Storage API combined with multiprocessing ☆27 · Jun 16, 2023 · Updated 2 years ago
- ☆12 · Aug 3, 2024 · Updated last year
- Stack't is a small MDS-in-a box, that specializes in providing interoperability for object-centric event logs by mapping to a flexible & … ☆14 · Apr 26, 2025 · Updated 10 months ago
- ☆51 · Feb 7, 2023 · Updated 3 years ago
- Cookiecutter template for creating GitHub Actions orchestrated Meltano projects ☆27 · Feb 28, 2022 · Updated 4 years ago
- ☆14 · Oct 17, 2022 · Updated 3 years ago
- Deploy a complete data stack in just a couple of minutes. ☆15 · Mar 6, 2024 · Updated 2 years ago
- This is a guided certification project, as a part of Data Science for Social Good initiative ☆18 · Mar 9, 2020 · Updated 5 years ago
- ☆16 · Feb 17, 2026 · Updated 2 weeks ago
- Collection of dbt Tips and Tricks ☆400 · Oct 12, 2022 · Updated 3 years ago
- Schema modelling framework for decentralised domain-driven ownership of data. ☆261 · Dec 5, 2023 · Updated 2 years ago
- dbt docs but windows 95 ☆16 · Jun 7, 2022 · Updated 3 years ago
- This repository contains an example of how to leverage Cloud Composer and Cloud Dataflow to move data from a Microsoft SQL Server to BigQ… ☆19 · Jun 10, 2025 · Updated 8 months ago
- Document based data access layer framework ☆33 · Jun 27, 2025 · Updated 8 months ago
- Centralized whale instance using github actions, sourcing metadata from bigquery-public-data. ☆18 · Jun 15, 2024 · Updated last year
- Google BigQuery data source for Apache Spark ☆17 · Oct 1, 2024 · Updated last year
- Examples of real-time data visualization with Matplotlib's FuncAnimation ☆17 · Jan 25, 2021 · Updated 5 years ago
- A library to wrap Jupyter Notebook into Kubeflow component ☆16 · Dec 4, 2020 · Updated 5 years ago
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning m… ☆857 · Apr 5, 2024 · Updated last year
- datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest… ☆58 · Nov 1, 2021 · Updated 4 years ago