ryandawsonuk / data-platforms-tools
Guide to data platforms and tools
☆32 · Updated 3 years ago
Alternatives and similar repositories for data-platforms-tools
Users who are interested in data-platforms-tools are comparing it to the repositories listed below.
- Example project using DBT, Databricks and AdventureWorks sample database ☆12 · Updated 2 years ago
- A curated list of awesome Databricks resources, including Spark ☆19 · Updated 11 months ago
- A curated list of data engineering tools for software developers ☆10 · Updated 6 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution ☆19 · Updated 4 years ago
- Hadoop/Hive/Spark container to perform CI tests ☆11 · Updated 4 years ago
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more... ☆21 · Updated this week
- Data Mesh Architecture ☆78 · Updated 11 months ago
- Source code for 'BigQuery for Data Warehousing' by Mark Mucchetti ☆16 · Updated 4 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆44 · Updated 2 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep… ☆20 · Updated 5 years ago
- Automatically loads new partitions in AWS Athena ☆19 · Updated 4 years ago
- NiFi Processor for Apache Pulsar ☆10 · Updated 6 months ago
- DataHub on AWS demonstration resources ☆10 · Updated 2 years ago
- Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of dat… ☆31 · Updated 2 years ago
- ☆11 · Updated last year
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium ☆27 · Updated 4 years ago
- M3D Engine is a Spark application for the development of scalable data transformations and ingestions in data lakes. ☆18 · Updated 4 years ago
- Cassandra + Spark = ❤️ Machine Learning with Apache Spark & Cassandra ☆20 · Updated 3 years ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR ☆66 · Updated 3 years ago
- A Thinnest Viable Platform (TVP) as described in Team Topologies, using just a Wiki page for a data platform. ☆16 · Updated 4 years ago
- Practical Model-Driven Enterprise Architecture, published by Packt ☆39 · Updated 2 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and… ☆28 · Updated 2 years ago
- Code examples for the book ☆35 · Updated last year
- A decision tree to help you decide on the right AWS compute service for your needs. ☆30 · Updated 3 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde… ☆16 · Updated 2 years ago
- Hortonworks Data Platform Retail Analytics Demo ☆13 · Updated 8 years ago
- Jupyter Notebooks with Snowpark ☆15 · Updated 3 years ago
- Pipeline library for StreamSets Data Collector and Transformer ☆33 · Updated 2 years ago
- ☆14 · Updated last year
- The Open Group Architecture Framework (Togaf) ☆63 · Updated 5 years ago