intuit / QuickFabricLinks
A one-stop shop for all management and monitoring of Amazon Elastic Map Reduce (EMR) clusters across different AWS accounts and purposes.
☆35Updated 2 years ago
Alternatives and similar repositories for QuickFabric
Users that are interested in QuickFabric are comparing it to the libraries listed below
Sorting:
- costBuddy will gather cost information from multiple AWS accounts and generate a nice Grafana dashboard with alerting in place.☆118Updated 3 years ago
- kinesis-kafka-connector is connector based on Kafka Connect to publish messages to Amazon Kinesis streams or Amazon Kinesis Firehose.☆155Updated last year
- Web UI for Amazon Athena☆56Updated 2 years ago
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e…☆107Updated last week
- Reference Architectures for Datalakes on AWS☆79Updated 5 years ago
- Workshop - Using AWS Lake Formation ML Transforms to cleanse the data in a data lake☆14Updated 5 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆75Updated 6 years ago
- EMR Hudi Workshop content☆12Updated 3 years ago
- Replication utility for AWS Glue Data Catalog☆79Updated 10 months ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Updated last year
- Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the…☆242Updated 2 weeks ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue☆85Updated 2 years ago
- ☆73Updated last year
- A self-paced workshop designed to allow you to get hands on with building a real-time data platform using serverless technologies such as…☆22Updated 6 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆135Updated 2 years ago
- Continuously synchronize directories from remote object store to local filesystem☆105Updated 4 months ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆157Updated 2 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆64Updated last year
- Manage your Kafka ACL at scale☆369Updated last year
- Kafka Configuration Provider for AWS Secrets Manager☆23Updated 2 years ago
- Sample Apache Flink application that can be deployed to Kinesis Analytics for Java. It reads taxi events from a Kinesis data stream, proc…☆86Updated last year
- Spark ETL example processing New York taxi rides public dataset on EKS☆45Updated 2 years ago
- ☆53Updated last year
- This GitHub project provides a series of lab exercises which help users get started using the Redshift platform.☆53Updated 4 years ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆222Updated 3 months ago
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆51Updated last year
- Amazon Redshift Advanced Monitoring☆272Updated 2 years ago
- ☆21Updated 7 years ago
- ☆40Updated 3 months ago
- Samples to help you get started with the Amazon Redshift Data API☆73Updated last year