awslabs / amazon-redshift-json-schema-induction
A tool to learn JSON schema from collection of documents and generate Create table statement for Redshift
☆20Updated 6 months ago
Alternatives and similar repositories for amazon-redshift-json-schema-induction:
Users that are interested in amazon-redshift-json-schema-induction are comparing it to the libraries listed below
- Glue Python Shell Job that adds AWS Organizations account tags to Cost and Usage Reports. You can submit feedback & requests for changes…☆16Updated 4 years ago
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on premise location into an Amaz…☆28Updated 5 years ago
- This github repo contains Aurora MySQL and PostgreSQL Labs, Aurora Serverless Lab and Heterogeneous database migration with DMS Labs.☆31Updated 2 years ago
- ☆18Updated 3 years ago
- Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate☆37Updated 2 weeks ago
- ☆73Updated 11 months ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆26Updated 2 years ago
- Web UI for Amazon Athena☆56Updated 2 years ago
- ☆53Updated last year
- Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3☆23Updated 7 months ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated last year
- Reference Architectures for Datalakes on AWS☆79Updated 4 years ago
- ☆31Updated last year
- The AWS Innovation Sandbox solution provisions isolated, self-contained, environments to help developers, security professionals, and inf…☆29Updated 11 months ago
- Cloudformation and SQL scripts used to replicate a POC environment from the "Data Lake to Data Warehouse: Enhancing Customer 360 with Ama…☆31Updated 5 years ago
- Data Pipeline for CDC data from MySQL DB to Amazon OpenSearch Service through Amazon Kinesis using Amazon Data Migration Service(DMS).☆32Updated 3 months ago
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents.☆24Updated 11 months ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data …☆22Updated last year
- ☆12Updated last year
- his solution helps customers more easily manage their fleet of servers, automate software inventory management, OS patch compliance, and …☆29Updated last year
- A tool to automate analytic platform evaluations. Barometer helps customers to get data points needed for service selection/service confi…☆19Updated 11 months ago
- ☆22Updated 4 years ago
- This repository shows how to setup Centralized CloudWatch Observability Manager using Terraform☆17Updated 5 months ago
- Tool for generating CodePipeline pipelines and related resources from a simple configuration.☆11Updated 4 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Updated last year
- Glue scripts for converting AWS Service Logs for use in Athena☆141Updated last year
- Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS☆32Updated 2 months ago
- secrets-helper helps you use AWS Secrets Manager to secure the use of CLI tools☆18Updated 4 years ago
- 🌉 Reference implementation for granting cross-account AWS Glue Data Catalog access from Amazon Athena☆30Updated 2 years ago