awslabs / amazon-redshift-json-schema-induction
A tool that learns a JSON schema from a collection of documents and generates a CREATE TABLE statement for Redshift
☆21 Updated 7 months ago
Alternatives and similar repositories for amazon-redshift-json-schema-induction
Users that are interested in amazon-redshift-json-schema-induction are comparing it to the libraries listed below
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on premise location into an Amaz… ☆28 Updated 5 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep… ☆20 Updated 5 years ago
- ☆18 Updated 3 years ago
- This GitHub repo contains Aurora MySQL and PostgreSQL Labs, Aurora Serverless Lab and Heterogeneous database migration with DMS Labs. ☆31 Updated 2 years ago
- ☆73 Updated last year
- Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3 ☆23 Updated 8 months ago
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 Updated last year
- A CLI to manage and monitor permissions in AWS Lake Formation ☆26 Updated 2 years ago
- AWS Quick Start Team ☆19 Updated 8 months ago
- Glue Python Shell Job that adds AWS Organizations account tags to Cost and Usage Reports. You can submit feedback & requests for changes… ☆16 Updated 4 years ago
- Data Pipeline for CDC data from MySQL DB to Amazon OpenSearch Service through Amazon Kinesis using AWS Database Migration Service (DMS). ☆32 Updated 4 months ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data … ☆22 Updated last year
- Web UI for Amazon Athena ☆56 Updated 2 years ago
- ☆26 Updated 9 months ago
- ☆27 Updated 4 years ago
- Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS ☆32 Updated 3 months ago
- Reference Architectures for Datalakes on AWS ☆79 Updated 5 years ago
- ☆32 Updated last year
- ☆22 Updated 4 years ago
- A Data Platform built for AWS, powered by Kubernetes. ☆148 Updated last year
- ☆12 Updated last year
- ☆53 Updated last year
- Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate ☆37 Updated last month
- The AWS Innovation Sandbox solution provisions isolated, self-contained, environments to help developers, security professionals, and inf… ☆29 Updated last year
- ☆25 Updated last year
- A tool to automate analytic platform evaluations. Barometer helps customers to get data points needed for service selection/service confi… ☆19 Updated last year
- CloudFormation and SQL scripts used to replicate a POC environment from the "Data Lake to Data Warehouse: Enhancing Customer 360 with Ama… ☆31 Updated 5 years ago
- Framework to enforce long term health of your AWS Data Lake by providing visibility into operational, data quality and business metrics. ☆30 Updated 3 years ago
- Set of sample CloudFormation documents and Systems Manager documents that show how the two services can be used together in deployments. ☆35 Updated 3 years ago
- ☆47 Updated 4 years ago