awslabs / amazon-redshift-json-schema-induction
A tool to learn JSON schema from collection of documents and generate Create table statement for Redshift
☆19Updated 4 months ago
Alternatives and similar repositories for amazon-redshift-json-schema-induction:
Users that are interested in amazon-redshift-json-schema-induction are comparing it to the libraries listed below
- ☆18Updated 3 years ago
- ☆22Updated 4 years ago
- ☆66Updated 8 months ago
- Glue Python Shell Job that adds AWS Organizations account tags to Cost and Usage Reports. You can submit feedback & requests for changes…☆16Updated 3 years ago
- Framework to enforce long term health of your AWS Data Lake by providing visibility into operational, data quality and business metrics.☆17Updated 3 years ago
- ☆31Updated 11 months ago
- ☆53Updated last year
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on premise location into an Amaz…☆28Updated 5 years ago
- Web UI for Amazon Athena☆56Updated 2 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆26Updated 2 years ago
- The Automated Data Analytics on AWS solution provides an end-to-end data platform for ingesting, transforming, managing and querying data…☆89Updated 4 months ago
- A Data Platform built for AWS, powered by Kubernetes.☆127Updated last year
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- This github repo contains Aurora MySQL and PostgreSQL Labs, Aurora Serverless Lab and Heterogeneous database migration with DMS Labs.☆31Updated last year
- Data Pipeline for CDC data from MySQL DB to Amazon OpenSearch Service through Amazon Kinesis using Amazon Data Migration Service(DMS).☆30Updated 3 weeks ago
- Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3☆21Updated 5 months ago
- Reference Architectures for Datalakes on AWS☆79Updated 4 years ago
- AWS Quick Start Team☆18Updated 4 months ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated last year
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data …☆21Updated last year
- ☆19Updated 4 months ago
- ☆26Updated 4 years ago
- ☆22Updated 4 months ago
- ☆48Updated 4 years ago
- Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS☆27Updated 3 months ago
- ☆26Updated 6 months ago
- Glue scripts for converting AWS Service Logs for use in Athena☆142Updated last year
- Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate☆36Updated 2 weeks ago
- A tool to automate analytic platform evaluations. Barometer helps customers to get data points needed for service selection/service confi…☆18Updated 8 months ago
- Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight'☆28Updated 4 years ago