awslabs / amazon-redshift-json-schema-induction
A tool to learn JSON schema from collection of documents and generate Create table statement for Redshift
☆19Updated 5 months ago
Alternatives and similar repositories for amazon-redshift-json-schema-induction:
Users that are interested in amazon-redshift-json-schema-induction are comparing it to the libraries listed below
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- Glue Python Shell Job that adds AWS Organizations account tags to Cost and Usage Reports. You can submit feedback & requests for changes…☆16Updated 4 years ago
- ☆69Updated 9 months ago
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on premise location into an Amaz…☆28Updated 5 years ago
- ☆53Updated last year
- Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3☆22Updated 6 months ago
- This github repo contains Aurora MySQL and PostgreSQL Labs, Aurora Serverless Lab and Heterogeneous database migration with DMS Labs.☆31Updated 2 years ago
- Stream CDC into an Amazon S3 data lake in Apache Iceberg table format with AWS Glue Streaming and DMS☆30Updated last month
- A CLI to manage and monitor permissions in AWS Lake Formation☆26Updated 2 years ago
- ☆31Updated last year
- Framework to enforce long term health of your AWS Data Lake by providing visibility into operational, data quality and business metrics.☆17Updated 3 years ago
- ☆26Updated 7 months ago
- A tool to automate analytic platform evaluations. Barometer helps customers to get data points needed for service selection/service confi…☆18Updated 9 months ago
- A collection of examples built with AWS DataOps Development Kit (DDK)☆41Updated last month
- Reference Architectures for Datalakes on AWS☆79Updated 4 years ago
- The AWS Innovation Sandbox solution provisions isolated, self-contained, environments to help developers, security professionals, and inf…☆29Updated 9 months ago
- A Data Platform built for AWS, powered by Kubernetes.☆127Updated last year
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Updated last year
- Test IAM Policies in Multi Account Structures in your CI/CD pipeline☆29Updated 3 years ago
- ☆18Updated 3 years ago
- ☆11Updated 5 months ago
- Automate AWS lambda functions migration across account using CloudFormation☆12Updated 4 years ago
- Web UI for Amazon Athena☆56Updated 2 years ago
- Tool for generating CodePipeline pipelines and related resources from a simple configuration.☆11Updated 4 years ago
- This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines.☆94Updated 2 years ago
- ☆27Updated 4 years ago
- boto3 response formatter☆26Updated 10 months ago
- Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate☆37Updated last month
- An open-source framework that simplifies implementation of data solutions.☆128Updated this week
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents.☆24Updated 10 months ago