segmentio / terraform-segment-data-lakes
Terraform modules which create AWS resources for a Segment Data Lake.
☆37 · Updated 4 months ago
Alternatives and similar repositories for terraform-segment-data-lakes:
Users who are interested in terraform-segment-data-lakes are comparing it to the repositories listed below.
- Provider for AWS Redshift entities, eg Users, Groups, Permissions, Schemas, Databases ☆47 · Updated 3 years ago
- Terraform module to create AWS Redshift resources 🇺🇦 ☆87 · Updated 3 weeks ago
- A tool to learn JSON schema from collection of documents and generate Create table statement for Redshift ☆20 · Updated 6 months ago
- A CLI to manage and monitor permissions in AWS Lake Formation ☆26 · Updated 2 years ago
- ☆73 · Updated 10 months ago
- Command-line app for tracking Snowplow events. Add analytics to your shell scripts and terminal sessions ☆9 · Updated last year
- Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the… ☆243 · Updated last month
- pg2kinesis uses logical decoding in Postgres 9.4 or later to capture a consistent, continuous stream of events from the database and publ… ☆59 · Updated 2 years ago
- Presto-like CLI tool for AWS Athena ☆84 · Updated 2 years ago
- Utility to create AWS Step Function activities out of command line programs ☆16 · Updated this week
- Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake ☆81 · Updated last year
- Web UI for Amazon Athena ☆56 · Updated 2 years ago
- A fully-featured AWS Athena database driver (+ athenareader https://github.com/uber/athenadriver/tree/master/athenareader) ☆159 · Updated 3 months ago
- Shoprunner Terraform provider - Open Source initiative ☆37 · Updated 5 years ago
- A CLI and library to run Singer Taps and Targets ☆34 · Updated 3 years ago
- Cloudformation templates for deploying Airflow in ECS ☆40 · Updated 6 years ago
- Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate ☆37 · Updated last week
- CLENCLI enables you to quickly and predictably create, change, and improve your cloud projects. It is an open source tool that simplifies… ☆59 · Updated 2 years ago
- Bring AWS SSO-based credentials to the AWS SDKs until they have proper support ☆47 · Updated 4 years ago
- ☆22 · Updated 4 years ago
- Glue scripts for converting AWS Service Logs for use in Athena ☆141 · Updated last year
- Mirrors a Kinesis stream to Amazon S3 using the KCL ☆42 · Updated 7 months ago
- Friendly CLI for Amazon Kinesis Data Streams ☆57 · Updated 4 years ago
- Reference Architectures for Datalakes on AWS ☆79 · Updated 4 years ago
- Reference implementation for granting cross-account AWS Glue Data Catalog access from Amazon Athena ☆30 · Updated 2 years ago
- kinesis-kafka-connector is connector based on Kafka Connect to publish messages to Amazon Kinesis streams or Amazon Kinesis Firehose. ☆155 · Updated last year
- Bring your own data Labs: Build a serverless data pipeline based on your own data ☆44 · Updated last year
- Data Pipeline for CDC data from MySQL DB to Amazon OpenSearch Service through Amazon Kinesis using Amazon Data Migration Service(DMS). ☆32 · Updated 3 months ago
- Library and worker to handle transfer of data in s3 into redshift. Includes table creation and manipulation, as well as time-based insert… ☆61 · Updated 2 years ago
- Terraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker… ☆84 · Updated 2 years ago