segmentio / terraform-segment-data-lakes
Terraform modules which create AWS resources for a Segment Data Lake.
☆37 · Updated last month
Alternatives and similar repositories for terraform-segment-data-lakes:
Users who are interested in terraform-segment-data-lakes are comparing it to the libraries listed below:
- A CLI to manage and monitor permissions in AWS Lake Formation ☆26 · Updated last year
- Web UI for Amazon Athena ☆55 · Updated 2 years ago
- A tool to learn a JSON schema from a collection of documents and generate a CREATE TABLE statement for Redshift ☆19 · Updated 3 months ago
- Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the… ☆240 · Updated this week
- Deploy Presto on the cloud easily, using Terraform and Packer ☆44 · Updated last year
- A fully-featured AWS Athena database driver (+ athenareader https://github.com/uber/athenadriver/tree/master/athenareader) ☆156 · Updated 4 months ago
- Continuously synchronize directories from remote object store to local filesystem ☆101 · Updated last week
- A CLI and library to run Singer Taps and Targets ☆34 · Updated 2 years ago
- Terraform module to create AWS Redshift resources 🇺🇦 ☆82 · Updated 3 months ago
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 · Updated last year
- AWS Quick Start Team ☆60 · Updated 3 months ago
- Faker for Snowflake! ☆33 · Updated 2 years ago
- CloudFormation templates for deploying Airflow in ECS ☆40 · Updated 6 years ago
- Provider for AWS Redshift entities, e.g. Users, Groups, Permissions, Schemas, Databases ☆47 · Updated 2 years ago
- Opinionated serverless event analytics pipeline ☆43 · Updated last year
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆41 · Updated last week
- ☆67 · Updated 7 months ago
- CLENCLI enables you to quickly and predictably create, change, and improve your cloud projects. It is an open source tool that simplifies… ☆59 · Updated 2 years ago
- A packaged Data Lake solution that builds a highly functional Data Lake, with a data catalog queryable via Elasticsearch ☆73 · Updated 4 years ago
- Terraform Provider for Fivetran ☆41 · Updated this week
- pg2kinesis uses logical decoding in Postgres 9.4 or later to capture a consistent, continuous stream of events from the database and publ… ☆59 · Updated 2 years ago
- Terraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker… ☆84 · Updated 2 years ago
- Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake ☆81 · Updated last year
- Reference Architectures for Datalakes on AWS ☆79 · Updated 4 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS ☆44 · Updated 2 years ago
- Terraform Provider for Airbyte using the new Terraform Plugin Framework ☆19 · Updated last year
- kinesis-kafka-connector is a connector based on Kafka Connect to publish messages to Amazon Kinesis streams or Amazon Kinesis Firehose. ☆154 · Updated last year