aws-samples/data-lake-as-code

Data Lake as Code, featuring ChEMBL and OpenTargets

☆173

Alternatives and similar repositories for data-lake-as-code

Users who are interested in data-lake-as-code compare it to the repositories listed below.

aws-samples / aws-lakeformation-datasharing-workflow
☆15 · Feb 12, 2026 · Updated 5 months ago
aws-samples / aws-cdk-pipelines-datalake-infrastructure
This solution helps you deploy Data Lake Infrastructure on AWS using CDK Pipelines.
☆101 · Aug 12, 2022 · Updated 3 years ago
paulu-aws / biotechblueprint
AWS Biotech Blueprint Multi-Account
☆12 · Jul 22, 2020 · Updated 6 years ago
aws-samples / aws-cdk-pipelines-datalake-etl
This solution helps you deploy ETL jobs on data lake using CDK Pipelines.
☆69 · Aug 9, 2022 · Updated 3 years ago
aws-solutions-library-samples / data-lakes-on-aws
Enterprise-grade, production-hardened, serverless data lake on AWS
☆482 · Oct 1, 2025 · Updated 9 months ago
aws-solutions / aws-data-lake-solution
A deployable reference implementation intended to address pain points around conceptualizing data lake architectures that automatically c…
☆399 · Jun 3, 2024 · Updated 2 years ago
aws-samples / aws-building-data-lake-reinvent-session-stg206
Collection of CloudFormation templates, Lambda scripts and sample code required to provision an AWS Data Lake for a re:Invent Lab Exercis…
☆26 · Apr 9, 2019 · Updated 7 years ago
awslabs / aws-ddk
An open source development framework to help you build data workflows and modern data architecture on AWS.
☆271 · Feb 9, 2026 · Updated 5 months ago
aws-samples / aws-analytics-reference-architecture
☆157 · Feb 29, 2024 · Updated 2 years ago
aws-samples / aws-lakeformation-ml-transforms
Workshop - Using AWS Lake Formation ML Transforms to cleanse the data in a data lake
☆14 · Dec 22, 2019 · Updated 6 years ago
Aiven-Labs / demo-opensearch-python
This repository contains code examples showing how to write search queries with the OpenSearch Python client
☆10 · Sep 20, 2023 · Updated 2 years ago
aws-samples / data-purging-aws-data-lake
☆22 · Jul 14, 2020 · Updated 6 years ago
aws-samples / biotech-blueprint-multi-account
The AWS Biotech Blueprint Multi Account is a landing zone for life sciences startups looking to build well architected research environme…
☆36 · Jan 19, 2022 · Updated 4 years ago
aws-samples / amazon-omics-end-to-end-genomics
☆19 · Mar 29, 2023 · Updated 3 years ago
awslabs / amazon-s3-find-and-forget
Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the…
☆247 · Jul 15, 2026 · Updated last week
aws-samples / aws-codeguru-profiler-demo-application
Example application demonstrating the features of Amazon CodeGuru Profiler
☆24 · Dec 19, 2025 · Updated 7 months ago
tokern / lakecli
A CLI to manage and monitor permissions in AWS Lake Formation
☆25 · Feb 8, 2023 · Updated 3 years ago
Study-Tracker / Study-Tracker
Study management software for research organizations.
☆29 · Jan 22, 2026 · Updated 6 months ago
aws-samples / faropt
☆16 · Nov 12, 2024 · Updated last year
aws-samples / serverless-data-analytics
CloudFormation templates and scripts to set up the AWS services for the workshop, Athena & Redshift Spectrum queries
☆177 · Apr 28, 2020 · Updated 6 years ago
awslabs / aws-healthomics-tools
☆43 · Updated this week
aws-samples / aws-genomics-workflows
Genomics Workflows on AWS
☆148 · Jul 31, 2023 · Updated 2 years ago
aws-samples / eventbridge-salesforce-integration
☆12 · Aug 12, 2022 · Updated 3 years ago
aws-samples / aws-cross-account-cicd-pipeline
Example of how to use CDK to create a CodePipeline CI/CD pipeline, and how to configure it to deploy resources on different AWS Accounts.
☆111 · Mar 25, 2025 · Updated last year
databricks / genomics-pipelines
Secondary analysis pipelines parallelized with Apache Spark
☆18 · Mar 2, 2022 · Updated 4 years ago
aws-samples / bring-your-own-data-labs
Bring your own data Labs: Build a serverless data pipeline based on your own data
☆43 · May 22, 2023 · Updated 3 years ago
aws-samples / open-on-demand-on-aws
☆24 · Nov 19, 2025 · Updated 8 months ago
r7kamura / api-gateway-lambda-example
An example application to integrate Amazon API Gateway and AWS Lambda.
☆12 · Aug 5, 2015 · Updated 10 years ago
aws-solutions / mlops-workload-orchestrator
The MLOps Workload Orchestrator solution helps you streamline and enforce architecture best practices for machine learning (ML) model pro…
☆156 · Jun 10, 2025 · Updated last year
aws-samples / amazon-sagemaker-mlops-byoc-using-codepipeline-aws-cdk
Sample solution to build a deployment pipeline for Amazon SageMaker.
☆14 · Jul 18, 2022 · Updated 4 years ago
aws-samples / amazon-sagemaker-cdk-examples
amazon-sagemaker-cdk-examples uses AWS CDK to simplify common architectures in machine learning operations using SageMaker and other AWS s…
☆69 · Mar 28, 2024 · Updated 2 years ago
rickhw / rickhw.github.io
Complete Think
☆12 · Updated this week
aws-samples / datamesh-ui
☆32 · Feb 29, 2024 · Updated 2 years ago
aws-samples / amazon-honeycode-quicksight-integration-sample
Extract data from your Amazon Honeycode apps using Honeycode API and AWS Lambda functions, and write it to Amazon S3. Data stored in S3 c…
☆13 · Dec 5, 2023 · Updated 2 years ago
aws-samples / amazon-quicksight-sdk-proserve
☆73 · Nov 10, 2023 · Updated 2 years ago
aws-samples / aws-ml-data-lake-workshop
As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges i…
☆63 · Nov 28, 2018 · Updated 7 years ago
ontodev / rdftab.rs
RDF Tables in Rust
☆17 · Aug 26, 2022 · Updated 3 years ago
aws-solutions / performance-dashboard-on-aws
A simple, cost-effective web application to build and publish dashboards.
☆175 · Nov 22, 2024 · Updated last year
IBPA / DeepPep
Deep proteome inference from peptide profiles
☆13 · Jul 16, 2020 · Updated 6 years ago