tokern / lakecli
A CLI to manage and monitor permissions in AWS Lake Formation
☆27 · Updated 2 years ago
Alternatives and similar repositories for lakecli:
Users who are interested in lakecli are comparing it to the libraries listed below:
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆84 · Updated 2 years ago
- AWS Quick Start Team ☆18 · Updated 6 months ago
- ☆72 · Updated 10 months ago
- Reference Architectures for Datalakes on AWS ☆79 · Updated 4 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics ☆64 · Updated last year
- ☆30 · Updated last year
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 · Updated last year
- ☆22 · Updated 4 years ago
- Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on premise location into an Amaz… ☆28 · Updated 5 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep… ☆19 · Updated 4 years ago
- A tool to learn JSON schema from collection of documents and generate Create table statement for Redshift ☆20 · Updated 5 months ago
- ☆19 · Updated 6 months ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time ☆34 · Updated 2 years ago
- Web UI for Amazon Athena ☆56 · Updated 2 years ago
- An open-source framework that simplifies implementation of data solutions. ☆131 · Updated this week
- Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3 ☆23 · Updated 7 months ago
- ☆23 · Updated 6 months ago
- Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the… ☆243 · Updated last month
- 🐋 Docker image for AWS Glue Spark/Python ☆23 · Updated last year
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats ☆29 · Updated 2 years ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data … ☆21 · Updated last year
- ☆88 · Updated last year
- ☆27 · Updated 4 years ago
- A Java application that replays events that are stored in objects in Amazon S3 into an Amazon Kinesis stream as if they occurred in real t… ☆51 · Updated 3 months ago
- ☆53 · Updated last year
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆34 · Updated last month
- Build DataOps platform with Apache Airflow and dbt on AWS ☆55 · Updated 3 years ago
- Framework to enforce long term health of your AWS Data Lake by providing visibility into operational, data quality and business metrics. ☆17 · Updated 3 years ago
- Sample Apache Beam pipeline that can be deployed to Amazon Managed Service for Apache Flink. It reads taxi events from a Kinesis data str… ☆47 · Updated last year
- Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate ☆37 · Updated last week