iconara / athena-guide-content
Content for the Athena Guide (https://athena.guide)
☆10Updated 5 months ago
Alternatives and similar repositories for athena-guide-content:
Users that are interested in athena-guide-content are comparing it to the libraries listed below
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- A tool to learn JSON schema from collection of documents and generate Create table statement for Redshift☆20Updated 6 months ago
- A Python sampling profiler for AWS Lambda functions (and not only).☆12Updated 3 years ago
- Sample code supporting the `Generating REST APIs from data classes in Python` blog post☆11Updated 11 months ago
- Amazon EMR on EKS Custom Image CLI☆31Updated 6 months ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- A template for an AWS Lambda function that triggers Prefect Flow Runs☆20Updated 3 years ago
- Python utility for caching in Lambda Functions☆38Updated 4 years ago
- Automatically loads new partitions in AWS Athena☆18Updated 4 years ago
- Parquet file management in S3 for Athena / Spectrum / Presto partitioning☆22Updated 2 months ago
- ☆30Updated last year
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Python implementation of Age-Partitioned Bloom Filter with S3 periodic backup support.☆11Updated 3 months ago
- ☆11Updated 8 months ago
- Data Pipeline for CDC data from MySQL DB to Amazon OpenSearch Service through Amazon Kinesis using Amazon Data Migration Service(DMS).☆32Updated 3 months ago
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS …☆19Updated 3 years ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Updated 2 years ago
- Using the Parquet file format with Python☆15Updated last year
- A CLI to manage and monitor permissions in AWS Lake Formation☆26Updated 2 years ago
- In this pattern, data records are ingested and then modified with simple transformations such as field level substitutions and data enric…☆12Updated 6 years ago
- Code to be contributed to the Apache Airflow (incubating) project for ETL workflow management for integrating with the Snowflake Data War…☆25Updated 7 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- Streaming ETL job cases in AWS Glue to integrate Iceberg and creating an in-place updatable data lake on Amazon S3☆23Updated 7 months ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Updated last year
- ☆22Updated 4 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- A cookiecutter template to create AWS Lambda function☆23Updated 6 years ago
- An extension to the Amazon SQS client that enables sending and receiving messages up to 2GB via Amazon S3.☆44Updated 7 months ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated last year
- Sample Apache Beam pipeline that can be deployed to Amazon Managed Service for Apache Flink. It reads taxi events from a Kinesis data str…☆47Updated last year