aws-samples / spark-streaming-sql-s3-connector
An Apache Spark Structured Streaming S3 connector for reading S3 files using Amazon S3 event notifications to AWS SQS
☆15 Updated last year
Alternatives and similar repositories for spark-streaming-sql-s3-connector
Users who are interested in spark-streaming-sql-s3-connector are comparing it to the libraries listed below.
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a… ☆227 Updated 10 months ago
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e… ☆110 Updated last week
- Example code for running Spark and Hive jobs on EMR Serverless. ☆168 Updated last year
- ☆25 Updated last year
- Application to securely map users on a multi tenant Amazon EMR cluster to different IAM Roles and then assume the mapped Role. ☆24 Updated 2 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS ☆44 Updated 3 years ago
- Performant Redshift data source for Apache Spark ☆141 Updated 3 weeks ago
- ☆75 Updated 2 years ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆90 Updated 3 years ago
- ☆13 Updated last year
- Docker image for running Spark 3 on Kubernetes on AWS ☆26 Updated 4 years ago
- ☆42 Updated last month
- Spark runtime on AWS Lambda ☆113 Updated 5 months ago
- Best practices and recommendations for getting started with Amazon EMR on EKS. ☆67 Updated 2 weeks ago
- Spark Structured Streaming Kinesis Data Streams connector supports both GetRecords and SubscribeToShard (Enhanced Fan-Out, EFO) ☆39 Updated last week
- Example applications in Java, Python and SQL for Kinesis Data Analytics, demonstrating sources, sinks, and operators. ☆147 Updated last year
- Project to concentrate files and settings for AWS EMR monitoring. Source: https://aws.amazon.com/blogs/big-data/monitor-and-optimize-anal… ☆15 Updated last year
- AWS Glue Libraries are additions and enhancements to Spark for ETL operations. ☆696 Updated 3 weeks ago
- Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB ☆228 Updated 3 weeks ago
- ☆25 Updated 2 years ago
- This repository contains the dbt-glue adapter ☆141 Updated last month
- Amazon Redshift Advanced Monitoring ☆273 Updated 3 months ago
- ☆20 Updated 2 years ago
- ☆23 Updated 11 months ago
- ☆25 Updated 2 years ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark… ☆149 Updated 2 weeks ago
- Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs ☆28 Updated this week
- Collection of code examples for Amazon Managed Service for Apache Flink ☆87 Updated 3 weeks ago
- MCP Server for Apache Spark History Server. The bridge between Agentic AI and Apache Spark. ☆128 Updated last week
- This repository has moved into https://github.com/dbt-labs/dbt-adapters ☆250 Updated last year