roncemer / spark-sql-kinesis
Kinesis Connector for Spark Structured Streaming
☆11Updated last year
Alternatives and similar repositories for spark-sql-kinesis:
Users that are interested in spark-sql-kinesis are comparing it to the libraries listed below
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- Example code for running Spark and Hive jobs on EMR Serverless.☆157Updated last week
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆17Updated 5 months ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆210Updated 8 months ago
- Spark Structured Streaming Kinesis Data Streams connector supports both GetRecords and SubscribeToShard (Enhanced Fan-Out, EFO)☆28Updated last month
- This repository contains the dbt-glue adapter☆107Updated last week
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e…☆102Updated last month
- A Python Library to support running data quality rules while the spark job is running⚡☆167Updated last week
- 🐋 Docker image for AWS Glue Spark/Python☆23Updated last year
- Kinesis Connector for Structured Streaming☆136Updated 6 months ago
- Performant Redshift data source for Apache Spark☆137Updated this week
- Spark runtime on AWS Lambda☆104Updated 4 months ago
- A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs☆41Updated 8 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆192Updated last month
- PySpark data-pipeline testing and CICD☆28Updated 4 years ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue☆84Updated 2 years ago
- Apache flink