fabiogouw / spark-aws-messagingLinks
A custom sink provider for Apache Spark that sends the content of a dataframe to an AWS SQS
☆23Updated last year
Alternatives and similar repositories for spark-aws-messaging
Users that are interested in spark-aws-messaging are comparing it to the libraries listed below
Sorting:
- Example code for running Spark and Hive jobs on EMR Serverless.☆168Updated 11 months ago
- Resources for video demonstrations and blog posts related to DataOps on AWS☆182Updated 3 years ago
- ☆269Updated last year
- Spark runtime on AWS Lambda☆113Updated 3 months ago
- Apache flink☆74Updated 5 months ago
- ☆107Updated 11 months ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆226Updated 8 months ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 3 years ago
- ☆63Updated last year
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Updated 3 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆61Updated 2 years ago
- This repository has moved into https://github.com/dbt-labs/dbt-adapters☆251Updated 10 months ago
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆185Updated last week
- Automated data quality suggestions and analysis with Deequ on AWS Glue☆90Updated 2 years ago
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e…☆109Updated 2 months ago
- Java SDK for the Snowflake Ingest Service -☆79Updated 3 weeks ago
- BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.☆411Updated last week
- A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs☆45Updated last year
- Delta Lake examples☆235Updated last year
- This repository contains the dbt-glue adapter☆138Updated this week
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆39Updated 9 months ago
- Glue VSCode devcontainer setup☆14Updated 2 years ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆181Updated 2 years ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆143Updated 4 months ago
- Python API for Deequ☆806Updated 8 months ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆67Updated 3 years ago
- ☆81Updated 7 months ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆125Updated last month
- Airflow Deployment on AWS ECS Fargate Using Cloudformation☆206Updated 3 years ago
- Creates a Simulation of Fake Web Events☆85Updated 3 years ago