gwenshap / lambda_s3_kafkaLinks
AWS Lambda function to get events in Kafka topic when files are uploaded to S3
☆24Updated 6 years ago
Alternatives and similar repositories for lambda_s3_kafka
Users that are interested in lambda_s3_kafka are comparing it to the libraries listed below
Sorting:
- Cloudbox Labs blog code☆35Updated 6 years ago
- ☆10Updated 7 years ago
- Some AWS EMR examples☆16Updated 7 years ago
- Examples of using the DataStax Apache Kafka Connector.☆46Updated last year
- Spark stream from kafka(json) to s3(parquet)☆15Updated 6 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 5 months ago
- Real-world Spark pipelines examples☆83Updated 7 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆14Updated 2 years ago
- Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.☆72Updated last year
- Sample files for Pinot tutorial☆18Updated last year
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 6 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆51Updated last week
- Kafka Examples repository.☆44Updated 6 years ago
- These are some code examples☆55Updated 5 years ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated 2 years ago
- A tutorial on how to get started with Presto.☆56Updated 3 years ago
- An example PySpark project with pytest☆16Updated 7 years ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- A collection of examples to help show different ways to managing state in Apache Flink☆27Updated 6 years ago
- Solving data streaming problems using joins, processor, punctuator and state store☆11Updated 4 years ago
- ☆58Updated 10 months ago
- Delta Lake Examples☆12Updated 5 years ago
- Real-time anomaly detection using Kafka, KSQL User Defined Function and a pre-trained model☆30Updated last year
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 3 years ago
- Friendly ML feature store☆45Updated 3 years ago
- A sample implementation of the Spark Datasource API☆24Updated 8 years ago
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Updated 2 years ago
- Interactive Notebooks that support the book☆40Updated 4 years ago