hakanilter / aws-emr-examples
Some AWS EMR examples
☆16 Updated 7 years ago
Alternatives and similar repositories for aws-emr-examples:
Users who are interested in aws-emr-examples are comparing it to the repositories listed below.
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark ☆13 Updated 2 years ago
- Examples of all Machine Learning Algorithms in Apache Spark ☆15 Updated 7 years ago
- Hive Storage Handler for Kinesis. ☆11 Updated 9 years ago
- Flink stream filtering examples ☆19 Updated 8 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink ☆14 Updated 9 years ago
- CloudFormation templates and scripts demonstrating how to build a promotion recommendation system using Kinesis and SageMaker. ☆28 Updated 7 years ago
- ☆10 Updated 8 years ago
- An example PySpark project with pytest ☆16 Updated 7 years ago
- Simple machine learning in Python/Tensorflow with model saving ☆14 Updated 7 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 Updated 8 years ago
- Spark stream from Kafka (JSON) to S3 (Parquet) ☆15 Updated 6 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r… ☆62 Updated last year
- SQL Windowing Functions for Hadoop ☆65 Updated 2 years ago
- Examples for Fast Data Processing with Spark ☆59 Updated 11 years ago
- A collection of examples to help show different ways of managing state in Apache Flink ☆27 Updated 6 years ago
- Starter project for building MemSQL Streamliner Pipelines ☆32 Updated 8 years ago
- Flink Examples ☆39 Updated 9 years ago
- ☆50 Updated 4 years ago
- Paper: A Zero-rename committer for object stores ☆20 Updated 3 years ago
- Unix tee, but for Kinesis streams ☆12 Updated 3 years ago
- Real-time anomaly detection using Kafka, KSQL User Defined Function and a pre-trained model ☆30 Updated last year
- Spark with Scala example projects ☆34 Updated 6 years ago
- ☆24 Updated 9 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ. ☆32 Updated 2 years ago
- ☆38 Updated 7 years ago
- ☆7 Updated 9 years ago
- ☆21 Updated 9 years ago
- Scalable CDC Pattern Implemented using PySpark ☆18 Updated 5 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry. ☆19 Updated 7 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌 ☆28 Updated 4 years ago