hakanilter / aws-emr-examples
Some AWS EMR examples
☆16 · Updated 7 years ago
Alternatives and similar repositories for aws-emr-examples
Users who are interested in aws-emr-examples are comparing it to the repositories listed below.
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark ☆14 · Updated 2 years ago
- Hive Storage Handler for Kinesis. ☆11 · Updated 10 years ago
- Examples of all Machine Learning Algorithm in Apache Spark ☆15 · Updated 7 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 · Updated 8 years ago
- Testing Scala code with scalatest ☆12 · Updated 2 years ago
- An example PySpark project with pytest ☆16 · Updated 7 years ago
- Flink Examples ☆39 · Updated 9 years ago
- CloudFormation templates and scripts demonstrating how to build a promotion recommendation system using Kinesis and SageMaker. ☆28 · Updated 7 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink ☆14 · Updated 9 years ago
- Spark stream from kafka(json) to s3(parquet) ☆15 · Updated 6 years ago
- ☆21 · Updated 9 years ago
- Scalable CDC Pattern Implemented using PySpark ☆18 · Updated 5 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌 ☆29 · Updated 5 years ago
- phData Pulse application log aggregation and monitoring ☆13 · Updated 5 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r… ☆62 · Updated 2 years ago
- Connect DBVisualizer to Hortonwork HiveServer2 ☆9 · Updated 10 years ago
- Spark in Kaggle competitions ☆10 · Updated 9 years ago
- Utilities for writing tests that use Apache Spark. ☆24 · Updated 6 years ago
- Code and setup information for Introduction to Machine Learning with Spark ☆12 · Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines ☆32 · Updated 8 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 · Updated 2 years ago
- ☆22 · Updated 6 years ago
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium ☆27 · Updated 4 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores. ☆20 · Updated 3 years ago
- Spark Scala docker container sample for AWS testing - EKS & S3 ☆24 · Updated 6 years ago
- ☆10 · Updated 8 years ago
- Examples for Fast Data Processing with Spark ☆59 · Updated 11 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry. ☆20 · Updated 7 years ago
- ☆9 · Updated 9 years ago
- ☆38 · Updated 7 years ago