Chabane / bigdata-playground
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
☆208Updated 5 years ago
Alternatives and similar repositories for bigdata-playground:
Users that are interested in bigdata-playground are comparing it to the libraries listed below
- A proof of concept using Divolte, Kafka, Druid and Superset☆61Updated 4 years ago
- Real-world Spark pipelines examples☆83Updated 6 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆183Updated last year
- ☆75Updated 4 years ago
- StreamLine - Streaming Analytics☆164Updated last year
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Updated 5 years ago
- ☆240Updated 3 years ago
- An example Apache Beam project.☆111Updated 7 years ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆199Updated 6 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆159Updated 2 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆92Updated 7 years ago
- Examples of Spark 2.0☆211Updated 3 years ago
- Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm☆103Updated last year
- This project describes how to write full ETL data pipeline using spark.☆15Updated 2 years ago
- DataQuality for BigData☆143Updated last year
- These are some code examples☆55Updated 5 years ago
- Example Maven configuration for a Spark, Scala project☆54Updated 2 years ago
- Experiments with Apache Flink.☆5Updated last year
- ☆81Updated last year
- Simple examle for Spark Streaming over Kafka topic☆106Updated 4 years ago
- ☆48Updated 4 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 9 years ago
- spark + drools☆102Updated 2 years ago
- The Internals of Spark Structured Streaming☆416Updated 2 years ago
- Apache Flink™ training material website☆79Updated 4 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆34Updated last month
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Takes a kafka stream into spark, apply transformations and sink into Druid. Everything Dockerised.☆30Updated last year
- Apache Spark and Apache Kafka integration example☆123Updated 7 years ago