LuQQiu / DataPipelineLinks
Real time stock data pipeline --play with Kafka, Cassandra, Spark, Redis, Node.js, Zookeeper
☆81Updated 8 years ago
Alternatives and similar repositories for DataPipeline
Users that are interested in DataPipeline are comparing it to the libraries listed below
Sorting:
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University☆163Updated 3 weeks ago
- Updated repository☆157Updated 4 years ago
- Real-time report dashboard with Apache Kafka, Apache Spark Streaming and Node.js☆50Updated 2 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 7 years ago
- Analyze and visualize Twitter Sentiment on a world map using Spark MLlib☆140Updated 4 years ago
- Docker container for Kafka - Spark Streaming - Cassandra☆97Updated 6 years ago
- ☆53Updated 3 years ago
- PySpark Code for Hands-on Learners☆116Updated 6 years ago
- Materials for IBM Spark contest. About the real-world application of big data and spark.☆79Updated 6 years ago
- Twitter Sentiment Analysis using Spark and Kafka☆113Updated 6 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆52Updated 9 years ago
- ☆105Updated 6 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream☆68Updated 8 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆87Updated 5 years ago
- Examples To Help You Learn Apache Spark☆78Updated 7 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 6 years ago
- A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0☆25Updated 4 years ago
- Self-contained examples using Apache Spark with the functional features of Java 8☆66Updated 7 years ago
- Repo for all my code on the articles I post on medium☆107Updated 3 years ago
- Apache Spark (PySpark) Practice on Real Data☆273Updated 5 years ago
- Takes a kafka stream into spark, apply transformations and sink into Druid. Everything Dockerised.☆30Updated 2 years ago
- Example blueprint application for processing high-speed trading data.☆84Updated 2 years ago
- Learning Spark SQL, published by Packt☆42Updated 2 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- Simple examle for Spark Streaming over Kafka topic☆107Updated 5 years ago
- Code files uploaded by Packt publishing☆33Updated 4 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀☆11Updated 5 years ago
- Repository used for Spark Trainings☆54Updated 2 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Updated 6 years ago
- Docker compose files for various kafka stacks☆32Updated 7 years ago