LuQQiu / DataPipeline
Real-time stock data pipeline: play with Kafka, Cassandra, Spark, Redis, Node.js, and ZooKeeper
☆81 Updated 7 years ago
Related projects
Alternatives and complementary repositories for DataPipeline
- Real-time Machine Learning with Apache Spark on Twitter Public Stream ☆68 Updated 7 years ago
- How to manage Slowly Changing Dimensions with Apache Hive ☆55 Updated 5 years ago
- Twitter Sentiment Analysis using Spark and Kafka ☆114 Updated 5 years ago
- Docker container for Kafka - Spark Streaming - Cassandra ☆97 Updated 5 years ago
- Simple example for Spark Streaming over Kafka topic ☆107 Updated 4 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀 ☆11 Updated 4 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 Updated 5 years ago
- Analyze and visualize Twitter Sentiment on a world map using Spark MLlib ☆138 Updated 3 years ago
- PySpark Code for Hands-on Learners ☆114 Updated 5 years ago
- Counting Tweets Per User in Real-Time ☆41 Updated 7 years ago
- ☆105 Updated 4 years ago
- Updated repository ☆157 Updated 2 years ago
- Example blueprint application for processing high-speed trading data. ☆84 Updated 11 months ago
- Takes a Kafka stream into Spark, applies transformations, and sinks into Druid. Everything Dockerised. ☆30 Updated last year
- A project with examples of using a few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0 ☆25 Updated 3 years ago
- Just a boilerplate for PySpark and Flask ☆35 Updated 6 years ago
- Helping you get Airflow running in production. ☆9 Updated 5 years ago
- Repo for all my code on the articles I post on medium ☆105 Updated 2 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra ☆64 Updated 4 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator ☆75 Updated 5 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2 ☆83 Updated 4 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic ☆184 Updated last year
- ☆53 Updated 2 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark ☆41 Updated 7 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University ☆154 Updated last week
- Spark Streaming HBase Example ☆22 Updated 8 years ago
- Docker compose files for various kafka stacks ☆33 Updated 6 years ago
- Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra ☆83 Updated 7 years ago
- ☆24 Updated 8 years ago
- Spark SQL UDF examples ☆56 Updated 6 years ago