LuQQiu / DataPipeline
Real time stock data pipeline --play with Kafka, Cassandra, Spark, Redis, Node.js, Zookeeper
☆80Updated 7 years ago
Related projects: ⓘ
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆52Updated 8 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆54Updated 5 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream☆68Updated 7 years ago
- Real-time report dashboard with Apache Kafka, Apache Spark Streaming and Node.js☆49Updated last year
- Counting Tweets Per User in Real-Time☆41Updated 7 years ago
- Twitter Sentiment Analysis using Spark and Kafka☆113Updated 5 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University☆149Updated 2 weeks ago
- Analyze and visualize Twitter Sentiment on a world map using Spark MLlib☆136Updated 3 years ago
- ☆53Updated 2 years ago
- A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0☆25Updated 3 years ago
- A movie search engine based on ElasticSearch using Python☆18Updated 7 years ago
- Updated repository☆157Updated 2 years ago
- The demo of using Kafka, Spark, Hive, Cassandra, etc by using Docker. It produces the production ready environment for any kinds of big d…☆30Updated 4 years ago
- Docker container for Kafka - Spark Streaming - Cassandra☆98Updated 5 years ago
- Takes a kafka stream into spark, apply transformations and sink into Druid. Everything Dockerised.☆30Updated 11 months ago
- PySpark Code for Hands-on Learners☆112Updated 4 years ago
- An API to Analyze Cab GeoLocation Data and a Simulated App for finding an available cab in Real-Time☆63Updated 9 years ago
- PySpark Cookbook, published by Packt☆89Updated last year
- Training models with Apache Spark, PySpark for Titanic Kaggle competition☆14Updated 7 years ago
- Examples To Help You Learn Apache Spark☆78Updated 5 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 8 years ago
- Repo for all my code on the articles I post on medium☆105Updated last year
- Various data stream/batch process demo with Apache Scala Spark 🚀☆11Updated 4 years ago
- Repository used for Spark Trainings☆53Updated last year
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time☆65Updated 7 years ago
- Example blueprint application for processing high-speed trading data.☆84Updated 9 months ago
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆41Updated 5 years ago
- ☆14Updated 7 years ago