LuQQiu / DataPipeline
Real time stock data pipeline --play with Kafka, Cassandra, Spark, Redis, Node.js, Zookeeper
☆81Updated 8 years ago
Alternatives and similar repositories for DataPipeline:
Users that are interested in DataPipeline are comparing it to the libraries listed below
- Real-time report dashboard with Apache Kafka, Apache Spark Streaming and Node.js☆50Updated last year
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆51Updated 8 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 6 years ago
- Updated repository☆157Updated 3 years ago
- Materials for IBM Spark contest. About the real-world application of big data and spark.☆77Updated 6 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream☆68Updated 7 years ago
- Docker container for Kafka - Spark Streaming - Cassandra☆98Updated 5 years ago
- A movie search engine based on ElasticSearch using Python☆18Updated 8 years ago
- Twitter Sentiment Analysis using Spark and Kafka☆115Updated 5 years ago
- Example blueprint application for processing high-speed trading data.☆84Updated last year
- Takes a kafka stream into spark, apply transformations and sink into Druid. Everything Dockerised.☆30Updated last year
- ☆53Updated 2 years ago
- Apche Spark Structured Streaming with Kafka using Python(PySpark)☆40Updated 5 years ago
- PySpark Machine Learning Examples☆44Updated 7 years ago
- A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0☆25Updated 3 years ago
- Analyze and visualize Twitter Sentiment on a world map using Spark MLlib☆139Updated 3 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- Real-world Spark pipelines examples☆83Updated 7 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀☆11Updated 5 years ago
- PySpark Cookbook, published by Packt☆91Updated 2 years ago
- Learning Spark SQL, published by Packt☆42Updated 2 years ago
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆29Updated 2 years ago
- Code repository for Learning Apache Spark 2, published by Packt☆21Updated 2 years ago
- Play with various big data technologies☆31Updated 2 years ago
- ☆26Updated last year
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆199Updated 7 years ago
- ☆105Updated 5 years ago
- Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra☆84Updated 7 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University☆155Updated 4 months ago
- Counting Tweets Per User in Real-Time☆42Updated 7 years ago