LuQQiu / DataPipeline
Real time stock data pipeline --play with Kafka, Cassandra, Spark, Redis, Node.js, Zookeeper
☆81 Updated 8 years ago
Alternatives and similar repositories for DataPipeline
Users that are interested in DataPipeline are comparing it to the libraries listed below
- Real-time report dashboard with Apache Kafka, Apache Spark Streaming and Node.js ☆50 Updated last year
- Updated repository ☆157 Updated 3 years ago
- Play with various big data technologies ☆31 Updated 2 years ago
- Docker container for Kafka - Spark Streaming - Cassandra ☆98 Updated 5 years ago
- Examples To Help You Learn Apache Spark ☆77 Updated 6 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University ☆158 Updated 6 months ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream ☆67 Updated 8 years ago
- Counting Tweets Per User in Real-Time ☆42 Updated 7 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀 ☆11 Updated 5 years ago
- A movie search engine based on ElasticSearch using Python ☆18 Updated 8 years ago
- PySpark Code for Hands-on Learners ☆116 Updated 5 years ago
- Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra ☆85 Updated 8 years ago
- PySpark Machine Learning Examples ☆44 Updated 7 years ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D… ☆30 Updated 4 years ago
- Repo for all my code on the articles I post on medium ☆107 Updated 2 years ago
- Apache Spark (PySpark) Practice on Real Data ☆274 Updated 5 years ago
- ☆25 Updated 6 years ago
- Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions ☆17 Updated last year
- An API to Analyze Cab GeoLocation Data and a Simulated App for finding an available cab in Real-Time ☆63 Updated 10 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2 ☆86 Updated 5 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag ☆29 Updated 9 years ago
- This is a simple streaming application that utilises Kafka and Python ☆46 Updated 6 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc ☆51 Updated 8 years ago
- Repository used for Spark Trainings ☆53 Updated 2 years ago
- Data Engineering Project at Insight ☆15 Updated 9 years ago
- A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0 ☆25 Updated 3 years ago
- Learning Spark SQL, published by Packt ☆42 Updated 2 years ago
- Materials for IBM Spark contest. About the real-world application of big data and spark. ☆78 Updated 6 years ago
- Example blueprint application for processing high-speed trading data. ☆84 Updated last year
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 Updated 6 years ago