Real-time stock data pipeline: play with Kafka, Cassandra, Spark, Redis, Node.js, Zookeeper
☆79Feb 19, 2017Updated 9 years ago
Alternatives and similar repositories for DataPipeline
Users that are interested in DataPipeline are comparing it to the libraries listed below
- ☆17Feb 3, 2018Updated 8 years ago
- Developed a high performance data processing platform using Apache Kafka, Apache Cassandra, and Apache Spark to analyze stock price and r…☆85Jul 23, 2017Updated 8 years ago
- extendable field for use in Django Models☆29May 7, 2023Updated 2 years ago
- Deploy Apache Doris (incubating) FE & BE using shell scripts☆11Jul 8, 2019Updated 6 years ago
- python interface to bnlearn and other probabilistic graphical model libraries☆10Mar 26, 2020Updated 5 years ago
- Airflow POC demo : 1) env set up 2) airflow DAG 3) Spark/ML pipeline | #DE☆11Dec 19, 2022Updated 3 years ago
- My Data Engineering project @ Insight Data Science☆10Jul 23, 2018Updated 7 years ago
- ☆10Sep 25, 2024Updated last year
- Writer Identification of Handwritten Documents☆13Oct 18, 2017Updated 8 years ago
- Spark stream data processing; can receive data from flume-ng and kafka☆11Sep 16, 2015Updated 10 years ago
- An analysis on Aadhaar dataset using Mapreduce and Spark☆14Feb 28, 2018Updated 8 years ago
- Collect/process data via various data sources : website / js website / API. Run scraping pipeline via Celery, and Travis cron task. Du…☆14Jul 24, 2024Updated last year
- Spark-based hybrid query platform supporting federated queries across databases from different sources: mysql hive presto ...☆14Aug 3, 2017Updated 8 years ago
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆36Jul 28, 2017Updated 8 years ago
- 🤞💖🎁 Don't let your Raspberry Pi gather dust: why not run a Flink real-time analytics job on it? ✨🌹☆14May 16, 2024Updated last year
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Nov 11, 2018Updated 7 years ago
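- Big data [enterprise-grade 360° all-round user profiling]: source code for the tag development part☆19Dec 18, 2020Updated 5 years ago
- spring + spark streaming + kafka version 10 integration and exception handling☆17Jul 21, 2017Updated 8 years ago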
- Coding blog thingy.☆21Nov 6, 2025Updated 3 months ago
- This is a simple streaming application that utilises Kafka and Python☆45Jan 11, 2019Updated 7 years ago
- Quickly build alerting for a real-time monitoring system with Flink☆19Sep 7, 2019Updated 6 years ago
- Hands-on big data analysis with Hive☆23Aug 8, 2018Updated 7 years ago
- Machine learning model to predict NSE stocks for a year☆20Jan 14, 2019Updated 7 years ago
- Housing loan risk assessment from its origination data☆18Sep 27, 2023Updated 2 years ago
- Play with various big data technologies☆31Dec 26, 2022Updated 3 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆17Oct 1, 2019Updated 6 years ago
- This is a real-time dashboard example using Spark Streaming and Node.js☆26Dec 17, 2025Updated 2 months ago
- An end-to-end Recommendation System built on Azure Databricks☆55Jul 29, 2019Updated 6 years ago
- A tool for translating Scala source code into readable and maintainable Java code☆13Jan 3, 2026Updated 2 months ago
- Unlock level without hassle in Candy Crush Saga☆22Sep 5, 2017Updated 8 years ago
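- Parse MySQL binlog logs and send them to Kafka☆23Nov 25, 2016Updated 9 years ago
- SparkSQL data analysis case studies☆23Dec 3, 2016Updated 9 years ago
- featselector is a feature selector based on statistical analysis and model selection.☆14Mar 4, 2019Updated 7 years ago
- Skeleton for writing Spark programs in Scala with spring-boot☆28Oct 22, 2019Updated 6 years ago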
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆29Aug 8, 2020Updated 5 years ago
- Adjusting an implementation of AlphaZero to trading.☆28Dec 8, 2018Updated 7 years ago
- Header files for MT4/MT5 platform.☆10May 8, 2016Updated 9 years ago
- A batch-processing system based on Spring Boot and Spring Batch.☆10Sep 10, 2018Updated 7 years ago
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥☆11Apr 17, 2024Updated last year