Real time stock data pipeline --play with Kafka, Cassandra, Spark, Redis, Node.js, Zookeeper
☆80Feb 19, 2017Updated 9 years ago
Alternatives and similar repositories for DataPipeline
Users that are interested in DataPipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- some changes based on an existing open source projects--VNPY☆13Oct 22, 2017Updated 8 years ago
- Developed a high performance data processing platform using Apache Kafka, Apache Cassandra, and Apache Spark to analyze stock price and r…☆86Jul 23, 2017Updated 8 years ago
- This is a real-time dashboard example using Spark Streaming and Node.js☆25Dec 17, 2025Updated 5 months ago
- Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra☆85May 4, 2017Updated 9 years ago
- My Data Engineering project @ Insight Data Science☆10Jul 23, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Big data technologies☆11Apr 10, 2017Updated 9 years ago
- Kafka docker image, ready for k8s and openshift clusters☆18Jun 9, 2021Updated 5 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream☆68Apr 27, 2017Updated 9 years ago
- 使用shell脚本部署Apache Doris (incubating) FE & BE☆11Jul 8, 2019Updated 6 years ago
- A big data project to apply Hadoop map- reduce to derive some statistics from IMDB movie data.☆26Feb 19, 2015Updated 11 years ago
- 用户画像代码,根据算法推算出用户的性别和年龄比率☆11Dec 18, 2017Updated 8 years ago
- This is a crawler(reptile)☆45Mar 14, 2019Updated 7 years ago
- An end-to-end Recommendation System built on Azure Databricks☆56Jul 29, 2019Updated 6 years ago
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆36Jul 28, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A streaming ETL for fish☆13Jan 24, 2019Updated 7 years ago
- A small distributed key value store written in java, uses event driven architecture and supports replication.☆16Sep 19, 2024Updated last year
- A consumer of a Kafka topic based on Flink☆12Oct 5, 2022Updated 3 years ago
- Demo showcasing Spark Streaming, Kafka, Kudu - all in Python☆27Jun 12, 2017Updated 9 years ago
- Project for reading data from kafka and writing to kafka and HBase with kerberos☆24Dec 8, 2016Updated 9 years ago
- ☆25Aug 23, 2017Updated 8 years ago
- Play with various big data technologies☆10Jul 12, 2017Updated 8 years ago
- Data engineering interviews Q&A for data community by data community☆68Jun 7, 2020Updated 6 years ago
- Housing loan risk assessment from its origination data☆21Sep 27, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 大数据【企业级360°全方位用户画像】标签开发部分源码☆20Dec 18, 2020Updated 5 years ago
- ☆14Nov 3, 2016Updated 9 years ago
- Data Engineering Project at Insight☆15Nov 17, 2015Updated 10 years ago
- Django Based Hotel Management App☆15Nov 22, 2022Updated 3 years ago
- Some popular algorithms(dbscan,knn,fm etc.) on spark☆32May 29, 2018Updated 8 years ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆29Aug 8, 2020Updated 5 years ago
- A collection of tools that help me work with Avro☆23Jan 7, 2010Updated 16 years ago
- spring+spark streaming+kafka 10版本集成和异常问题处理☆17Jul 21, 2017Updated 8 years ago
- A list of my scientific publication☆12May 1, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- oracle数据同步到Greenplum的shell脚本☆11May 13, 2019Updated 7 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆17Oct 1, 2019Updated 6 years ago
- Coding blog thingy.☆21Nov 6, 2025Updated 7 months ago
- 一个基于ElasticSearch的业务日志记录工具☆10Nov 5, 2018Updated 7 years ago
- 解析Mysql binlog日志并发至Kafka☆23Nov 25, 2016Updated 9 years ago
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥☆11Apr 17, 2024Updated 2 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago