Real-time stock data pipeline: play with Kafka, Cassandra, Spark, Redis, Node.js, ZooKeeper
☆79Feb 19, 2017Updated 9 years ago
Alternatives and similar repositories for DataPipeline
Users who are interested in DataPipeline are comparing it to the libraries listed below.
- ☆10Sep 25, 2024Updated last year
- Developed a high performance data processing platform using Apache Kafka, Apache Cassandra, and Apache Spark to analyze stock price and r…☆85Jul 23, 2017Updated 8 years ago
- This is a real-time dashboard example using Spark Streaming and Node.js☆26Dec 17, 2025Updated 4 months ago
- Airflow POC demo : 1) env set up 2) airflow DAG 3) Spark/ML pipeline | #DE☆11Dec 19, 2022Updated 3 years ago
- python interface to bnlearn and other probabilistic graphical model libraries☆10Mar 26, 2020Updated 6 years ago
- extendable field for use in Django Models☆29May 7, 2023Updated 2 years ago
- Big data technologies☆11Apr 10, 2017Updated 9 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream☆68Apr 27, 2017Updated 9 years ago
- Deploy Apache Doris (incubating) FE & BE using shell scripts☆11Jul 8, 2019Updated 6 years ago
- User profiling code that uses algorithms to estimate users' gender and age ratios☆11Dec 18, 2017Updated 8 years ago
- A Real-time Apache log monitor using Kafka & Spark Streaming, with fake log generator.☆24Feb 19, 2020Updated 6 years ago
- Collect/process data via various data sources: website / js website / API. Run scraping pipeline via Celery, and Travis cron task. Du…☆14Jul 24, 2024Updated last year
- Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org☆36Jul 28, 2017Updated 8 years ago
- A streaming ETL for fish☆13Jan 24, 2019Updated 7 years ago
- An Improvement for Apache Open Source Software Pig☆10May 5, 2017Updated 9 years ago
- ☆11Feb 7, 2021Updated 5 years ago
- One-click, Ansible-based installation tool for multi-host Greenplum clusters //dbswitch.gitee.io/docs-site/☆15Jul 4, 2021Updated 4 years ago
- Demo showcasing Spark Streaming, Kafka, Kudu - all in Python☆27Jun 12, 2017Updated 8 years ago
- Project for reading data from kafka and writing to kafka and HBase with kerberos☆24Dec 8, 2016Updated 9 years ago
- featselector is a feature selector based on statistical analysis and model selection.☆14Mar 4, 2019Updated 7 years ago
- Play with various big data technologies☆10Jul 12, 2017Updated 8 years ago
- An example project using Spark Streaming with Kafka message and Avro serialization☆12Aug 21, 2015Updated 10 years ago
- ☆14Nov 3, 2016Updated 9 years ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆29Aug 8, 2020Updated 5 years ago
- Check out the dash visualization at https://dash-drug-explorer.plot.ly/out☆12Dec 26, 2022Updated 3 years ago
- A collection of tools that help me work with Avro☆23Jan 7, 2010Updated 16 years ago
- Spark in Action, 2nd edition - chapter 7 - Ingestion from files☆20Apr 21, 2023Updated 3 years ago
- A Node.js module backed by Redis that helps with rate limiting.☆19Apr 1, 2013Updated 13 years ago
- Data Mining and Analytics in Intelligent Business Services, UC Berkeley School of Information☆20May 17, 2013Updated 12 years ago
- Experimentation for Engineers (Manning, 2023)☆20Dec 15, 2022Updated 3 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆17Oct 1, 2019Updated 6 years ago
- A simple example for Python Kafka Avro☆86Aug 27, 2018Updated 7 years ago
- Demonstrate the use of Ansible best practices in a workshop☆10Jul 7, 2019Updated 6 years ago
- ☆13Feb 4, 2021Updated 5 years ago
- A business logging tool based on ElasticSearch☆10Nov 5, 2018Updated 7 years ago
- A deep learning based bioinformatics project on epigenetics in Type 2 Diabetes.☆17Mar 25, 2023Updated 3 years ago
- Genetic Algorithm Feature Engineering☆15Oct 3, 2017Updated 8 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Jun 23, 2016Updated 9 years ago
- Created by Platform Services GitHub tool on Sun Jan 08 2017☆12Mar 11, 2026Updated last month