Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra
☆85May 4, 2017Updated 9 years ago
Alternatives and similar repositories for data-processing-pipeline
Users that are interested in data-processing-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Examples of all Machine Learning Algorithm in Apache Spark☆15Nov 2, 2017Updated 8 years ago
- Real time stock data pipeline --play with Kafka, Cassandra, Spark, Redis, Node.js, Zookeeper☆80Feb 19, 2017Updated 9 years ago
- The Smallest Docker Images☆18Nov 2, 2018Updated 7 years ago
- Docker container for Kafka - Spark Streaming - Cassandra☆96Jun 17, 2019Updated 6 years ago
- Structured Streaming is a reference application showing how to easily integrate structured streaming Apache Spark Structured Streaming, …☆13Nov 17, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Scripts for building Cloudera Manager parcel and CSD for Livy Spark Server☆21Oct 18, 2017Updated 8 years ago
- Spark, Airflow, Kafka☆24Apr 30, 2023Updated 3 years ago
- ☆11Apr 15, 2019Updated 7 years ago
- Race mapper for displaying participant progress and location - Kafka, KSQL, Kibana and MQTT based integration☆18Sep 23, 2019Updated 6 years ago
- A ready to go Big Data cluster (Hadoop + Hadoop Streaming + Spark + PySpark) with Docker and Docker Swarm!☆22May 20, 2025Updated last year
- Create an updating mechanism for an Arduino within the resin.io ecosystem.☆16Sep 13, 2017Updated 8 years ago
- ☆10May 24, 2021Updated 5 years ago
- Container Assembly Builder☆15Sep 23, 2021Updated 4 years ago
- ECHO: An Adaptive Orchestration Platform for Hybrid Dataflows across Cloud and Edge Resources☆17Apr 5, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Chrome Extension for Development/Testing/Exploring GraphQL Servers☆14Oct 1, 2018Updated 7 years ago
- ☆51May 21, 2026Updated 2 weeks ago
- Skeleton project for Apache Airflow training participants to work on.☆17Apr 13, 2026Updated last month
- Insight Data Engineering Project☆15Jun 1, 2021Updated 5 years ago
- Project based learning for Data Engineering fundamentals.☆13Jan 15, 2021Updated 5 years ago
- Building Microservices with Spring Boot☆14May 7, 2015Updated 11 years ago
- My professional portfolio with some of my best data science projects.☆11Jun 22, 2017Updated 8 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆186Feb 7, 2023Updated 3 years ago
- using Redis for data science and data engineering☆16Jan 14, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆51Aug 23, 2019Updated 6 years ago
- Kafka Sink Connect OrientDB https://www.confluent.io/hub/sanjuthomas/kafka-connect-orientdb☆10Jan 26, 2026Updated 4 months ago
- ☆35Dec 20, 2016Updated 9 years ago
- Udacity Data Engineering Nano Degree (DEND)☆188Jan 20, 2020Updated 6 years ago
- Kafka Connect connector for CDC data from postgres☆11Aug 27, 2017Updated 8 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Mar 26, 2016Updated 10 years ago
- ☆27May 1, 2024Updated 2 years ago
- ☆17Apr 13, 2017Updated 9 years ago
- Code for my talk "Stateful & Reactive Streaming Applications Without a Database" at WeAreDevelopers 2018☆11May 20, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Deliver Pinpoint Campaigns Driven by Machine Learning on AWS SageMaker☆18Feb 10, 2019Updated 7 years ago
- Training Material for Gardening Days☆13May 6, 2021Updated 5 years ago
- 12 Week Data Science Immersive☆28May 13, 2015Updated 11 years ago
- Fivetran's Salesforce dbt package☆52May 26, 2026Updated last week
- free bike for everyone☆15Aug 20, 2019Updated 6 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆62Dec 5, 2019Updated 6 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Aug 17, 2022Updated 3 years ago