Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra
☆85May 4, 2017Updated 9 years ago
Alternatives and similar repositories for data-processing-pipeline
Users that are interested in data-processing-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Examples of all Machine Learning Algorithm in Apache Spark☆15Nov 2, 2017Updated 8 years ago
- Real time stock data pipeline --play with Kafka, Cassandra, Spark, Redis, Node.js, Zookeeper☆80Feb 19, 2017Updated 9 years ago
- Docker container for Kafka - Spark Streaming - Cassandra☆96Jun 17, 2019Updated 6 years ago
- Mirror of Apache livy (Incubating)☆13Feb 8, 2024Updated 2 years ago
- The Social Media Lab's Fact Check Assistant is an AI-powered bot for simple fact checking. It was created as demo for the Social Media La…☆14Feb 14, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Tutorial on real-time data visualization. Python websocket server & d3.js + crossfilter.js frontend☆36Dec 4, 2018Updated 7 years ago
- Create an updating mechanism for an Arduino within the resin.io ecosystem.☆16Sep 13, 2017Updated 8 years ago
- Race mapper for displaying participant progress and location - Kafka, KSQL, Kibana and MQTT based integration☆18Sep 23, 2019Updated 6 years ago
- ☆10May 24, 2021Updated 4 years ago
- ☆50Feb 11, 2020Updated 6 years ago
- Insight Data Engineering Project☆15Jun 1, 2021Updated 4 years ago
- MPI-oriented extension of the Spark computational model☆24Jun 5, 2018Updated 7 years ago
- Data Science for Good links.☆14Nov 10, 2021Updated 4 years ago
- Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi☆40Aug 30, 2010Updated 15 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Building Microservices with Spring Boot☆14May 7, 2015Updated 11 years ago
- Code for my blogs on Data Engineering☆15Nov 9, 2020Updated 5 years ago
- Collection of AWS Lambda functions in Python☆11Mar 13, 2019Updated 7 years ago
- My professional portfolio with some of my best data science projects.☆11Jun 22, 2017Updated 8 years ago
- Memcached session store for Connect backed by memjs☆13Nov 14, 2022Updated 3 years ago
- A web application for real-time machine learning and sentiment analysis on Tweets☆43Sep 13, 2017Updated 8 years ago
- A database with automatic dynamic imputation of missing values.☆11Nov 2, 2017Updated 8 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆186Feb 7, 2023Updated 3 years ago
- using Redis for data science and data engineering☆16Jan 14, 2020Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Integrating Symbolic Programming and Neuromorphic Modeling for Edge Labs with NVIDIA Jetson, DGX Spark, and GPU-based DNN/ML Systems☆16Updated this week
- DBCA IT assets management system☆13Updated this week
- How to run DBT on AWS Fargate☆13Oct 15, 2019Updated 6 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆51Aug 23, 2019Updated 6 years ago
- Kafka Sink Connect OrientDB https://www.confluent.io/hub/sanjuthomas/kafka-connect-orientdb☆10Jan 26, 2026Updated 3 months ago
- The shared memory version of the Alternating Directions Implicit Solver for Isogeometric Analysis☆10Jan 26, 2019Updated 7 years ago
- Project used to generate ML.NET AutoML code for machine learning.☆11Jul 19, 2021Updated 4 years ago
- A collection of utilities and tools for teams and organizations using dbt☆15Nov 24, 2023Updated 2 years ago
- demo app running prometheus.io monitoring on resin devices☆24Aug 4, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Docker example with kafka connect and sink☆12Feb 12, 2018Updated 8 years ago
- ☆27May 1, 2024Updated 2 years ago
- Code for my talk "Stateful & Reactive Streaming Applications Without a Database" at WeAreDevelopers 2018☆11May 20, 2018Updated 7 years ago
- A python script to convert your youtube URL to an mp3 file and download it to the same directory as the .py file.☆10May 20, 2025Updated 11 months ago
- 🎁 Shows recommended files in Nextcloud☆15Updated this week
- 12 Week Data Science Immersive☆28May 13, 2015Updated 11 years ago
- CloudFormation template to create a VPC and subnets☆11Dec 2, 2021Updated 4 years ago