rogaha/data-processing-pipeline

Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra

☆85

Alternatives and similar repositories for data-processing-pipeline

Users who are interested in data-processing-pipeline are comparing it to the libraries listed below.

akashsethi24 / Machine-Learning
Examples of all Machine Learning Algorithm in Apache Spark
☆15 · Nov 2, 2017 · Updated 8 years ago
Yannael / kafka-sparkstreaming-cassandra
Docker container for Kafka - Spark Streaming - Cassandra
☆96 · Jun 17, 2019 · Updated 7 years ago
aimlcommunity / Breast-Cancer-Detection-using-Machine-Learning
This is a guided certification project, as a part of Data Science for Social Good initiative
☆18 · Mar 9, 2020 · Updated 6 years ago
zhujun98 / data-engineering
Spark, Airflow, Kafka
☆24 · Apr 30, 2023 · Updated 3 years ago
GuruCharan94 / az-podcast-transcriber
A podcast transcription service built on Azure that transcribes any new episode of your podcast and displays synchronized transcripts alo…
☆10 · Dec 10, 2022 · Updated 3 years ago
scylladb / kafka-connect-scylladb
Kafka Connect Scylladb Sink
☆54 · Updated this week
ybangaru / wallstreetbets-sentiment-analysis
☆10 · May 24, 2021 · Updated 5 years ago
darenasc / data-science-for-good
Data Science for Good links.
☆14 · Nov 10, 2021 · Updated 4 years ago
bpatters / graphiql-feen
Chrome Extension for Development/Testing/Exploring GraphQL Servers
☆14 · Oct 1, 2018 · Updated 7 years ago
balena-io-experimental / balena-arduino-programmer
Create an updating mechanism for an Arduino within the resin.io ecosystem.
☆16 · Sep 13, 2017 · Updated 8 years ago
argx / fake-fews
Candidate solution for Facebook's fake news problem using machine learning and crowd-sourcing. Takes form of a Chrome extension. Develope…
☆13 · Aug 25, 2017 · Updated 8 years ago
angelddaz / de-challenges
Project based learning for Data Engineering fundamentals.
☆13 · Jan 15, 2021 · Updated 5 years ago
maxgherman / udacity-cloud-devops-engineer
Udacity Cloud DevOps Engineer Nanodegree program
☆32 · Updated this week
crawles / twitter-nlp
A web application for real-time machine learning and sentiment analysis on Tweets
☆42 · Sep 13, 2017 · Updated 8 years ago
Adityav2410 / HAPT-Recognition
Human Activities and Postural Transitions’ Recognition using Smartphone Data
☆14 · Oct 2, 2017 · Updated 8 years ago
alexhisen / mobx-forms-demo
☆12 · Jan 12, 2023 · Updated 3 years ago
nareshk1290 / Udacity-Data-Engineering
Udacity Data Engineering Nano Degree (DEND)
☆189 · Jan 20, 2020 · Updated 6 years ago
kboom / iga-adi-sm
The shared memory version of the Alternating Directions Implicit Solver for Isogeometric Analysis
☆10 · Jan 26, 2019 · Updated 7 years ago
sanjuthomas / kafka-connect-orientdb
Kafka Sink Connect OrientDB https://www.confluent.io/hub/sanjuthomas/kafka-connect-orientdb
☆10 · Jan 26, 2026 · Updated 5 months ago
aqidd / vue-realtime-dashboard
a simple vuejs realtime dashboard with firebase, google maps & chartjs
☆24 · Dec 6, 2017 · Updated 8 years ago
voi-oss / dbt-toolkit
A collection of utilities and tools for teams and organizations using dbt
☆15 · Nov 24, 2023 · Updated 2 years ago
vatsal220 / medium_articles
This repository will contain the code associated to the scripts I've been writing in my medium articles
☆47 · Aug 12, 2023 · Updated 2 years ago
fabiogjardim / datalab
☆10 · Jan 27, 2025 · Updated last year
thehannankhan / youtube-to-mp3-converter
A python script to convert your youtube URL to an mp3 file and download it to the same directory as the .py file.
☆10 · May 20, 2025 · Updated last year
aws-samples / aws-sagemaker-ml-blog-predictive-campaigns
Deliver Pinpoint Campaigns Driven by Machine Learning on AWS SageMaker
☆18 · Feb 10, 2019 · Updated 7 years ago
ansrivas / spark-structured-streaming
Spark structured streaming with Kafka data source and writing to Cassandra
☆62 · Dec 5, 2019 · Updated 6 years ago
balena-io-experimental / balena-prometheus
demo app running prometheus.io monitoring on resin devices
☆24 · Aug 4, 2016 · Updated 9 years ago
basil-b2s / Language-Detector
NLP Model for predicting 17 different languages
☆16 · Oct 19, 2023 · Updated 2 years ago
jerrinss5 / Multi-threaded-Proxy-Server
Multi-threaded simple proxy server in Python with file caching
☆11 · Oct 4, 2020 · Updated 5 years ago
calrissian / spark-jetty-server
Recipes and examples for Apache Spark
☆13 · Jan 21, 2015 · Updated 11 years ago
acheamponge / VERSUZ
A Hiphop v. Literature project to demonstrate using NLP that Hip-Hop is a form of literature and rap artists are literary geniuses.
☆13 · Nov 13, 2020 · Updated 5 years ago
trulia / node-optimizely
Runs optimizely experiments in node using either jsdom (slow & stable) or cheerio+node-vm (young blood)
☆15 · Sep 1, 2017 · Updated 8 years ago
threepointone / require.import
sync-friendly code splits for webpack
☆25 · Mar 11, 2017 · Updated 9 years ago
lsongdev / ofo
free bike for everyone
☆15 · Aug 20, 2019 · Updated 6 years ago
shravan-kuchkula / dataEngineering
A repo to track data engineering projects
☆14 · Nov 11, 2022 · Updated 3 years ago
Wittline / D3JS-Dashboard
Building Responsive DashBoard with D3.js and ASP.NET MVC from scratch (SQL SERVER - SSIS - API REST)
☆14 · May 31, 2023 · Updated 3 years ago
duongngyn0510 / centralized-server-monitoring
☆11 · Nov 21, 2023 · Updated 2 years ago
BIDS / Kira
Kira is an astronomy image processing toolkit implemented with Apache Spark.
☆15 · Feb 9, 2016 · Updated 10 years ago
devinbrady / transcribe-podcast
Use the Google Cloud Speech API to transcribe audio files from a podcast.
☆20 · May 17, 2017 · Updated 9 years ago