SourabhSinghRana/real-time_crypto_data_pipeline_using_kafka

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SourabhSinghRana/real-time_crypto_data_pipeline_using_kafka)

SourabhSinghRana / real-time_crypto_data_pipeline_using_kafka

I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that utilizes Kafka to scrape, process, and load data onto S3 in JSON format. With a producer-consumer architecture, I ensure that the data is in the right format for loading onto S3 by performing minor transformations

☆29

Alternatives and similar repositories for real-time_crypto_data_pipeline_using_kafka

Users that are interested in real-time_crypto_data_pipeline_using_kafka are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

vinniepsychosis / ETL-Apple-Health
View on GitHub
This project involves an ETL (Extract, Transform, Load) process to analyze sleep data exported from Apple Health
☆29Apr 29, 2023Updated 3 years ago
darshilparmar / sql-for-data-engineering-course
View on GitHub
sql-for-data-engineering-course
☆18May 12, 2023Updated 3 years ago
priye-1 / Real_time_End_to_End_Pipeline_using_Kafka
View on GitHub
☆19May 27, 2023Updated 3 years ago
ArpiteshSrivastava / spotify-data-engineering-project
View on GitHub
In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…
☆25May 6, 2023Updated 3 years ago
limadelrey / kafka-connect-cdc-medium
View on GitHub
Kafka Connect: How to create a real time data pipeline using Change Data Capture (CDC)
☆13Jan 24, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
darshilparmar / twitter-airflow-data-engineering-project
View on GitHub
YouTube tutorial project
☆109Oct 17, 2023Updated 2 years ago
acheamponge / VERSUZ
View on GitHub
A Hiphop v. Literature project to demonstrate using NLP that Hip-Hop is a form of literature and rap artists are literary geniuses.
☆13Nov 13, 2020Updated 5 years ago
liamhartley / spotify_analysis
View on GitHub
Analyse Spotify playlists, albums and artists.
☆35Nov 15, 2022Updated 3 years ago
Rohanvp07 / Covid-19-Analysis-and-Prediction
View on GitHub
☆16Feb 20, 2026Updated 5 months ago
bxffour / delly
View on GitHub
A simple cli tool that deletes files matching an extension within a given directory structure.
☆12Sep 27, 2023Updated 2 years ago
priye-1 / airflow_data_pipeline
View on GitHub
☆16May 29, 2023Updated 3 years ago
c85 / ibm-de-capstone
View on GitHub
Capstone Project for the IBM Data Engineering Professional Certification.
☆13Mar 7, 2022Updated 4 years ago
amar-jay / go-graphql-boilerplate
View on GitHub
A golang and graphql/restapi boilerplate build for fast and quick build.
☆13Apr 28, 2024Updated 2 years ago
aravindr18 / RedditR--Insight-Data-Engineering-Project
View on GitHub
RedditR for Content Engagement and Recommendation
☆18Dec 21, 2017Updated 8 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
monoscope-tech / monoscope-go
View on GitHub
Monoscope's Golang client SDK.
☆19Mar 1, 2026Updated 4 months ago
ahmadluay9 / travel-planner-app
View on GitHub
This project aims to build a traveling recommendation application using Google Places API and OpenAI LLM.
☆11Mar 19, 2024Updated 2 years ago
mihirkudale / iNeuron-Data-Science-Assignments
View on GitHub
This repo contains all iNeuron Full Stack Data Science Assignments
☆12Jun 6, 2023Updated 3 years ago
RbkGh / MoneyTransferRESTApi
View on GitHub
A dead simple Java REST API(without Spring) to transfer money between accounts
☆15Aug 29, 2019Updated 6 years ago
yennanliu / NYC_Taxi_Pipeline
View on GitHub
Stream/batch system with Hadoop, Spark on NYC taxi data | #DE
☆26Apr 10, 2026Updated 3 months ago
Fissayo / IBM-Data-Analyst-Capstone-project
View on GitHub
This is my capstone project from the IBM Data Analyst course. In each analytics process, the data is stored in the Jupyter notebooks that…
☆10May 25, 2023Updated 3 years ago
dilkhush-raj / e-tutor
View on GitHub
E-Learning Platform using MERN stack
☆18Jun 18, 2022Updated 4 years ago
Dmitry543 / Fine-Tuning-OCR-Model-with-PaddleOCR
View on GitHub
Comprehensive guide and codebase for fine-tuning the OCR model using PaddleOCR
☆23Jan 24, 2025Updated last year
darshilparmar / dataengineering-youtube-analysis-project
View on GitHub
Data Engineering YouTube Analysis Project by Darshil Parmar
☆247Dec 8, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
next-game-solutions / tronr
View on GitHub
R toolbox to explore the TRON blockchain
☆10Jul 18, 2021Updated 5 years ago
andrem8 / surf_dash
View on GitHub
☆176May 20, 2022Updated 4 years ago
entbappy / Hate-Speech-Classification
View on GitHub
☆17Feb 9, 2023Updated 3 years ago
rafiqhasan / AI_DL_ML_Repo
View on GitHub
Deep Learning Projects on TensorFlow and Keras
☆20Jun 13, 2024Updated 2 years ago
Apress / data-science-fund-for-python-and-mongodb
View on GitHub
Source code for ' Data Science Fundamentals for Python and MongoDB' by David Paper
☆32May 15, 2018Updated 8 years ago
mansik95 / IMDB-Analysis
View on GitHub
This repository contains analysis of IMDB data from multiple sources and analysis of movies/cast/box office revenues, movie brands and fr…
☆31Jun 1, 2020Updated 6 years ago
susanli2016 / Mathematics-for-Machine-Learning-Specialization
View on GitHub
☆18Oct 31, 2020Updated 5 years ago
darshilparmar / uber-etl-pipeline-data-engineering-project
View on GitHub
☆344Aug 13, 2024Updated last year
samueltc / ABBYY
View on GitHub
Simple Python wrapper for ABBYY Cloud OCR
☆17Feb 5, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
darshilparmar / stock-market-kafka-data-engineering-project
View on GitHub
☆215Aug 13, 2023Updated 2 years ago
Tanishka-dev / Uber-Clone-React-Native
View on GitHub
Deployed on expo go
☆23May 22, 2022Updated 4 years ago
nama1arpit / reddit-streaming-pipeline
View on GitHub
A real-time reddit data streaming pipeline for sentiment analysis of various subreddits
☆148Aug 23, 2023Updated 2 years ago
darshilparmar / Udacity-Data-Engineer-nanodegree
View on GitHub
Classwork projects and home works done through Udacity data engineering nano degree
☆10Jun 6, 2021Updated 5 years ago
FareedKhan-dev / AI-outlier-detection
View on GitHub
Outlier Detection with AI + ML
☆15Sep 12, 2025Updated 10 months ago
pran4ajith / spark-twitter-streaming
View on GitHub
A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…
☆29Aug 8, 2020Updated 5 years ago
AjinkyaGhadge / Google-Cloud-based-ALPR
View on GitHub
A simple to use python script for Automatic License Plate Recognition using Google Cloud Vision API.
☆15Apr 29, 2018Updated 8 years ago