I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that utilizes Kafka to scrape, process, and load data onto S3 in JSON format. With a producer-consumer architecture, I ensure that the data is in the right format for loading onto S3 by performing minor transformations
☆29May 2, 2023Updated 3 years ago
Alternatives and similar repositories for real-time_crypto_data_pipeline_using_kafka
Users that are interested in real-time_crypto_data_pipeline_using_kafka are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This project involves an ETL (Extract, Transform, Load) process to analyze sleep data exported from Apple Health☆29Apr 29, 2023Updated 3 years ago
- sql-for-data-engineering-course☆18May 12, 2023Updated 3 years ago
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆25May 6, 2023Updated 3 years ago
- ☆146Jan 31, 2023Updated 3 years ago
- Collection of my favorite Python packages from 2020☆11Jan 12, 2021Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A big data project to develop a real-time data pipeline for analyzing the popularity and sentiments of trending topics on Twitter.☆24Jun 21, 2022Updated 3 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆54Feb 13, 2022Updated 4 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆51Aug 23, 2019Updated 6 years ago
- Qs from other and my answer on them☆10Jan 27, 2024Updated 2 years ago
- YouTube tutorial project☆108Oct 17, 2023Updated 2 years ago
- A simple cli tool that deletes files matching an extension within a given directory structure.☆12Sep 27, 2023Updated 2 years ago
- A golang and graphql/restapi boilerplate build for fast and quick build.☆13Apr 28, 2024Updated 2 years ago
- Business challenge that requires building a data platform for retailer data analytics.☆18Feb 19, 2023Updated 3 years ago
- Capstone Project for the IBM Data Engineering Professional Certification.☆13Mar 7, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Monoscope's Golang client SDK.☆19Mar 1, 2026Updated 2 months ago
- Pyspark Spotify ETL☆17Aug 19, 2021Updated 4 years ago
- A highly scalable microservice to handle WhatsApp, SMS and email-based notifications.☆20Mar 29, 2021Updated 5 years ago
- Insight Data Engineering project: A platform built in HDFS, Spark and Airflow to help you to find social influencers from GitHub Net…☆16May 21, 2024Updated last year
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆23Jan 28, 2018Updated 8 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- An interactive platform that enables individuals to share their unique experiences across various stages of life and emotional journeys. …☆14Feb 29, 2024Updated 2 years ago
- Case Studies and Projects in Machine Learning/EDA/DL☆24Jun 18, 2024Updated last year
- Stream/batch system with Hadoop, Spark on NYC taxi data | #DE☆26Apr 10, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆17Dec 14, 2021Updated 4 years ago
- R toolbox to explore the TRON blockchain☆10Jul 18, 2021Updated 4 years ago
- ☆55Apr 3, 2026Updated last month
- Data Engineering YouTube Analysis Project by Darshil Parmar☆241Dec 8, 2023Updated 2 years ago
- ☆171May 20, 2022Updated 3 years ago
- This project aims to build a traveling recommendation application using Google Places API and OpenAI LLM.☆11Mar 19, 2024Updated 2 years ago
- Deep Learning Projects on TensorFlow and Keras☆20Jun 13, 2024Updated last year
- This repo contains all iNeuron Full Stack Data Science Assignments☆12Jun 6, 2023Updated 2 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Aug 11, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆18Oct 31, 2020Updated 5 years ago
- Project on belief embedding☆22Jun 4, 2025Updated 11 months ago
- ☆17Feb 9, 2023Updated 3 years ago
- Deployed on expo go☆23May 22, 2022Updated 3 years ago
- This project aims to predict smartphone prices using a combination of batch and stream processing techniques in a Big Data environment. T…☆26Apr 15, 2024Updated 2 years ago
- The repository for the CMU Data Pipeline course. This year's course should use branch 2017☆40May 2, 2017Updated 9 years ago
- 🖼️ | Quickly deploy a custom RunPod Endpoint API using your own model ckpt.☆32Jun 11, 2025Updated 11 months ago