I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uses Kafka to scrape, process, and load data onto S3 in JSON format. With a producer-consumer architecture, I ensure that the data is in the right format for loading onto S3 by performing minor transformations.
☆29May 2, 2023Updated 3 years ago
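The "minor transformations" step described above could look roughly like the following sketch. It is not taken from the repository: the field names (`symbol`, `price`, `ts`), the output schema, and the S3 key layout are all assumptions made for illustration. The Kafka consumer and the S3 upload are left out so the snippet runs with only the standard library.

```python
import json
from datetime import datetime, timezone


def transform(raw: dict) -> dict:
    """Normalize one scraped record (hypothetical field names)."""
    return {
        # Trim whitespace and upper-case the ticker symbol.
        "symbol": str(raw["symbol"]).strip().upper(),
        # Strip currency formatting such as "$27,345.10" before parsing.
        "price_usd": float(str(raw["price"]).replace("$", "").replace(",", "")),
        # Convert a Unix timestamp (seconds) to an ISO 8601 UTC string.
        "scraped_at": datetime.fromtimestamp(raw["ts"], tz=timezone.utc).isoformat(),
    }


def to_json_line(record: dict) -> bytes:
    """Serialize a record as one UTF-8 JSON line, ready to upload."""
    return (json.dumps(record, sort_keys=True) + "\n").encode("utf-8")


def s3_key(record: dict, prefix: str = "crypto") -> str:
    """Build an object key partitioned by day (assumed layout)."""
    day = record["scraped_at"][:10]  # the YYYY-MM-DD part of the timestamp
    return f"{prefix}/{day}/{record['symbol']}.json"
```

In a pipeline like the one described, the consumer side would call `transform` on each decoded Kafka message and upload the bytes from `to_json_line` under the key from `s3_key`.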
Alternatives and similar repositories for real-time_crypto_data_pipeline_using_kafka
Users that are interested in real-time_crypto_data_pipeline_using_kafka are comparing it to the libraries listed below.
- This project involves an ETL (Extract, Transform, Load) process to analyze sleep data exported from Apple Health☆29Apr 29, 2023Updated 3 years ago
- sql-for-data-engineering-course☆18May 12, 2023Updated 3 years ago
- ☆19May 27, 2023Updated 3 years ago
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆25May 6, 2023Updated 3 years ago
- ☆146Jan 31, 2023Updated 3 years ago
- Apartments Data Pipeline using Airflow and Spark.☆24Mar 28, 2022Updated 4 years ago
- Collection of my favorite Python packages from 2020☆11Jan 12, 2021Updated 5 years ago
- Data pipeline that scrapes Rust cheater Steam profiles☆54Feb 13, 2022Updated 4 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆51Aug 23, 2019Updated 6 years ago
- YouTube tutorial project☆108Oct 17, 2023Updated 2 years ago
- Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped☆49Mar 13, 2026Updated 2 months ago
- A simple CLI tool that deletes files matching an extension within a given directory structure.☆12Sep 27, 2023Updated 2 years ago
- Business challenge that requires building a data platform for retailer data analytics.☆18Feb 19, 2023Updated 3 years ago
- A Hiphop v. Literature project to demonstrate using NLP that Hip-Hop is a form of literature and rap artists are literary geniuses.☆13Nov 13, 2020Updated 5 years ago
- RedditR for Content Engagement and Recommendation☆18Dec 21, 2017Updated 8 years ago
- Capstone Project for the IBM Data Engineering Professional Certification.☆13Mar 7, 2022Updated 4 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Feb 3, 2026Updated 4 months ago
- PySpark Spotify ETL☆17Aug 19, 2021Updated 4 years ago
- A highly scalable microservice to handle WhatsApp, SMS and email-based notifications.☆20Mar 29, 2021Updated 5 years ago
- Insight Data Engineering project: A platform built in HDFS, Spark and Airflow to help you to find social influencers from GitHub Net…☆16May 21, 2024Updated 2 years ago
- Data warehouse implementation for an e-commerce website “Infibeam” that sells digital and consumer electronics.☆23Jan 28, 2018Updated 8 years ago
- A dead simple Java REST API (without Spring) to transfer money between accounts☆15Aug 29, 2019Updated 6 years ago
- A data engineering project with Airflow, dbt, Terraform, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- An interactive platform that enables individuals to share their unique experiences across various stages of life and emotional journeys. …☆14Feb 29, 2024Updated 2 years ago
- Winners solutions for [WNS Analytics Wizard 2018](https://datahack.analyticsvidhya.com/contest/wns-analytics-hackathon-2018/)☆25Dec 13, 2018Updated 7 years ago
- Data Engineering YouTube Analysis Project by Darshil Parmar☆242Dec 8, 2023Updated 2 years ago
- ☆173May 20, 2022Updated 4 years ago
- This project aims to build a traveling recommendation application using Google Places API and OpenAI LLM.☆11Mar 19, 2024Updated 2 years ago
- Deep Learning Projects on TensorFlow and Keras☆20Jun 13, 2024Updated last year
- Develop ML models to predict taxi trip duration in NYC. Ranked : Top 6% | RMSLE : 0.377 (Kaggle) | #DS☆17Jan 7, 2023Updated 3 years ago
- Project on belief embedding☆23Jun 4, 2025Updated last year
- ☆17Feb 9, 2023Updated 3 years ago
- A real-time Reddit data streaming pipeline for sentiment analysis of various subreddits☆148Aug 23, 2023Updated 2 years ago
- ☆215Aug 13, 2023Updated 2 years ago
- ☆343Aug 13, 2024Updated last year
- In this web scraping project, my goal is to extract real-time stock market data from the renowned Yahoo Finance website. By leveraging we…☆13Jun 12, 2023Updated 2 years ago
- E-Learning Platform using MERN stack☆18Jun 18, 2022Updated 3 years ago
- *****PROJECT SPECIFICATION: Machine Learning Capstone Analysis Project***** This capstone project involves machine learning modeling and…☆15Mar 28, 2018Updated 8 years ago
- ☆16Feb 20, 2026Updated 3 months ago