NitinDatta8 / realtime-data-streamingLinks
End-to-end data engineering pipeline with various technologies to ingest real time data.
☆24Updated 2 years ago
Alternatives and similar repositories for realtime-data-streaming
Users that are interested in realtime-data-streaming are comparing it to the libraries listed below
Sorting:
- Data Engineering portfolio projects, resources used to study data tools...☆30Updated last year
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆312Updated 11 months ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆203Updated 2 years ago
- End-to-end Data Project (DA/DS/DE/MLOps) - retail/e-commerce - interpretable dynamic clustering☆15Updated 7 months ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆108Updated last month
- Cool DE Projects☆57Updated last month
- ☆12Updated last year
- Learn PySpark from Basics to Advanced. Checkout the YouTube Series : [PySpark - Zero to Hero]☆122Updated 5 months ago
- ☆21Updated last year
- Hands-on MLOps projects to explore and learn the practical aspects of machine learning engineering for production.☆96Updated 10 months ago
- ☆316Updated last year
- ☆107Updated 2 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆44Updated 2 years ago
- Production ML rental prediction system.☆50Updated last year
- This repo demonstrates the development of a real-time data pipeline designed to ingest, process, and analyze stock market data. Using cut…☆48Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆45Updated 2 years ago
- This repo contains all the code used in the Python for Data Engineering Course☆336Updated last year
- Nyc_Taxi_Data_Pipeline - DE Project☆136Updated last year
- ☆90Updated last year
- ☆106Updated last year
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆28Updated 2 years ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆24Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆202Updated last month
- ☆61Updated 2 weeks ago
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆38Updated 2 years ago
- Enrolled in DataTalks Zoomcamp https://github.com/DataTalksClub/mlops-zoomcamp☆20Updated 3 years ago
- This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, component…☆37Updated last year
- ☆24Updated 2 years ago
- This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science t…☆137Updated last year
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 3 years ago