NitinDatta8 / realtime-data-streaming
End-to-end data engineering pipeline with various technologies to ingest real-time data.
☆19 Updated 2 years ago
Alternatives and similar repositories for realtime-data-streaming
Users who are interested in realtime-data-streaming are comparing it to the repositories listed below.
- Hands-on MLOps projects to explore and learn the practical aspects of machine learning engineering for production. ☆91 Updated 8 months ago
- End-to-end Data Project (DA/DS/DE/MLOps) - retail/e-commerce - interpretable dynamic clustering ☆15 Updated 5 months ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka… ☆295 Updated 10 months ago
- ☆102 Updated 10 months ago
- ☆1,426 Updated 3 years ago
- Cool DE Projects ☆46 Updated last week
- Data Engineering portfolio projects, resources used to study data tools... ☆28 Updated last year
- Learn PySpark from Basics to Advanced. Check out the YouTube series: [PySpark - Zero to Hero] ☆111 Updated 3 months ago
- ☆299 Updated last year
- End-to-end data engineering project with Kafka, Airflow, Spark, Postgres and Docker. ☆107 Updated 8 months ago
- Production ML rental prediction system. ☆49 Updated last year
- Enrolled in DataTalks Zoomcamp https://github.com/DataTalksClub/mlops-zoomcamp ☆20 Updated 3 years ago
- ☆108 Updated 2 years ago
- More than 2,000 data engineer interview questions. ☆1,467 Updated 3 months ago
- An ETL pipeline that extracts weather and air quality data from public APIs, transforms the data into a clean, analyzable format, and loa… ☆31 Updated last year
- This repo demonstrates the development of a real-time data pipeline designed to ingest, process, and analyze stock market data. Using cut… ☆47 Updated last year
- This repo contains all the code used in the Python for Data Engineering Course ☆323 Updated last year
- Personal Data Engineering Projects ☆967 Updated 2 years ago
- Machine Learning In Production (MLOps) ☆222 Updated 2 weeks ago
- This repository is to show my Data Analytics & Engineering skills, share projects, and track my progress. ☆60 Updated 2 years ago
- ☆21 Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh… ☆168 Updated 2 years ago
- This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, component… ☆36 Updated last year
- ☆365 Updated last year
- The web page for DataTalks.Club, a global online community of data enthusiasts ☆237 Updated this week
- Roadmap for Data Engineering ☆240 Updated last year
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more ☆370 Updated 2 years ago
- Code/Notes for the Data Engineering Zoomcamp by DataTalksClub ☆32 Updated 2 years ago
- ☆56 Updated last year
- Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake ☆245 Updated 5 months ago