needmukesh / Hadoop-Books
☆37Updated 6 years ago
Alternatives and similar repositories for Hadoop-Books:
Users that are interested in Hadoop-Books are comparing it to the libraries listed below
- My documents for self-learning fundamental of Data engineering skills☆12Updated last year
- All of my individual learning materials, documents, and notes from the process of getting the Coursera IBM Data Engineer Professional Cer…☆86Updated 2 years ago
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like…☆103Updated last year
- own way of studying data science, machine learning and AI (Python)☆91Updated last year
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆237Updated last month
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆338Updated last year
- ☆346Updated last year
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆86Updated last week
- ☆12Updated 3 years ago
- ☆151Updated 2 years ago
- Building ETL Pipelines with Python☆129Updated 8 months ago
- Nyc_Taxi_Data_Pipeline - DE Project☆103Updated 5 months ago
- Roadmap for Data Engineering☆226Updated 9 months ago
- Data Engineering with AWS, 2nd edition - Published by Packt☆136Updated last year
- Sample code and documentation for very basic things that I can't remember but want to aggregate in one place☆14Updated 3 years ago
- Data Engineering with Google Cloud Platform, published by Packt☆114Updated last year
- This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksCl…☆98Updated 7 months ago
- Tiểu Luận Chuyên Ngành☆11Updated 8 months ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆142Updated 4 years ago
- End-to-end Data Project (DA/DS/DE/MLOps) - retail/e-commerce - interpretable dynamic clustering☆14Updated 3 months ago
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here.☆103Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆72Updated 9 months ago
- Code for "Efficient Data Processing in Spark" Course☆290Updated 6 months ago
- Master's thesis on Big Data☆34Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆149Updated 2 years ago
- Simple stream processing pipeline☆99Updated 9 months ago
- ☆32Updated 3 years ago
- Interview Materials, Books and more.....☆15Updated last year
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆136Updated last year
- An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Az…☆23Updated last year