hemant-rout / BigData
☆33Updated 2 months ago
Alternatives and similar repositories for BigData
Users that are interested in BigData are comparing it to the libraries listed below
Sorting:
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆129Updated 11 months ago
- Step by step instructions to create a production-ready data pipeline☆50Updated 4 months ago
- ☆28Updated last year
- Learn PySpark from Basics to Advanced. Checkout the YouTube Series : [PySpark - Zero to Hero]☆57Updated 4 months ago
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like…☆112Updated last year
- I will attempt to create my own spotify wrapped by collecting data from the spotify API, perform transformations and create informative d…☆74Updated 2 years ago
- Contains spark dataframe solutions of leetcode questions☆25Updated 2 years ago
- YouTube tutorial project☆102Updated last year
- This is an all-in-one repository for Data Engineers, ideal for beginners & interview preparation, which includes Python as the main Progr…☆29Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆76Updated 11 months ago
- ☆41Updated 2 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆139Updated last year
- ☆139Updated 2 years ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆144Updated 9 months ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆17Updated 2 years ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆137Updated last year
- Django-based course management platform for Zoomcamps☆67Updated this week
- Data Engineering YouTube Analysis Project by Darshil Parmar☆195Updated last year
- Recohut - Learn data engineering, data science☆97Updated last year
- Code for "Advanced data transformations in SQL" free live workshop☆81Updated last week
- Data Engineering with AWS, 2nd edition - Published by Packt☆144Updated last year
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- Repository related to Spark SQL and Pyspark using Python3☆37Updated 2 years ago
- ☆51Updated last year
- Cracking Data Engineering Interview Guide, published by Packt☆41Updated last year
- ☆195Updated last year
- ☆10Updated 2 weeks ago
- Repository for Spark using Python material. It is popularly known as PySpark.☆20Updated 3 years ago
- Sample repo for startdataengineering DE 101 free course☆62Updated 10 months ago
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here.☆105Updated 2 years ago