hemant-rout / BigDataLinks
☆39 Updated 11 months ago
Alternatives and similar repositories for BigData
Users who are interested in BigData are comparing it to the repositories listed below
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka… ☆310 Updated 11 months ago
- I'm partaking in a Data Engineering Bootcamp / Zoomcamp. I'll store files and progress here. ☆111 Updated 3 years ago
- Learn PySpark from Basics to Advanced. Check out the YouTube series: [PySpark - Zero to Hero] ☆120 Updated 4 months ago
- Apache Spark 3 - Spark Programming in Python for Beginners ☆514 Updated last year
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like… ☆141 Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews ☆200 Updated last month
- ☆41 Updated 2 years ago
- ☆525 Updated 4 years ago
- ☆369 Updated 2 years ago
- Data Engineering with AWS, 2nd edition - Published by Packt ☆168 Updated 2 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra ☆144 Updated 2 years ago
- The resources of the preparation course for Databricks Data Engineer Associate certification exam ☆582 Updated last month
- YouTube tutorial project ☆107 Updated 2 years ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs. ☆220 Updated last year
- ☆59 Updated 2 years ago
- ☆30 Updated 2 years ago
- ☆172 Updated last year
- ☆32 Updated 3 years ago
- Data Engineering with Databricks Cookbook, published by Packt ☆127 Updated last year
- Sample repo for startdataengineering DE 101 free course ☆74 Updated last year
- Django-based course management platform for Zoomcamps ☆78 Updated this week
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster ☆488 Updated last year
- The code from the Machine Learning Bookcamp book ☆515 Updated 5 months ago
- Learn by doing: DIY project groups at DataTalks.Club ☆415 Updated last year
- Data Engineering with Python, published by Packt ☆780 Updated 3 years ago
- own way of studying data science, machine learning and AI (Python) ☆111 Updated 2 years ago
- All important Python tools a Data Engineer needs ☆27 Updated last year
- This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksCl… ☆102 Updated last year
- Projects done in the Data Engineer Nanodegree Program by Udacity.com ☆164 Updated 3 years ago
- The web page for DataTalks.Club, a global online community of data enthusiasts ☆256 Updated this week