afaqueahmad7117 / spark-experiments
Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews
☆202Dec 31, 2025Updated last month
Alternatives and similar repositories for spark-experiments
Users that are interested in spark-experiments are comparing it to the libraries listed below
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆25May 6, 2023Updated 2 years ago
- Here lies all the pieces of portfolio projects and documents that I have been harvesting throughout the journey of learning Data Analysis…☆11Nov 22, 2023Updated 2 years ago
- Deployed a Kafka instance on an AWS EC2 instance to stream data into Databricks☆10Aug 15, 2023Updated 2 years ago
- Contains Spark DataFrame solutions to LeetCode questions☆25Dec 13, 2022Updated 3 years ago
- ☆16May 23, 2025Updated 8 months ago
- ELT Data Pipeline implementation in Data Warehousing environment☆30May 2, 2025Updated 9 months ago
- ☆15Jul 31, 2022Updated 3 years ago
- I have tried to solve some complex SQL interview questions that have been asked at several companies. Collected these questions from Ankit Ban…☆102May 15, 2022Updated 3 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆22May 30, 2022Updated 3 years ago
- ☆17Jun 23, 2024Updated last year
- PySpark Tutorials and Materials☆18Mar 1, 2021Updated 4 years ago
- A Python package to submit and manage Apache Spark applications on Kubernetes.☆46Aug 9, 2025Updated 6 months ago
- This repository contains my solutions to the top 50 LeetCode SQL challenges implemented using PySpark DataFrame and PySpark SQL.☆27Mar 16, 2024Updated last year
- ☆19Sep 5, 2021Updated 4 years ago
- Learn PySpark from basics to advanced. Check out the YouTube series: [PySpark - Zero to Hero]☆122Sep 7, 2025Updated 5 months ago
- ☆10May 3, 2021Updated 4 years ago
- Machine Learning Engineer interview preparation. Brushing up Data Structures & Algorithms, System Design and SQL☆24Jun 10, 2021Updated 4 years ago
- ☆60Jan 9, 2024Updated 2 years ago
- Repo which holds the materials for the EMR Zero To Hero☆27May 7, 2022Updated 3 years ago
- This project involves an ETL (Extract, Transform, Load) process to analyze sleep data exported from Apple Health☆29Apr 29, 2023Updated 2 years ago
- More than 2,000 data engineer interview questions.☆1,506Jan 13, 2026Updated last month
- Git Repository☆153Jan 9, 2026Updated last month
- Implemented Faster R-CNN on a custom dataset☆22Dec 28, 2020Updated 5 years ago
- Apache Spark interview questions and answers☆21Oct 13, 2020Updated 5 years ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆24Jul 31, 2022Updated 3 years ago
- PySpark Projects☆27Feb 3, 2026Updated last week
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!☆837Apr 16, 2022Updated 3 years ago
- The official repository for the Rock the JVM Spark Optimization with Scala course☆58Dec 4, 2023Updated 2 years ago
- Question Answering task using Deep Learning on SQuAD dataset☆21Dec 8, 2022Updated 3 years ago
- Master Big Data With PySpark and AWS☆132Jun 27, 2023Updated 2 years ago
- Fundamentals of Spark with Python (using PySpark), code examples☆362Oct 29, 2022Updated 3 years ago
- Data Engineering with Google Cloud Platform, published by Packt☆120Sep 20, 2023Updated 2 years ago
- ☆29Jul 29, 2023Updated 2 years ago
- Basic TensorFlow mechanics, operations, class definitions, and neural network building. Examples from deeplearning.ai TensorFlow course …☆35Apr 12, 2019Updated 6 years ago
- PySpark RDD, DataFrame and Dataset examples in Python☆1,342Dec 7, 2025Updated 2 months ago
- Learn various machine learning algorithms like SVC, Decision Tree, Random Forest, Logistic Regression, Linear Regression and much Mo…☆11Jul 31, 2019Updated 6 years ago
- Data sets and ML models versioning example from DVC get started☆10Jun 4, 2024Updated last year
- Unit testing using Databricks Connect☆32Nov 3, 2021Updated 4 years ago
- ☆10Jun 21, 2021Updated 4 years ago