This repository focuses on providing interview scenario questions that I have encountered during interviews. The questions are designed to simulate real-world scenarios and test your problem-solving and technical skills. By exploring these scenarios, you can gain insights into common interview topics and prepare yourself for similar challenges.
☆50Feb 11, 2025Updated last year
Alternatives and similar repositories for interview-scenerios-spark-sql
Users that are interested in interview-scenerios-spark-sql are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a repo with links to everything you'd ever want to learn about data engineering☆11Dec 3, 2024Updated last year
- ☆12Jan 14, 2023Updated 3 years ago
- Distributed Data Systems with Azure Databricks, published by Packt☆12Jan 18, 2023Updated 3 years ago
- My Leetcode Solutions☆13Sep 2, 2025Updated 7 months ago
- ☆17Feb 1, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Kafka-Notes☆15Jun 20, 2021Updated 4 years ago
- Provision AWS infrastructure using Terraform (By HashiCorp): an example of web application logging customer data☆12Dec 19, 2025Updated 3 months ago
- Collection of Databricks and Jupyter Notebooks☆22Feb 9, 2026Updated 2 months ago
- ☆11Jul 26, 2020Updated 5 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆28Jul 8, 2019Updated 6 years ago
- Analytics Vidhya Janata Hack Time Series #27th Solution☆12May 27, 2020Updated 5 years ago
- ☆28Aug 29, 2022Updated 3 years ago
- The dataset is of a Global Pharmacy Company. The dataset comprises of Historical sales, Product Information and products which need forec…☆28Aug 27, 2019Updated 6 years ago
- ❓❓ Does anybody know that Python is an object-oriented programming language? Learn all about OOP in Python with real-world examples. ✔☆33Aug 22, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆38Aug 11, 2024Updated last year
- ☆31Feb 6, 2025Updated last year
- Practice your Pyspark skills!☆105Oct 22, 2021Updated 4 years ago
- ☆18Apr 25, 2021Updated 4 years ago
- ☆31May 15, 2024Updated last year
- A blockchain implementation in Python☆16Dec 8, 2022Updated 3 years ago
- More than 2000+ Data engineer interview questions.☆1,573Jan 13, 2026Updated 3 months ago
- Our work on Reinforcement learning that we share with the rest of the world☆13Jan 7, 2019Updated 7 years ago
- Basic yet complete Machine Learning pipeline for NLP tasks☆25Sep 20, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Advanced SQL - Discover sequential, step-by-step explanations and solutions, accompanied by the necessary database creation codes, availa…☆26Sep 13, 2023Updated 2 years ago
- Open MMLab Detection Toolbox with PyTorch☆12Jun 11, 2019Updated 6 years ago
- Complete high-quality practice tests of 50 questions each will help you master your Confluent Certified Developer for Apache Kafka (CCDAK…☆39Jun 30, 2020Updated 5 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University☆166Dec 4, 2025Updated 4 months ago
- Data engineering interviews Q&A for data community by data community☆66Jun 7, 2020Updated 5 years ago
- Submissions for HackX Hackathon by Scaler Academy☆10Oct 1, 2023Updated 2 years ago
- Jumping into C++ Practice Problems☆10Aug 6, 2017Updated 8 years ago
- PySpark Cheatsheet☆36Jan 18, 2023Updated 3 years ago
- ETL pipeline using pyspark (Spark - Python)☆118Apr 4, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>☆13Nov 16, 2021Updated 4 years ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.☆236Aug 11, 2024Updated last year
- papers about reinforcement learning☆13Jan 4, 2021Updated 5 years ago
- All my projects on Big Data are provided☆27Dec 5, 2016Updated 9 years ago
- ☆51Sep 6, 2024Updated last year
- ☆14Apr 28, 2019Updated 6 years ago
- ☆17Nov 22, 2022Updated 3 years ago