β13Feb 18, 2022Updated 4 years ago
Alternatives and similar repositories for Data_Engineering_Essentials_Hands_on_SQL_Python_and_Spark
Users that are interested in Data_Engineering_Essentials_Hands_on_SQL_Python_and_Spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for 'Up and Running with DAX for Power BI' by Alison Boxβ12Jun 10, 2022Updated 3 years ago
- π£ Azure interview questions and answers to help you prepare for your next technical interview in 2026.β30Jan 4, 2026Updated 4 months ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technologβ¦β13Jun 26, 2022Updated 3 years ago
- β13Jun 7, 2024Updated last year
- β18Aug 15, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- β12Jul 27, 2021Updated 4 years ago
- Apache Spark using SQLβ14Aug 18, 2021Updated 4 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.β13Oct 15, 2020Updated 5 years ago
- OpenCV code to extract face and name from government issued ID cardsβ13Dec 27, 2015Updated 10 years ago
- β11Aug 15, 2025Updated 9 months ago
- Extract, transform, and load data for analytic processing using AWS Glueβ17May 2, 2021Updated 5 years ago
- Arabic OCR OCR system for Arabic language that converts images (multi-fonts) of typed text to machine-encoded text. The system currently β¦β10Oct 12, 2021Updated 4 years ago
- A tool for improving the output of generic Arabic OCR systems using an n-gram based post-correction approach.β10Sep 22, 2021Updated 4 years ago
- Social Media Analysis, scalable solution, flexible deployment that analyses social media contentsβ10Jul 20, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- β18Nov 16, 2018Updated 7 years ago
- Building pipeline to process the real-time data using Spark and Mongodb.β12Oct 30, 2019Updated 6 years ago
- Public GitHub repo for SciPy 2022 tutorial (Introduction to Numerical Computing With NumPy)β13Aug 24, 2022Updated 3 years ago
- β14Mar 11, 2023Updated 3 years ago
- β17Apr 17, 2026Updated last month
- A code sample that allows you to send a payload from the Twitter API to Google Sheets.β18Mar 23, 2021Updated 5 years ago
- Functional Data Engineering tutorial in Python & Airflow.β17Mar 24, 2023Updated 3 years ago
- ETL processing toolset with SQL-like language and GIS capabilities, built on core Spark. Extensible and modular. REPL includedβ16May 12, 2026Updated last week
- β15May 7, 2025Updated last year
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Minimal packages to demonstrate usage of ros1_bridge with custom message typesβ16Oct 31, 2019Updated 6 years ago
- β17May 16, 2020Updated 6 years ago
- Project is in active development and has been moved to https://repository.datamart.ru/datamarts/prostore.β17Apr 22, 2022Updated 4 years ago
- Notebooks/materials on Big Data with PySpark skill track from datacamp (primarily). Also, contains books/cheat-sheets.β14Mar 4, 2022Updated 4 years ago
- WIP - Scaling Spark Data Platform with EKS. The solution uses Karpenter and Cluster Autoscaler, Yunikorn for advanced scheduling.β16May 9, 2023Updated 3 years ago
- overview into resources for analyzing the games, working with the data and showcasing applications of the broadcast tracking data.β19Jun 18, 2021Updated 4 years ago
- Vim homeβ24May 31, 2022Updated 3 years ago
- This repository is for demonstrating the capability to do SQL-based UPDATES, DELETES, and INSERTS directly in the Data Lake using Amazon β¦β18Aug 25, 2021Updated 4 years ago
- A simple OCR labeling tool built on Flask and PyQt5β24Dec 6, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ΠΠ°ΡΠ΅ΡΠΈΠ°Π»Ρ ΠΊΡΡΡΠ° Airflow 101β15Jun 15, 2020Updated 5 years ago
- β30Aug 17, 2024Updated last year
- β12Mar 15, 2025Updated last year
- A tutorial on building a real-time data streaming application pipeline with Apache Kafkaπ₯π₯π₯β24Apr 29, 2022Updated 4 years ago
- TF2 implementation of Text recognition CRNNβ13Sep 27, 2019Updated 6 years ago
- Vue School Free Weekend Courses (2023 version)β20Sep 28, 2024Updated last year
- π£ Apache Spark interview questions and answers to help you prepare for your next machine learning and data science interview in 2026.β35Jan 4, 2026Updated 4 months ago