⭐13 · Feb 18, 2022 · Updated 4 years ago
Alternatives and similar repositories for Data_Engineering_Essentials_Hands_on_SQL_Python_and_Spark
Users that are interested in Data_Engineering_Essentials_Hands_on_SQL_Python_and_Spark are comparing it to the libraries listed below.
- Source code for 'Up and Running with DAX for Power BI' by Alison Box ⭐12 · Jun 10, 2022 · Updated 3 years ago
- 🟣 Azure interview questions and answers to help you prepare for your next technical interview in 2026. ⭐29 · Jan 4, 2026 · Updated 3 months ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog… ⭐13 · Jun 26, 2022 · Updated 3 years ago
- An Airflow plugin, providing an admin UI to conveniently start backfills. Usable with Airflow 1, 2 and Cloud Composer ⭐14 · Aug 16, 2022 · Updated 3 years ago
- ⭐18 · Aug 15, 2022 · Updated 3 years ago
- Gathers Kubernetes cost information for a cluster ⭐13 · Dec 18, 2018 · Updated 7 years ago
- Apache Spark using SQL ⭐14 · Aug 18, 2021 · Updated 4 years ago
- ⭐12 · Jul 27, 2021 · Updated 4 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks. ⭐13 · Oct 15, 2020 · Updated 5 years ago
- A Flask web app that integrates Tesseract OCR to extract text from image files. ⭐10 · May 14, 2023 · Updated 2 years ago
- OpenCV code to extract face and name from government issued ID cards ⭐13 · Dec 27, 2015 · Updated 10 years ago
- ⭐11 · Aug 15, 2025 · Updated 7 months ago
- Extract, transform, and load data for analytic processing using AWS Glue ⭐17 · May 2, 2021 · Updated 4 years ago
- A tool to create Airflow RBAC roles with dag-level permissions from cli. ⭐13 · Sep 7, 2023 · Updated 2 years ago
- ACK service controller for Amazon Elastic Container Registry (ECR) ⭐17 · Mar 11, 2026 · Updated last month
- Extract text data from documents using OCR (optical character recognition) technology and NER (named entity recognition). ⭐10 · May 11, 2023 · Updated 2 years ago
- ⭐18 · Nov 16, 2018 · Updated 7 years ago
- Turning raw kickstarter text data => Campaign predictions using SpaCy, Scikit-learn, SQLAlchemy, SQLite3 & XGBoost Classifier (feat eng =… ⭐16 · Feb 26, 2021 · Updated 5 years ago
- Public GitHub repo for SciPy 2022 tutorial (Introduction to Numerical Computing With NumPy) ⭐14 · Aug 24, 2022 · Updated 3 years ago
- ⭐14 · Mar 11, 2023 · Updated 3 years ago
- ⭐17 · Updated this week
- Backstage plugins collections for dotnet ⭐15 · Mar 26, 2023 · Updated 3 years ago
- Module for pipelines concept in PySpark ⭐16 · Mar 27, 2024 · Updated 2 years ago
- Tag highlight in neovim written in lua ⭐22 · Mar 18, 2026 · Updated 3 weeks ago
- Functional Data Engineering tutorial in Python & Airflow. ⭐17 · Mar 24, 2023 · Updated 3 years ago
- ETL processing toolset with SQL-like language and GIS capabilities, built on core Spark. Extensible and modular. REPL included ⭐16 · Jan 26, 2026 · Updated 2 months ago
- ⭐17 · May 16, 2020 · Updated 5 years ago
- Zsh goodies for MacOS users ⭐24 · Oct 7, 2024 · Updated last year
- Project is in active development and has been moved to https://repository.datamart.ru/datamarts/prostore. ⭐17 · Apr 22, 2022 · Updated 3 years ago
- ⭐21 · May 13, 2025 · Updated 10 months ago
- Notebooks/materials on Big Data with PySpark skill track from datacamp (primarily). Also, contains books/cheat-sheets. ⭐14 · Mar 4, 2022 · Updated 4 years ago
- Overview of resources for analyzing the games, working with the data and showcasing applications of the broadcast tracking data. ⭐19 · Jun 18, 2021 · Updated 4 years ago
- Vim home ⭐24 · May 31, 2022 · Updated 3 years ago
- [Late Submission] Solution for Kuzushiji recognition (Kaggle competition) ⭐18 · Jun 9, 2021 · Updated 4 years ago
- ⭐29 · Jan 22, 2021 · Updated 5 years ago
- Course materials for Airflow 101 ⭐15 · Jun 15, 2020 · Updated 5 years ago
- Prometheus and Grafana for Infra Monitoring and Visualisation ⭐31 · Sep 18, 2022 · Updated 3 years ago
- ⭐12 · Mar 15, 2025 · Updated last year
- A tutorial on building a real-time data streaming application pipeline with Apache Kafka 🔥🔥🔥 ⭐24 · Apr 29, 2022 · Updated 3 years ago