Just starting your DE journey or along the way already?. I will be sharing a short list of DATA-ENGINEERING-CENTRED books that covers the job roles of a data engineer with an overall view of the required skill set and importantly ,it brings coding in DE to you.
☆34Jul 4, 2022Updated 3 years ago
Alternatives and similar repositories for data-engineering-book-reviews
Users that are interested in data-engineering-book-reviews are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This project is for demonstrating knowledge of Data Engineering tools and concepts and also learning in the process☆44Dec 1, 2022Updated 3 years ago
- This repo contains all the material developed during the 9-week bootcamp provided by DPhi in colaboration with DataTalks Club☆22Aug 2, 2022Updated 3 years ago
- Observability Python library - Powered by Kensu☆22Oct 15, 2024Updated last year
- 📟 Logging utilities for spaCy☆12Nov 3, 2023Updated 2 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog…☆13Jun 26, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆11Mar 11, 2022Updated 4 years ago
- Sadnbox of Spark-notebook☆10Mar 19, 2016Updated 10 years ago
- ☆10Nov 28, 2022Updated 3 years ago
- A benchmark to see how many flops your kit can do☆11May 27, 2023Updated 3 years ago
- ☆12Oct 31, 2023Updated 2 years ago
- This project collects the map assets (Shapefiles and GeoJSON) that were used for the "Manifest Destiny" Visualization (http://michaelpora…☆24Oct 25, 2012Updated 13 years ago
- Surfalytics projces on Data Engineering and Analytics☆122Apr 5, 2026Updated last month
- The repository contains all the work including projects, notes, and articles related to ML Engineering while I am learning.☆10Dec 4, 2022Updated 3 years ago
- Docker containers with Apache Accumulo and Apache Spark environment.☆12Jan 22, 2016Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆10Oct 12, 2022Updated 3 years ago
- ☆11Jan 24, 2023Updated 3 years ago
- Spending One Hundred days on blogging about cloud computing☆13Jul 12, 2022Updated 3 years ago
- computer vision : global offensive☆12Jun 15, 2020Updated 5 years ago
- real time log event processing using spark, kafka & cassandra☆13Dec 4, 2014Updated 11 years ago
- Autotrader.co.uk data scraper☆14May 18, 2020Updated 6 years ago
- sentiment analysis using spacy☆11Nov 22, 2021Updated 4 years ago
- Script para ingestão de dados do Mercado Bitcoin☆11Jun 29, 2023Updated 2 years ago
- Want to get notified on the progress of your TensorFlow model training? Enter, a TensorFlow Keras callback to send notifications on the m…☆12Dec 28, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Essential PySpark for Scalable Data Analytics, published by Packt☆46Apr 22, 2026Updated last month
- Process landsat imagery on EMR, serve them out to a web application that does NDVI/NDWI on the fly☆13Dec 12, 2017Updated 8 years ago
- ☆13Dec 30, 2022Updated 3 years ago
- ☆15Apr 8, 2026Updated last month
- TrafficAdvisor: a Real-Time Traffic Monitoring System☆14Sep 10, 2018Updated 7 years ago
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14May 20, 2016Updated 10 years ago
- We fake out your troubles.☆14Oct 14, 2017Updated 8 years ago
- A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase☆13Jul 9, 2024Updated last year
- Capstone Project: Predicting default in P2P lending☆13Feb 27, 2017Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- High performance Golang HTTP middleware for server-side application layer caching, ideal for REST APIs, using Echo framework.☆15Sep 27, 2023Updated 2 years ago
- Powerful, developer-experience centric, blazingly fast and extensible job scheduler and workflow orchestration platform☆62May 17, 2026Updated last week
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 3 years ago
- Awesome cheatsheets for Data Science☆12Sep 16, 2019Updated 6 years ago
- Airflow Examples: code samples for Medium articles☆14Jan 10, 2021Updated 5 years ago
- Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.☆57Jan 3, 2023Updated 3 years ago
- ☆15Mar 23, 2026Updated 2 months ago