Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.
☆13Jul 16, 2019Updated 6 years ago
Alternatives and similar repositories for Data-Engineer-Nano-Degree
Users that are interested in Data-Engineer-Nano-Degree are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data Quest - Data Engineer Learning and Projects☆24May 29, 2019Updated 6 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Nov 22, 2021Updated 4 years ago
- ( These solutions tested on 4 node Hortonwork cluster on my laptop. Do not test on your production environment until you test... :)☆20Apr 18, 2020Updated 5 years ago
- Pyspark Spotify ETL☆17Aug 19, 2021Updated 4 years ago
- This project deals with vulnerability analysis and classification using machine learning techniques i.e. Natural Language Processing.☆10Feb 21, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆10Dec 5, 2017Updated 8 years ago
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- This repo has some proposed agenda for Azure Machine Learning related hands-on workshops.☆11Feb 2, 2021Updated 5 years ago
- assignments from courses at udacity☆10Mar 26, 2018Updated 8 years ago
- Capstone project for Galvanize - Data Science Immersive. 'Project Plotline' looks at the emotional content of movie scripts (web scraping…☆16Sep 27, 2016Updated 9 years ago
- Classification problem to predict loan defaulters using Lending Club Dataset☆11Jan 26, 2019Updated 7 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆17Oct 1, 2019Updated 6 years ago
- A simple php toolbox to interact with the Microsoft Azure Search Service REST API.☆11Feb 2, 2023Updated 3 years ago
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥☆11Apr 17, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆10May 24, 2021Updated 4 years ago
- A curated list of awesome Databricks resources, including Spark☆22Jun 28, 2024Updated last year
- Udacity Data Engineering Nanodegree Projects☆11Sep 5, 2019Updated 6 years ago
- A from scratch Python implementation of Apache Kafka concepts including producers, brokers, topics, consumers, and offset management, bui…☆23Jul 29, 2025Updated 8 months ago
- Data Engineering Project at Insight☆15Nov 17, 2015Updated 10 years ago
- Repository that explains how the AutoML pipeline can be used for the Richter's Predictor competition on DrivenData☆10May 2, 2019Updated 6 years ago
- Projects done in the Data Engineering Nanodegree by Udacity.com☆272Mar 1, 2026Updated 3 weeks ago
- This repo is for the Linkedin Learning course: Predictive Customer Analytics☆11May 5, 2025Updated 10 months ago
- Load testing for event analytics platforms (Snowplow, more coming soon)☆13May 17, 2016Updated 9 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆21Jan 22, 2020Updated 6 years ago
- This repository aims to onboard new users into Modeling in SAP Data Warehouse Cloud in the most practical manner. For that you will build…☆17Feb 2, 2024Updated 2 years ago
- 鲁伟《机器学习公式推导与代码实现》。整体对算法的分类是亮点。算法原理和代码实现也相对简单,可以和《机器学习实战》对比起来看。☆11Oct 19, 2022Updated 3 years ago
- AlvinToh Learning Repository for The Ultimate Hands-On Hadoop - Tame your Big Data!☆10May 23, 2018Updated 7 years ago
- Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.☆15Sep 10, 2019Updated 6 years ago
- files created in ardan labs golang training☆12Nov 8, 2023Updated 2 years ago
- Deep Learning Udacity Nanodegree - SageMaker Deployment of a Sentiment Analysis model☆10Apr 14, 2019Updated 6 years ago
- code, labs and lectures for the course☆48Apr 16, 2023Updated 2 years ago
- Simple ETL pipeline using Python☆29May 22, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆14Jan 22, 2019Updated 7 years ago
- An example project that implements a data pipeline using Scala, Akka, and Spark and works with document-oriented and graph databases to l…☆11Aug 9, 2019Updated 6 years ago
- Women With HRT Bookbuilder Workshop☆16May 20, 2021Updated 4 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆34Oct 14, 2019Updated 6 years ago
- ☆11May 4, 2022Updated 3 years ago
- You can automate the process of building, testing, delivering, or deploying your Machine Learning models into production using GitHub Act…☆12Jun 13, 2020Updated 5 years ago
- Data Analysis with Python - Customer Segmentation ( RFM Analysis) - Power BI Dashboard - Tableau Dashboard☆12Feb 16, 2021Updated 5 years ago