cathydo178 / Data-Engineering-Projects
Design data models, build data warehouses and data lakes, automate data pipelines, and work with massive datasets
☆7 · Updated 5 years ago
Alternatives and similar repositories for Data-Engineering-Projects
Users that are interested in Data-Engineering-Projects are comparing it to the libraries listed below
- A repo to track data engineering projects ☆13 · Updated 2 years ago
- Apache Spark using SQL ☆14 · Updated 3 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets. ☆13 · Updated 5 years ago
- Machine Learning and Data Analysis Case Studies using Spark. ☆72 · Updated 4 years ago
- My solutions for the Udacity Data Engineering Nanodegree ☆34 · Updated 5 years ago
- Udacity Data Engineering Nanodegree Projects ☆11 · Updated 5 years ago
- Data Quest - Data Engineer Learning and Projects ☆24 · Updated 6 years ago
- Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA mode… ☆15 · Updated 6 years ago
- Learn to build a data pipeline with Airflow to automate wrangling data - An Udacity Data Engineer Nano Degree Project ☆8 · Updated 5 years ago
- A complete daily plan for studying to become a machine learning engineer. ☆51 · Updated 8 years ago
- A curated list of repositories for my book Machine Learning Solutions. ☆78 · Updated 7 years ago
- Data engineering interviews Q&A for data community by data community ☆63 · Updated 5 years ago
- ☆18 · Updated 7 years ago
- Repository for Spark using Python material. It is popularly known as PySpark. ☆20 · Updated 3 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University ☆158 · Updated 6 months ago
- Apache Spark Interview Question and Answers ☆21 · Updated 4 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups ☆16 · Updated 6 years ago
- Repository used for Spark Trainings ☆53 · Updated 2 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella. ☆42 · Updated 2 years ago
- RedditR for Content Engagement and Recommendation ☆21 · Updated 7 years ago
- machine learning and deep learning tutorials, articles and other resources ☆41 · Updated 8 years ago
- notebooks produced throughout the Udacity's Nanodegree Data Engineering Course ☆73 · Updated 4 years ago
- Codes, notes and guides on Udacity's machine learning nanodegree. ☆83 · Updated 8 years ago
- ETL pipeline using pyspark (Spark - Python) ☆116 · Updated 5 years ago
- ☆31 · Updated 6 years ago
- Lecture notes, lab notes, and links to helpful resources to pass Google Certification Exam for Professional Data Engineer. ☆18 · Updated 2 years ago
- Sharing interesting and noteworthy Data Engineering content ☆68 · Updated 8 years ago
- Repository for the Zero to Deep Learning™ Video Course on Udemy ☆32 · Updated 8 years ago
- Course project for Practical Machine Learning: https://www.coursera.org/course/predmachlearn ☆13 · Updated 10 years ago
- Teaching notes from my Advanced SQL workshops as local lead instructor at General Assembly New York. The first edition was created for th… ☆18 · Updated 5 years ago