chandu-muthyala / Data-Engineer-Nano-Degree
Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.
☆13Updated 5 years ago
Alternatives and similar repositories for Data-Engineer-Nano-Degree:
Users that are interested in Data-Engineer-Nano-Degree are comparing it to the libraries listed below
- A repo to track data engineering projects☆13Updated 2 years ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆33Updated 5 years ago
- Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA mode…☆15Updated 6 years ago
- Projects submitted as part of working through udacity's data engineering nanodegree.☆9Updated 5 years ago
- ☆26Updated 3 years ago
- Applying automated feature engineering to the Kaggle Home Credit Default Risk Competition☆19Updated 6 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 9 years ago
- Machine Learning and Data Analysis Case Studies using Spark.☆72Updated 4 years ago
- ☆33Updated last year
- ☆18Updated 6 years ago
- PySpark Cookbook, published by Packt☆91Updated 2 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆16Updated 6 years ago
- ☆7Updated 6 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Updated 3 years ago
- Data science blog☆33Updated 6 years ago
- Repository used for Spark Trainings☆53Updated 2 years ago
- Springboard - Data Science Intensive course☆13Updated 8 years ago
- ☆18Updated 3 years ago
- AWS Big Data Certification☆25Updated 3 months ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆38Updated 9 months ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 5 years ago
- ☆46Updated 3 years ago
- Code for my blogs on Data Engineering☆15Updated 4 years ago
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa…☆15Updated last year
- Apache Spark using SQL☆14Updated 3 years ago
- ☆26Updated 5 years ago
- Pytest for Data Science Beginners☆58Updated 6 years ago
- E-Commerce Website A/B testing: Recommend which of two landing pages to keep based on A/B testing☆23Updated 7 years ago
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more☆19Updated 2 years ago