nmolivo / dataquest_eng
Here's how to get the content of DataQuest's Data Engineering Track missions working on your localhost. Using data from my Valenbisi ARIMA modeling project, I document the steps I took with PostgreSQL, Postico, and the command line to get the DataQuest exercises running from a Jupyter Notebook.
☆15Updated 6 years ago
Related projects
Alternatives and complementary repositories for dataquest_eng
- Data Quest - Data Engineer Learning and Projects☆24Updated 5 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆33Updated 5 years ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- Build data models, data warehouses, and data lakes; automate data pipelines; and work with massive datasets.☆13Updated 5 years ago
- ☆19Updated 6 years ago
- Jordan Cheah's Data Science & Data Engineering Portfolio☆27Updated 8 years ago
- Course on Udemy by Jose Portilla☆97Updated 6 years ago
- AWS Big Data Certification☆25Updated last year
- Notebooks produced throughout Udacity's Data Engineering Nanodegree course☆72Updated 4 years ago
- PySpark Code for Hands-on Learners☆114Updated 5 years ago
- This repository hosts the code, projects, demos, and slides for Big Data technologies under the Apache Hadoop and Apache Spark umbrella.☆42Updated 2 years ago
- Python Notes on IPython Notebook files.☆37Updated 3 years ago
- A repo to track data engineering projects☆13Updated 2 years ago
- Lab for Linear and Logistic Regression, SciKit Learn☆41Updated 6 years ago
- Repository used for Spark Trainings☆53Updated last year
- A complete daily plan for studying to become a machine learning engineer.☆50Updated 8 years ago
- Educational notes and hands-on problems with solutions for the Hadoop ecosystem☆86Updated 5 years ago
- (These solutions were tested on a 4-node Hortonworks cluster on my laptop. Do not test on your production environment until you test... :)☆20Updated 4 years ago
- In the Data Science and Engineering program, engineering professionals combine the skills of software programmer, database manager, and s…☆27Updated 7 years ago
- Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt☆118Updated last year
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Updated 2 years ago
- Data science portfolio examples.☆52Updated 4 years ago
- Code for my blogs on Data Engineering☆15Updated 4 years ago
- Learn Machine Learning using PySpark from scratch☆19Updated 5 years ago
- PySpark Cookbook, published by Packt☆89Updated last year
- Apache Spark using SQL☆14Updated 3 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆16Updated 6 years ago
- ☆32Updated 8 months ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database☆13Updated 3 years ago
- Sharing interesting and noteworthy Data Engineering content☆65Updated 8 years ago