salahdev8 / NYYellowTaxiProjectLinks
Big Data project using Hadoop (MapReduce, spark, Hive)
☆32Updated 6 years ago
Alternatives and similar repositories for NYYellowTaxiProject
Users that are interested in NYYellowTaxiProject are comparing it to the libraries listed below
Sorting:
- This is a guided certification project, as a part of Data Science for Social Good initiative☆18Updated 5 years ago
- AlvinToh Learning Repository for The Ultimate Hands-On Hadoop - Tame your Big Data!☆10Updated 7 years ago
- Udacity Data Engineering Nanodegree Program☆53Updated 4 years ago
- The collection of exercises I did during Ironhack's Data Science bootcamp.☆15Updated 5 years ago
- Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions☆17Updated 2 years ago
- ☆21Updated 2 years ago
- A step-by-step tutorial to learn Data Science☆88Updated 4 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆18Updated last year
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 6 years ago
- All Data Engineering notebooks from Datacamp course☆116Updated 6 years ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆57Updated 3 years ago
- A Collection of Useful Notebooks to help in Feature Engineering.☆12Updated 5 years ago
- MLOps for deploying a Credit Risk model☆35Updated 2 years ago
- ☆63Updated 7 years ago
- Data Engineering Bootcamp☆30Updated 6 months ago
- plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste…☆14Updated 2 years ago
- Real World Project on Formula1 Racing using Azure Databricks, Delta Lake and Azure Data Factory☆13Updated 2 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 3 years ago
- Created an optimised pipeline to provide accurate data for analysis, then used snowsight (provided by Snowflake) to create a dashboard.☆20Updated 3 years ago
- A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, DBT, Big Query etc☆15Updated 3 years ago
- Notes, annotations, and exercises from Coursera's SQL for Data Science course: https://www.coursera.org/learn/sql-for-data-science☆62Updated 3 years ago
- Portofolio repository for Udacity Data Scientist Nanodegree☆42Updated 5 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆88Updated 6 years ago
- The Repository for all code I use in my Data Science and Machine Learning Tutorials on YouTube☆75Updated 3 years ago
- Simple ETL pipeline using Python☆29Updated 2 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆25Updated 3 years ago
- Udacity Data Engineer Nano Degree - Project-3 (Data Warehouse)☆22Updated 6 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆34Updated 6 years ago
- Data Engineering Project in GCP☆22Updated 2 years ago
- Course on Udemy by Jose Portilla☆98Updated 8 years ago