caizkun / mapreduce-examplesLinks
A collection of mapreduce problems and solutions
☆35Updated 8 years ago
Alternatives and similar repositories for mapreduce-examples
Users that are interested in mapreduce-examples are comparing it to the libraries listed below
Sorting:
- Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.☆95Updated 4 years ago
- This is a repository for my data engineer course through Udacity.☆16Updated 6 years ago
- This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.☆56Updated 7 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆75Updated 2 years ago
- Apache Spark Course Material☆96Updated 2 years ago
- Mastering Big Data Analytics with PySpark, Published by Packt☆165Updated last year
- notebooks produced throughout the Udacity's Nanodegree Data Engineering Course☆74Updated 5 years ago
- Stream/batch system with Hadoop, Spark on NYC taxi data | #DE☆26Updated 4 months ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆104Updated 5 years ago
- Counting Tweets Per User in Real-Time☆43Updated 8 years ago
- GCP-Data-Engineer-Study-Guide☆123Updated 6 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 6 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University☆165Updated 2 months ago
- Develop ML models predict taxi trip duration in NYC. Ranked : Top 6% | RMSLE : 0.377 (Kaggle) | #DS☆17Updated 3 years ago
- Projects done in the Data Engineering Nanodegree by Udacity.com☆273Updated 6 years ago
- ☆152Updated 7 years ago
- How to build an awesome data engineering team☆101Updated 6 years ago
- A real-time event pipeline around Kafka Ecosystem for Chicago Transit Authority.☆32Updated 2 years ago
- Because its never late to start taking notes and 'public' it...☆62Updated 8 months ago
- The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such …☆123Updated 3 years ago
- PySpark Tutorial for Beginners on Google Colab: Hands-On Guide☆17Updated 5 years ago
- ☆170Updated 3 years ago
- ETL pipeline using pyspark (Spark - Python)☆116Updated 5 years ago
- This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language☆567Updated last year
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆57Updated 3 years ago
- Big Data for Data Engineers Coursera Specialization from Yandex☆101Updated 2 years ago
- Fundamentals of Spark with Python (using PySpark), code examples☆362Updated 3 years ago
- Apache Spark 3 - Structured Streaming Course Material☆126Updated 2 years ago
- data engineering 100 days 🤖 🧲 🦾 | #DE☆40Updated 2 years ago
- 🔴 1704 Machine Learning, Data Science & Python Interview Questions (ANSWERED) To Kill Your Next ML & DS Interview. Get All Answers + PDF…☆118Updated 3 years ago