dipanjanS / BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark
This repository contains code files, specifically IPython notebooks, for the assignments in the course "Introduction to Big Data with Apache Spark" by UC Berkeley and Databricks on edX.
☆114 Updated 3 months ago
Related projects
Alternatives and complementary repositories for BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark
- Churn Prediction with PySpark using MLlib and ML Packages ☆56 Updated 8 years ago
- ☆77 Updated 8 years ago
- Sharing interesting and noteworthy Data Engineering content ☆65 Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be… ☆30 Updated 9 years ago
- This repository contains code examples for the course CS 20SI: TensorFlow for Deep Learning Research. ☆12 Updated 7 years ago
- General Assembly repo for Data Science 18 ☆36 Updated 9 years ago
- PySpark Machine Learning Examples ☆44 Updated 6 years ago
- Updated repository ☆157 Updated 2 years ago
- An API to Analyze Cab GeoLocation Data and a Simulated App for finding an available cab in Real-Time ☆63 Updated 9 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S… ☆66 Updated 8 years ago
- Data Analyst Nano-degree Projects, Deep Learning, Android, some front end stuff ☆74 Updated 5 years ago
- Archived work from Udacity nanodegrees ☆70 Updated 2 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References ☆70 Updated 5 years ago
- Feature Engineering with Pipeline Talk at ODSC West 2016, Santa Clara ☆17 Updated 8 years ago
- pyspark sample scripts ☆17 Updated 5 years ago
- Projects for my Udacity Data Analyst Nanodegree ☆100 Updated 4 years ago
- Machine Learning and Data Analysis Case Studies using Spark. ☆72 Updated 3 years ago
- Pydata Dallas 2015 Scikit-Learn Tutorial ☆62 Updated 9 years ago
- ☆26 Updated 10 months ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn ☆81 Updated 6 years ago
- ☆20 Updated 7 years ago
- This repository contains materials for demos, tutorials, and talks by Dato Inc. ☆173 Updated 8 years ago
- PyCon 2017 tutorial on time series analysis ☆72 Updated 7 years ago
- Solution code from my winning submission to Kaggle's PyCon 2015 competition ☆55 Updated 9 years ago
- 32/2384 Solution to Kaggle Mercari Competition (solo silver medal winner) ☆20 Updated 6 years ago
- PySpark Code for Hands-on Learners ☆114 Updated 5 years ago
- COMS W4995 Applied Machine Learning - Spring 18 ☆158 Updated 5 years ago