dipanjanS / BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark
This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apache Spark" by UC Berkeley and Databricks on edX
☆115Updated 8 months ago
Alternatives and similar repositories for BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark:
Users that are interested in BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark are comparing it to the libraries listed below
- ☆77Updated 8 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 9 years ago
- General Assembly repo for Data Science 18☆36Updated 9 years ago
- Tutorial: Machine Learning with Text in scikit-learn☆74Updated 8 years ago
- Codes written for some competitions☆13Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- PySpark Machine Learning Examples☆45Updated 7 years ago
- PyCon SG 2016 - Customer Segmentation in Python☆56Updated 8 years ago
- Codes related to Knocktober 2016☆23Updated 8 years ago
- Containing codes of participation in Kaggle competitions.☆37Updated 9 years ago
- AWS, Vagrant, and Spark☆21Updated 9 years ago
- PyCon 2017 tutorial on time series analysis☆72Updated 7 years ago
- Pydata Dallas 2015 Scikit-Learn Tutorial☆62Updated 9 years ago
- Archived work from Udacity nanodegrees☆70Updated 3 years ago
- Machine Learning and Data Analysis Case Studies using Spark.☆72Updated 4 years ago
- An API to Analyze Cab GeoLocation Data and a Simulated App for finding an available cab in Real-Time☆63Updated 10 years ago
- Apache Zeppelin notebooks for Recommendation Engines using Keras and Machine Learning on Apache Spark☆32Updated 7 years ago
- Code for the Kaggle acquire valued shoppers challenge☆66Updated 10 years ago
- Jupyter notebooks for learning Python and Data Science, companion to Data Science Solutions book.☆36Updated 5 years ago
- Materials for the "Advanced Scikit-learn" class in the afternoon☆165Updated 6 years ago
- Set of Machine Learning and Stochastic Optimazion tools based on Hadoop, Spark and Storm https://pkghosh.wordpress.com/☆177Updated last year
- Appendix☆13Updated 9 years ago
- Training models with Apache Spark, PySpark for Titanic Kaggle competition☆14Updated 8 years ago
- Solution code from my winning submission to Kaggle's PyCon 2015 competition☆55Updated 10 years ago
- COMS W4995 Applied Machine Learning - Spring 18☆158Updated 5 years ago
- All materials for workshops - HackOn(Data) - Toronto☆33Updated 7 years ago
- a curated list of R tutorials for Data Science, NLP and Machine Learning☆23Updated 8 years ago
- Sharing interesting and noteworthy Data Engineering content☆67Updated 8 years ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn☆81Updated 6 years ago