dipanjanS / BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark
This repository contains code files, specifically IPython notebooks, for the assignments in the course "Introduction to Big Data with Apache Spark" by UC Berkeley and Databricks on edX.
☆115 Updated 7 months ago
Alternatives and similar repositories for BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark:
Users who are interested in BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark are comparing it to the repositories listed below.
- ☆77 Updated 8 years ago
- PySpark Machine Learning Examples ☆45 Updated 7 years ago
- An API to Analyze Cab GeoLocation Data and a Simulated App for finding an available cab in Real-Time ☆63 Updated 10 years ago
- Churn Prediction with PySpark using MLlib and ML Packages ☆56 Updated 9 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be… ☆30 Updated 9 years ago
- All materials for workshops - HackOn(Data) - Toronto ☆33 Updated 7 years ago
- This repository contains materials for demos, tutorials, and talks by Dato Inc. ☆172 Updated 8 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S… ☆66 Updated 9 years ago
- Data Engineering Project at Insight ☆15 Updated 9 years ago
- Spark 2.0 Python Machine Learning examples ☆98 Updated 5 years ago
- Machine Learning and Data Analysis Case Studies using Spark. ☆72 Updated 3 years ago
- PyCon SG 2016 - Customer Segmentation in Python ☆56 Updated 8 years ago
- Codes written for some competitions ☆13 Updated 8 years ago
- Sharing interesting and noteworthy Data Engineering content ☆67 Updated 8 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References ☆69 Updated 6 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream ☆68 Updated 7 years ago
- General Assembly repo for Data Science 18 ☆36 Updated 9 years ago
- Updated repository ☆157 Updated 3 years ago
- Data and code for "Fast Data Applications with Spark and Python" ☆25 Updated 8 years ago
- Tutorial: Machine Learning with Text in scikit-learn ☆74 Updated 8 years ago
- Learn the pyspark API through pictures and simple examples ☆170 Updated 4 years ago
- All my submissions for Kaggle contests that I have been, and going to be participating. ☆40 Updated 7 years ago
- Archived work from Udacity nanodegrees ☆70 Updated 3 years ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn ☆81 Updated 6 years ago
- a curated list of R tutorials for Data Science, NLP and Machine Learning ☆23 Updated 8 years ago
- Generic codes related to NLP ☆84 Updated 6 years ago
- Source material for Data Science for Telecom Tutorial at Strata Singapore 2015 ☆102 Updated 9 years ago
- Final project for Udacity's ud741 — Unsupervised Learning ☆52 Updated 9 years ago
- ☆87 Updated 9 years ago
- Training models with Apache Spark, PySpark for Titanic Kaggle competition ☆14 Updated 8 years ago