dipanjanS / BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark
This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apache Spark" by UC Berkeley and Databricks on edX
☆115Updated 8 months ago
Alternatives and similar repositories for BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark:
Users that are interested in BerkeleyX-CS100.1x-Big-Data-with-Apache-Spark are comparing it to the libraries listed below
- ☆77Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- General Assembly repo for Data Science 18☆36Updated 10 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 9 years ago
- Codes written for some competitions☆13Updated 8 years ago
- PySpark Machine Learning Examples☆44Updated 7 years ago
- Materials for the "Advanced Scikit-learn" class in the afternoon☆165Updated 6 years ago
- PyCon 2017 tutorial on time series analysis☆72Updated 7 years ago
- Codes related to Knocktober 2016☆23Updated 8 years ago
- PyCon SG 2016 - Customer Segmentation in Python☆56Updated 8 years ago
- Machine Learning and Data Analysis Case Studies using Spark.☆72Updated 4 years ago
- Project work for the Udacity Data Analyst Nanodegree☆38Updated 7 years ago
- Tutorial: Machine Learning with Text in scikit-learn☆74Updated 8 years ago
- Archived work from Udacity nanodegrees☆70Updated 3 years ago
- Pydata Dallas 2015 Scikit-Learn Tutorial☆62Updated 10 years ago
- Code for the Kaggle acquire valued shoppers challenge☆66Updated 11 years ago
- Code for O'Reilly's "A Short Course on TensorFlow"☆104Updated 7 years ago
- General Assembly's Data Science course in Washington, DC☆185Updated 2 years ago
- All materials for workshops - HackOn(Data) - Toronto☆33Updated 7 years ago
- AWS, Vagrant, and Spark☆21Updated 9 years ago
- Tutorial repo for the article "ML in Production"☆30Updated 2 years ago
- Set of Machine Learning and Stochastic Optimazion tools based on Hadoop, Spark and Storm https://pkghosh.wordpress.com/☆177Updated last year
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Updated 6 years ago
- Workshop for Spark and Databricks☆54Updated 5 years ago
- Containing codes of participation in Kaggle competitions.☆37Updated 9 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆67Updated 9 years ago
- This repository contains code examples for the course CS 20SI: TensorFlow for Deep Learning Research.☆12Updated 8 years ago
- Simple sentiment analysis model with PySpark☆43Updated 7 years ago
- Welcome to my independent research repository!☆17Updated 8 years ago
- Analysis of NYC Green Taxi and a model to predict the tip as a percentage of the total fare☆45Updated 7 years ago