MLWhiz / Spark_ProjectsLinks
Spark Projects for the Berkeley Data Science Course
☆13Updated 10 years ago
Alternatives and similar repositories for Spark_Projects
Users that are interested in Spark_Projects are comparing it to the libraries listed below
Sorting:
- Notebook on finding fraud in credit card transactions☆14Updated 6 years ago
- Learning PySpark video series☆11Updated 7 years ago
- A simple implementation of k-means clustering on the Spark cluster computing framework. See http://cs.berkeley.edu/~matei/spark.☆27Updated 14 years ago
- Notes and Quiz Answers of Statistical Inference Coursera Course☆15Updated last year
- ☆19Updated 9 years ago
- PySpark-ETL☆23Updated 5 years ago
- Lecture slides and quizzes for Leskovec, Rajaraman, and Ullman's "Mining of Massive Datasets" Stanford course☆92Updated 7 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆27Updated 6 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apach…☆116Updated last year
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark☆27Updated 7 years ago
- Data sets and scripts for Coursera Big Data Specialization.☆171Updated last year
- Codes, notes and guides on Udacity's machine learning nanodegree.☆81Updated 9 years ago
- Large-scale Graph Mining with Spark☆39Updated 7 years ago
- This is the presentation on - What are the key points one should consider if they will be appearing in Data Science job interview☆40Updated 7 years ago
- Contains source files used in the Spark with Python course☆18Updated 6 years ago
- Project solutions for CS188 Artificial Intelligence course☆12Updated 8 years ago
- It's a Github Repo to get an understanding on various pre-processing steps required in Machine Learning before we build Machine Learning …☆28Updated 6 years ago
- edXSpark☆21Updated 9 years ago
- Predicting Boston Housing Prices using Linear Regression☆12Updated 6 years ago
- Télécom Paris | MS Big Data | SD 701 | Big Data Mining Course Project using Spark and Google Colab for building Scalable Recommender Syst…☆20Updated 2 years ago
- Detailed notes and code to learn the basics of machine learning with scikit-learn.☆35Updated 9 years ago
- Code and documents for UC Berkeley MIDS program course w207-Applied-Machine-Learning☆41Updated 4 years ago
- Code repository for Large Scale Machine Learning with Spark by Packt☆20Updated 3 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆31Updated 10 years ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆25Updated 5 years ago
- Contains code and presentation for my interactive hack session, 'Effective Feature Engineering: A Structured Approach to Building Better …☆30Updated 4 years ago
- self study java, through cs61b (http://www.cs.berkeley.edu/~jrs/61b/)☆10Updated 12 years ago
- In this Facebook live code along session with Hugo Bowne-Anderson, you're going to check out Google trends data of keywords 'diet', 'gym'…☆44Updated 7 years ago
- PySpark Cookbook, published by Packt☆93Updated 2 years ago
- This contains notes and exercises made in Python I made a long time ago from the Andrew Ng course in Coursera.☆47Updated 9 years ago