MLWhiz / Spark_ProjectsLinks
Spark Projects for the Berkeley Data Science Course
☆13Updated 10 years ago
Alternatives and similar repositories for Spark_Projects
Users that are interested in Spark_Projects are comparing it to the libraries listed below
Sorting:
- Lecture slides and quizzes for Leskovec, Rajaraman, and Ullman's "Mining of Massive Datasets" Stanford course☆92Updated 7 years ago
- Learning PySpark video series☆11Updated 7 years ago
- A collection of course materials, notes and assignments for the Masters of Information and Data Sciences program at UC Berkeley☆14Updated 5 years ago
- Notebook on finding fraud in credit card transactions☆14Updated 6 years ago
- This is the presentation on - What are the key points one should consider if they will be appearing in Data Science job interview☆40Updated 7 years ago
- Cancer Prediction based on Mass Spectrum Analysis☆10Updated 3 years ago
- Notes and Quiz Answers of Statistical Inference Coursera Course☆15Updated last year
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 8 years ago
- PySpark Code for Hands-on Learners☆116Updated 6 years ago
- PySpark-ETL☆22Updated 5 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆58Updated 9 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆39Updated 4 years ago
- Create scalable machine learning applications to power a modern data-driven business using Spark☆62Updated 2 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apach…☆116Updated last year
- In-class exercises for Deep Learning course at NYC Data Science Academy☆32Updated 7 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆60Updated 7 years ago
- Data sets and scripts for Coursera Big Data Specialization.☆171Updated last year
- Codes, notes and guides on Udacity's machine learning nanodegree.☆81Updated 9 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.☆42Updated 3 years ago
- Repository for sharing the knowledge from the learning path of Kaggle Learning. All contributions welcome :).☆153Updated 7 years ago
- ☆18Updated last month
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆28Updated 6 years ago
- Repository for medium article☆21Updated last year
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark☆27Updated 7 years ago
- Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt☆124Updated 2 years ago
- ☆21Updated 7 years ago
- Contains source files used in the Spark with Python course☆18Updated 6 years ago
- Projects from Udacity Data Streaming Nanodegree☆15Updated 2 years ago
- ☆15Updated 6 years ago
- Code repository for Large Scale Machine Learning with Spark by Packt☆20Updated 3 years ago