mGalarnyk / DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSDLinks
Map-reduce, streaming analysis, and external memory algorithms and their implementation using the Hadoop and its eco-system: HBase, Hive, Pig and Spark. The class will include assignment of analyzing large existing databases.
☆34Updated 8 years ago
Alternatives and similar repositories for DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD
Users that are interested in DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD are comparing it to the libraries listed below
Sorting:
- Homework/Classwork for my DSE 200 Python for Data Analysis Class at UC San Diego (UCSD)☆101Updated 9 years ago
- Repo for my graduate data science machine learning class at UCSD (UC San Diego). This course provides a broad introduction to the practic…☆53Updated 7 years ago
- iPython NOtebooks on Stats☆165Updated last year
- Probability and Statistics Using Python Data Science Masters Course at UCSD (DSE 210)☆182Updated 8 years ago
- In the Data Science and Engineering program, engineering professionals combine the skills of software programmer, database manager, and s…☆28Updated 8 years ago
- Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).☆236Updated 2 years ago
- It consists of examples, assignments discussed in data science course taken at algorithmica.☆108Updated last year
- This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apach…☆116Updated last year
- Workshop: Python for Data Science☆64Updated 10 years ago
- Projects for my Udacity Data Analyst Nanodegree☆104Updated 5 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆58Updated 9 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 9 years ago
- Code files added☆100Updated 2 years ago
- Notebooks for Course☆247Updated 9 years ago
- Tutorials on Machine Learning with Scikit-Learn☆83Updated 5 years ago
- a curated list of R tutorials for Data Science, NLP and Machine Learning☆23Updated 9 years ago
- A complete daily plan for studying to become a machine learning engineer.☆52Updated 9 years ago
- Repository for sharing the knowledge from the learning path of Kaggle Learning. All contributions welcome :).☆153Updated 7 years ago
- Code material for a data science tutorial☆197Updated 8 years ago
- Hands-On Data Science and Python Machine Learning, published by Packt☆144Updated 2 years ago
- ☆26Updated last year
- This is the presentation on - What are the key points one should consider if they will be appearing in Data Science job interview☆40Updated 7 years ago
- Solution of the Titanic Kaggle competition☆131Updated 4 years ago
- Project work for the Udacity Data Analyst Nanodegree☆39Updated 8 years ago
- Jupyter notebooks for learning Python and Data Science, companion to Data Science Solutions book.☆37Updated 5 years ago
- Code from Jason Brownlee's course on mastering machine learning☆126Updated 9 years ago
- Detailed notes and code to learn the basics of machine learning with scikit-learn.☆35Updated 9 years ago
- Data Science and Machine Learning with Python - Hands On from Udemy☆14Updated 8 years ago
- A short tutorial notebook on PySpark☆15Updated 9 years ago
- PyCon SG 2016 - Customer Segmentation in Python☆56Updated 9 years ago