mGalarnyk / DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD
Map-reduce, streaming analysis, and external memory algorithms and their implementation using the Hadoop and its eco-system: HBase, Hive, Pig and Spark. The class will include assignment of analyzing large existing databases.
☆34Updated 7 years ago
Alternatives and similar repositories for DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD:
Users that are interested in DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD are comparing it to the libraries listed below
- Homework/Classwork for my DSE 200 Python for Data Analysis Class at UC San Diego (UCSD)☆100Updated 8 years ago
- Repo for my graduate data science machine learning class at UCSD (UC San Diego). This course provides a broad introduction to the practic…☆52Updated 6 years ago
- Probability and Statistics Using Python Data Science Masters Course at UCSD (DSE 210)☆177Updated 7 years ago
- Workshop: Python for Data Science☆62Updated 10 years ago
- A complete daily plan for studying to become a machine learning engineer.☆50Updated 8 years ago
- Coursera machine learning specialization coursework (python based, University of Washington).☆19Updated 8 years ago
- ☆48Updated 8 years ago
- This is the presentation on - What are the key points one should consider if they will be appearing in Data Science job interview☆40Updated 6 years ago
- common data analysis and machine learning tasks using python☆41Updated 8 years ago
- iPython NOtebooks on Stats☆163Updated 7 months ago
- Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).☆234Updated 2 years ago
- Lab for Linear and Logistic Regression, SciKit Learn☆41Updated 6 years ago
- In the Data Science and Engineering program, engineering professionals combine the skills of software programmer, database manager, and s …☆27Updated 7 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apach…☆115Updated 7 months ago
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 9 years ago
- Repository for sharing the knowledge from the learning path of Kaggle Learning. All contributions welcome :).☆149Updated 7 years ago
- An awesome Data Science repository to learn and apply for real world problems.☆59Updated 8 years ago
- ☆22Updated 4 years ago
- Interview stuff for friends☆84Updated 2 years ago
- a curated list of R tutorials for Data Science, NLP and Machine Learning☆23Updated 8 years ago
- ☆21Updated 9 years ago
- General Assembly repo for Data Science 18☆36Updated 9 years ago
- Workshop: Intro to Python for Data Analysis☆72Updated 10 years ago
- Modern databases can contain massive volumes of data. Within this data lies important information that can only be effectively analyzed u…☆9Updated 8 years ago
- machine learning and deep learning tutorials, articles and other resources☆41Updated 8 years ago
- Hands-On Data Science and Python Machine Learning, published by Packt☆140Updated 2 years ago
- A curated list of awesome Machine Learning frameworks, libraries and software.☆39Updated 8 years ago
- Springboard - Data Science Intensive course☆13Updated 8 years ago
- Code from Jason Brownlee's course on mastering machine learning☆124Updated 8 years ago
- A curated list of data science blogs☆23Updated 8 years ago