mGalarnyk / DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSDLinks
Map-reduce, streaming analysis, and external memory algorithms and their implementation using the Hadoop and its eco-system: HBase, Hive, Pig and Spark. The class will include assignment of analyzing large existing databases.
☆34Updated 8 years ago
Alternatives and similar repositories for DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD
Users that are interested in DSE230_Data_Analysis_Using_Hadoop_and_Spark_UCSD are comparing it to the libraries listed below
Sorting:
- Repo for my graduate data science machine learning class at UCSD (UC San Diego). This course provides a broad introduction to the practic…☆52Updated 7 years ago
- Homework/Classwork for my DSE 200 Python for Data Analysis Class at UC San Diego (UCSD)☆100Updated 8 years ago
- Probability and Statistics Using Python Data Science Masters Course at UCSD (DSE 210)☆180Updated 7 years ago
- In the Data Science and Engineering program, engineering professionals combine the skills of software programmer, database manager, and s…☆27Updated 7 years ago
- A short tutorial notebook on PySpark☆15Updated 9 years ago
- Workshop: Python for Data Science☆62Updated 10 years ago
- iPython NOtebooks on Stats☆164Updated 10 months ago
- PyCon SG 2016 - Customer Segmentation in Python☆56Updated 8 years ago
- ☆213Updated 3 years ago
- General Assembly repo for Data Science 18☆36Updated 10 years ago
- Project work for the Udacity Data Analyst Nanodegree☆39Updated 7 years ago
- Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).☆234Updated 2 years ago
- A complete daily plan for studying to become a machine learning engineer.☆52Updated 8 years ago
- Springboard - Data Science Intensive course☆13Updated 8 years ago
- a curated list of R tutorials for Data Science, NLP and Machine Learning☆23Updated 9 years ago
- Interview stuff for friends☆84Updated 2 years ago
- ☆26Updated last year
- Coursera machine learning specialization coursework (python based, University of Washington).☆19Updated 9 years ago
- Solutions to the book "Collection of Data Science TakeHome Challenges" in Python.☆10Updated 7 years ago
- Udacity Data Science Nanodegree Repository. Contains lecture notes, and dummy scripts as well as projects undertaken for the nanodegree.☆31Updated 5 years ago
- PyCon 2017 tutorial on time series analysis☆72Updated 8 years ago
- Archived work from Udacity nanodegrees☆69Updated 3 years ago
- Tutorial: Machine Learning with Text in scikit-learn☆74Updated 8 years ago
- Projects for my Udacity Data Analyst Nanodegree☆102Updated 4 years ago
- ☆48Updated 8 years ago
- Few tutorials on pandas, matplotlib and seaborn☆27Updated 9 years ago
- Repository for sharing the knowledge from the learning path of Kaggle Learning. All contributions welcome :).☆153Updated 7 years ago
- Data Science Cheat Sheet is help to remind code with in minute and also useful to recall the code.Collecting at one place so everyone can…☆26Updated 7 years ago
- Database Management Systems Data Science Masters Course (DSE 201)☆12Updated 8 years ago
- Lab for Linear and Logistic Regression, SciKit Learn☆41Updated 6 years ago