donaldpminer / hadoop-python-tutorial
Exercises and examples developed for the Hadoop with Python tutorial
☆21Updated 5 years ago
Alternatives and similar repositories for hadoop-python-tutorial:
Users that are interested in hadoop-python-tutorial are comparing it to the libraries listed below
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 9 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apach…☆115Updated 8 months ago
- PySpark Machine Learning Examples☆44Updated 7 years ago
- PyCon SG 2016 - Customer Segmentation in Python☆56Updated 8 years ago
- General Assembly repo for Data Science 18☆36Updated 10 years ago
- Machine learning with scikit-learn tutorial at PyData Chicago 2016☆129Updated 8 years ago
- Jupyter notebooks for learning Python and Data Science, companion to Data Science Solutions book.☆36Updated 5 years ago
- ☆77Updated 8 years ago
- Repository for the PyData DC 2016 tutorial☆29Updated 8 years ago
- AWS, Vagrant, and Spark☆21Updated 9 years ago
- A complete daily plan for studying to become a machine learning engineer.☆50Updated 8 years ago
- helpful resources for (big) data science☆33Updated 3 years ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn☆81Updated 6 years ago
- General Assembly's Data Science course in Washington, DC☆185Updated 2 years ago
- Repository for data science course Spring 14☆184Updated 10 years ago
- Materials for the "Advanced Scikit-learn" class in the afternoon☆165Updated 6 years ago
- Workshop: Python for Data Science☆62Updated 10 years ago
- PyCon 2017 tutorial on time series analysis☆72Updated 7 years ago
- Slides and code examples for H2O tutorials at various events☆56Updated 7 years ago
- Tutorial: Machine Learning with Text in scikit-learn☆74Updated 8 years ago
- Codes, notes and guides on Udacity's machine learning nanodegree.☆83Updated 8 years ago
- Workshop on Machine Learning in Python☆19Updated 9 years ago
- This repository contains materials for demos, tutorials, and talks by Dato Inc.☆172Updated 8 years ago
- Archived work from Udacity nanodegrees☆70Updated 3 years ago
- Code for CS570, Essentials of Data Science☆109Updated 7 years ago
- Lab for Linear and Logistic Regression, SciKit Learn☆41Updated 6 years ago
- Solution code from my winning submission to Kaggle's PyCon 2015 competition☆55Updated 10 years ago
- ☆40Updated 7 years ago
- R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks☆122Updated 7 years ago
- Updated repository☆157Updated 3 years ago