donaldpminer / hadoop-python-tutorial
Exercises and examples developed for the Hadoop with Python tutorial
☆21 Updated 5 years ago
Related projects
Alternatives and complementary repositories for hadoop-python-tutorial
- This repository contains code files, specifically IPython notebooks, for the assignments in the course "Introduction to Big Data with Apach… ☆114 Updated 3 months ago
- Solution code from my winning submission to Kaggle's PyCon 2015 competition ☆55 Updated 9 years ago
- Workshop on Machine Learning in Python ☆19 Updated 8 years ago
- ☆77 Updated 8 years ago
- Springboard - Data Science Intensive course ☆13 Updated 7 years ago
- Workshop: Python for Data Science ☆61 Updated 9 years ago
- Repository for the PyData DC 2016 tutorial ☆29 Updated 7 years ago
- Tutorial: Machine Learning with Text in scikit-learn ☆74 Updated 7 years ago
- PySpark Machine Learning Examples ☆44 Updated 6 years ago
- Slides and code examples for H2O tutorials at various events ☆56 Updated 7 years ago
- General Assembly repo for Data Science 18 ☆36 Updated 9 years ago
- AWS, Vagrant, and Spark ☆20 Updated 9 years ago
- Churn Prediction with PySpark using MLlib and ML Packages ☆56 Updated 8 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S… ☆66 Updated 8 years ago
- ☆87 Updated 8 years ago
- Machine learning with scikit-learn tutorial at PyData Chicago 2016 ☆128 Updated 8 years ago
- A curated list of R tutorials for Data Science, NLP and Machine Learning ☆22 Updated 8 years ago
- Walkthrough exercises from PandasTutorial by Wes McKinney. ☆151 Updated 4 years ago
- PyCon SG 2016 - Customer Segmentation in Python ☆56 Updated 8 years ago
- Code Repository for Python Deeper Insights into Machine Learning, published by Packt ☆29 Updated last year
- Code written for some competitions ☆13 Updated 7 years ago
- PySpark sample scripts ☆17 Updated 5 years ago
- This repository contains materials for demos, tutorials, and talks by Dato Inc. ☆173 Updated 8 years ago
- Data Science in 30 Minutes #5: Spark ☆19 Updated 7 years ago
- This repository contains code examples for the course CS 20SI: TensorFlow for Deep Learning Research. ☆12 Updated 7 years ago