hougs / py-hadoop-tutorial
Source Material for using Python and Hadoop together
☆13Updated 8 years ago
Alternatives and similar repositories for py-hadoop-tutorial:
Users that are interested in py-hadoop-tutorial are comparing it to the libraries listed below
- Materials for PyData at Strata/Hadoop World San Jose 2015☆12Updated 10 years ago
- Materials for my PyData Seattle talk☆21Updated 9 years ago
- Docker container with a PyData stack and JupyterHub server☆37Updated 8 years ago
- VM Setup stuff for http://bit.ly/22giU4y☆9Updated 8 years ago
- Repository for exploratory data transformation & visualization talk☆27Updated 8 years ago
- 12 Week Data Science Immersive☆27Updated 9 years ago
- Materials for Strata NYC 2016 scikit-learn tutorial☆15Updated 8 years ago
- Presentation at Perth Data Science Meetup, February 2015☆72Updated 10 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 9 years ago
- Repo of data science coding challenges for various companies☆24Updated 9 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- Repository for code related to the end-to-end data analysis in Python workshop, from the Open Data Science Conference 2015☆15Updated 9 years ago
- A collection of IPython Notebooks containing my research.☆20Updated 6 years ago
- spark backend for dplyr☆48Updated 9 years ago
- Deep Learning for Pugs☆74Updated 7 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015☆50Updated 9 years ago
- Code for Pythonic visualization blog post☆40Updated 7 years ago
- ☆24Updated 6 years ago
- Simple scripts to set up a fresh data science box using an Ubuntu 12.04.* LTS 64-bit server running on an EC2☆163Updated 11 years ago
- Computational Statistics II Tutorial at SciPy 2015☆48Updated 9 years ago
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015)☆33Updated 5 years ago
- Examples from Holden's intro to PySpark workshop. This is an intro level workshop focused on using Spark with Python.☆14Updated 7 years ago
- My Tutorial for PyData London☆25Updated 9 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 7 years ago
- Materials for my talk at PyData Chicago 2016☆20Updated 7 years ago
- Portland Python Meetup March 2015☆40Updated 9 years ago
- (Deprecated) Task for the Search & Discovery data analyst job.☆21Updated 9 years ago
- PyData Madrid 2016 material for the talk: A Primer to recommendation Systems☆37Updated 8 years ago
- ☆34Updated 8 years ago