GalvanizeDataScience / data-scientists-guide-apache-spark
☆15 Updated this week
Related projects:
- ☆28 Updated this week
- My capstone project for Galvanize (Zipfian Academy) ☆38 Updated 5 years ago
- ☆12 Updated this week
- This project contains the code to translate between Apache Spark and SFrame. ☆21 Updated 8 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle ☆36 Updated 9 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin ☆52 Updated 8 years ago
- Presentation at Perth Data Science Meetup, February 2015 ☆72 Updated 9 years ago
- Data and code for "Fast Data Applications with Spark and Python" ☆25 Updated 8 years ago
- ☆33 Updated 8 years ago
- Code examples supporting the "Introduction to Apache Spark" video published by O'Reilly Media ☆37 Updated 2 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for… ☆20 Updated 7 years ago
- ☆27 Updated this week
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS ☆8 Updated 9 years ago
- Code for Learning with Data Blog ☆64 Updated 7 years ago
- ☆31 Updated this week
- Complete Pipeline Training at Big Data Scala By the Bay ☆71 Updated 8 years ago
- Luigi Plugin for Hubot ☆35 Updated 8 years ago
- ☆51 Updated this week
- My talk at Strata 2014 in Santa Clara, CA ☆74 Updated 10 years ago
- Some IPython notebooks I've created... ☆29 Updated 8 years ago
- A Topic Modeling toolbox ☆93 Updated 8 years ago
- Materials for Strata NYC 2016 scikit-learn tutorial ☆15 Updated 8 years ago
- Generating the next read for our book club, with Data Science! ☆40 Updated 8 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc. ☆57 Updated 3 years ago
- Materials for PyData at Strata/Hadoop World San Jose 2015 ☆12 Updated 9 years ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn ☆81 Updated 6 years ago
- ☆17 Updated this week
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn. ☆69 Updated 9 years ago
- This is an introduction to Apache Spark DataFrames. ☆41 Updated 9 years ago
- PDF and Python files for creating time maps and downloading tweets ☆58 Updated 4 years ago