mrm1001 / spark_tutorial
Code for the Spark tutorial at the PyData conference in London, June 2015
☆12 Updated 8 years ago
Alternatives and similar repositories for spark_tutorial:
Users who are interested in spark_tutorial are comparing it to the repositories listed below
- PyData Madrid 2016 material for the talk: A Primer to recommendation Systems ☆37 Updated 8 years ago
- A simple introduction to using spark ml pipelines ☆26 Updated 6 years ago
- Notes on Lambda Architecture ☆12 Updated 7 years ago
- Spark MOOC setup and labs for DBC users ☆45 Updated 9 years ago
- Scikit-learn quickstart tutorial for Webstep ☆18 Updated 7 years ago
- Pydata NYC 2014 Scikit Learn Tutorial ☆64 Updated 10 years ago
- Understanding Probabilistic Topic Models with Simulation in Python ☆64 Updated 7 years ago
- ☆20 Updated 3 years ago
- Kaggle Criteo https://www.kaggle.com/c/criteo-display-ad-challenge ☆98 Updated 10 years ago
- Spark library for doing exploratory data analysis in a scalable way ☆43 Updated 9 years ago
- My machine learning model for the See Click Predict Fix Kaggle competition ☆31 Updated 7 years ago
- A simple example of containerized data science with python and Docker. ☆51 Updated 6 years ago
- Oracle Data Science Bootcamp 2014 ☆25 Updated 9 years ago
- Sample applications built using AWS' Amazon Machine Learning. ☆51 Updated 7 years ago
- Material and slides for Boston NLP meetup May 23rd 2016 ☆17 Updated 8 years ago
- Amazon access control challenge ☆25 Updated 10 years ago
- Explore different deep-learning frameworks ☆18 Updated 6 years ago
- Additional useful algorithms that can be used with spark. ☆24 Updated 10 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn. ☆69 Updated 10 years ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016) ☆30 Updated 8 years ago
- Tools for performing hyperparameter search with Scikit-Learn and Dask http://dask-searchcv.readthedocs.io ☆11 Updated 7 years ago
- Kaggle competition ☆23 Updated 9 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit ☆24 Updated 10 years ago
- PyTennessee 2014: Statistical Data Analysis in Python ☆85 Updated 10 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle ☆36 Updated 9 years ago
- A couple projects using scikit-learn illustrating project decision making. ☆15 Updated 8 years ago
- Spark Tutorial at the University of Maryland ☆38 Updated 10 years ago
- Articles on Data Science, Jupyter, and Pandas ☆18 Updated 9 years ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn ☆81 Updated 6 years ago
- ☆27 Updated 9 years ago