nicodv / pyspark-tutorial
A short tutorial notebook on PySpark
☆15 Updated 9 years ago
Alternatives and similar repositories for pyspark-tutorial
Users who are interested in pyspark-tutorial are comparing it to the repositories listed below.
- All Kaggle competitions ☆91 Updated 8 years ago
- PyCon 2017 tutorial on time series analysis ☆72 Updated 8 years ago
- This library is a wrapper for sklearn and works with data stored using Pandas module. ☆17 Updated 9 years ago
- 32/2384 Solution to Kaggle Mercari Competition (solo silver medal winner) ☆21 Updated 7 years ago
- Slides and materials for most of my talks by year ☆92 Updated last year
- PyTennessee 2014: Statistical Data Analysis in Python ☆85 Updated 10 years ago
- Material for UW Extension Data Science 350 ☆19 Updated 7 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator ☆30 Updated 7 years ago
- A machine learning algorithm written to predict severity of insurance claim ☆20 Updated 8 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR. ☆44 Updated 8 years ago
- Map-reduce, streaming analysis, and external memory algorithms and their implementation using the Hadoop and its eco-system: HBase, Hive,… ☆34 Updated 8 years ago
- Code for the Kaggle acquire valued shoppers challenge ☆66 Updated 11 years ago
- Predicting happiness from demographics and poll answers ☆45 Updated 8 years ago
- Springboard - Data Science Intensive course ☆13 Updated 8 years ago
- Containing codes of participation in Kaggle competitions. ☆37 Updated 9 years ago
- ☆26 Updated last year
- Jupyter notebooks for learning Python and Data Science, companion to Data Science Solutions book. ☆36 Updated 5 years ago
- Kaggle competition results ☆20 Updated 6 years ago
- Metis Data Science Portfolio - Summer 2017 ☆26 Updated 7 years ago
- Pydata Dallas 2015 Scikit-Learn Tutorial ☆62 Updated 10 years ago
- ☆77 Updated 8 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit ☆24 Updated 11 years ago
- Material and slides for Boston NLP meetup May 23rd 2016 ☆17 Updated 9 years ago
- Apache Zeppelin notebooks for Recommendation Engines using Keras and Machine Learning on Apache Spark ☆32 Updated 7 years ago
- A helper library for data science pipeline ☆36 Updated 6 years ago
- Notes for Data Science 350 Class ☆24 Updated 8 years ago
- Source Code for 'Advanced Data Analytics Using Python' by Sayan Mukhopadhyay ☆68 Updated 7 years ago
- Tutorial on deploying machine learning models to production ☆59 Updated 5 years ago
- Some small utility modules to help with pandas, numpy and sklearn usage in other projects ☆22 Updated 10 years ago
- Pandas integration with sklearn ☆21 Updated 8 years ago