CjTouzi / edx-Introduction-to-Big-Data-with-Apache-Spark
☆12Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for edx-Introduction-to-Big-Data-with-Apache-Spark
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- Provides the implementation of a topic detection framework developed for the MULTISENSOR project.☆9Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- ☆16Updated 8 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Updated 10 years ago
- Healthcare Twitter Analysis☆26Updated 8 years ago
- ☆12Updated 8 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- tutorials and samples that show you how get the most out of IBM Analytics for Apache Spark☆79Updated 6 years ago
- Worked examples for exercises in Think Stats using the Scientific Python stack.☆8Updated 4 years ago
- ☆41Updated 4 years ago
- ☆11Updated 8 years ago
- RedRock - Mobile Application prototype using Apache Spark, Twitter and Elasticsearch☆14Updated 6 years ago
- Distributed implementation of Robust PLSA using Spark☆12Updated 3 years ago
- Exploring item combinations with a bar chart☆10Updated 3 years ago
- A Spark-based LexRank extractive summarizer for text documents☆19Updated 8 years ago
- ☆13Updated 5 years ago
- Tutorial repo for the article "ML in Production"☆30Updated last year
- Deployment instructions to get a GPU VM for the Deep Learning class☆17Updated 6 years ago
- Jupyter notebooks and code for Intro to DL talk at Genesys☆14Updated 8 years ago
- System for mining Wikipedia Usage data to read our collective mind☆21Updated 10 years ago
- edXSpark☆21Updated 8 years ago
- Spark in Kaggle competitions☆9Updated 8 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 7 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- Jupyter notebook containing code from text preprocessing blog post☆10Updated 7 years ago
- AXA Driver Telematics Challenge on Kaggle.com☆50Updated 7 years ago
- A simple introduction to using spark ml pipelines☆26Updated 6 years ago