CjTouzi / edx-Introduction-to-Big-Data-with-Apache-Spark
☆12Updated 9 years ago
Alternatives and similar repositories for edx-Introduction-to-Big-Data-with-Apache-Spark:
Users that are interested in edx-Introduction-to-Big-Data-with-Apache-Spark are comparing it to the libraries listed below
- Uses NLP methods to parse and classify contracts from The City of New Orleans☆10Updated 9 years ago
- Healthcare Twitter Analysis☆26Updated 8 years ago
- a graph-based knowledge search engine powered by Wikipedia☆14Updated last year
- Provides the implementation of a topic detection framework developed for the MULTISENSOR project.☆9Updated 9 years ago
- Social Context Analysis aNd Emotion Recognition☆12Updated 7 years ago
- Exploring item combinations with a bar chart☆10Updated 3 years ago
- Introduction to Neural Networks with Keras, O'Reilly Artificial Intelligence Conference 2017, Tutorial☆18Updated 7 years ago
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- ☆12Updated 8 years ago
- Spark-cloud is a set of scripts for starting spark clusters on ec2☆12Updated 9 years ago
- Predicting the winner of FIFA 2014☆16Updated 10 years ago
- System for mining Wikipedia Usage data to read our collective mind☆21Updated 10 years ago
- Flask app to run a bandit algorithm testing different beer recommenders☆25Updated 10 years ago
- Keyword Extraction system using Brown Clustering - (This version is trained to extract keywords from job listings)☆17Updated 10 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Updated 10 years ago
- Data and code for "Fast Data Applications with Spark and Python"☆25Updated 8 years ago
- Jupyter notebooks and code for Intro to DL talk at Genesys☆14Updated 8 years ago
- Twitter visualizaton experiment using various python-based technologies.☆60Updated 8 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.☆16Updated 7 years ago
- ☆11Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- ☆13Updated 5 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago
- Deprecated Module: See Xponents or OpenSextantToolbox as active code base.☆31Updated 11 years ago
- ☆41Updated 4 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly☆47Updated 4 years ago
- Correlation matrix with scatter plot using d3.js☆19Updated 10 years ago
- Pydata Seattle 2015 Trend Estimation in Time Series Signals Deck + Notebooks☆21Updated 9 years ago