CjTouzi / edx-Introduction-to-Big-Data-with-Apache-Spark
☆12Updated 9 years ago
Alternatives and similar repositories for edx-Introduction-to-Big-Data-with-Apache-Spark:
Users that are interested in edx-Introduction-to-Big-Data-with-Apache-Spark are comparing it to the libraries listed below
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 8 years ago
- ☆41Updated 7 years ago
- Jupyter notebooks and code for Intro to DL talk at Genesys☆14Updated 8 years ago
- Correlation matrix with scatter plot using d3.js☆19Updated 10 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- Articles on Data Science, Jupyter, and Pandas☆18Updated 9 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Apache Toree quickstart tutorial☆29Updated 8 years ago
- Healthcare Twitter Analysis☆26Updated 8 years ago
- Distributed Streaming Quantiles (for PySpark)☆37Updated 11 years ago
- ☆13Updated 5 years ago
- How to use automatic polynomial features and neural network mode in VW☆17Updated 10 years ago
- Pydata Seattle 2015 Trend Estimation in Time Series Signals Deck + Notebooks☆21Updated 9 years ago
- Predicting sales with Pandas☆15Updated 9 years ago
- Mirror of Apache Spark☆24Updated 9 years ago
- A real time streaming implementation of markov chain based fraud detection☆24Updated 10 years ago
- A simple introduction to using spark ml pipelines☆26Updated 6 years ago
- Distributed implementation of Robust PLSA using Spark☆12Updated 3 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 8 years ago
- Data and code for "Fast Data Applications with Spark and Python"☆25Updated 8 years ago
- A curated list of articles, papers and tools for managing the building and deploying of machine learning models, aka machine learning eng…☆18Updated 6 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Updated 7 years ago
- Provides the implementation of a topic detection framework developed for the MULTISENSOR project.☆9Updated 8 years ago
- RedRock - Mobile Application prototype using Apache Spark, Twitter and Elasticsearch☆14Updated 6 years ago
- This project provides sequential pattern mining for Apache Spark. The algorithms are based on the work of Philippe Fournier-Viger and co…☆30Updated 9 years ago
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- Data Science in Scala - Conf. Talk Repo☆15Updated 8 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 9 years ago
- An API for Distributed Machine Learning☆154Updated 8 years ago