cestella / presentationsLinks
Public Presentations
☆24Updated last month
Alternatives and similar repositories for presentations
Users that are interested in presentations are comparing it to the libraries listed below
Sorting:
- Spark Tutorial at the University of Maryland☆38Updated 10 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- open source version of the Bonsai library☆26Updated 9 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Single view demo☆14Updated 9 years ago
- Java implementation of the Microsoft's AdPredictor algorithm☆17Updated 7 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 8 years ago
- A real time streaming implementation of markov chain based fraud detection☆23Updated 10 years ago
- PyMC version 3 (PyMC 2 is in branch 2.3)☆27Updated 10 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- A simple introduction to using spark ml pipelines☆26Updated 7 years ago
- Spark MOOC setup and labs for DBC users☆45Updated 9 years ago
- Oracle Data Science Bootcamp 2014☆25Updated 10 years ago
- Introduction to predictive modeling in Spark with applications in pharmaceutical bioinformatics☆39Updated 9 years ago
- Fast Ensembles of Sparse Trees☆38Updated 9 years ago
- Set up tools for running a few DL libraries on CDH and CDSW☆17Updated 4 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- ☆11Updated 8 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- SmallK: very fast data clustering tools☆14Updated 6 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- Some IPython notebooks I've created...☆29Updated 9 years ago
- Machine Learning for Cascading☆82Updated 9 years ago
- ☆15Updated 8 years ago
- ☆24Updated 10 years ago
- Quick summary: This code implements a spectral (third order tensor decomposition) learning method for learning LDA topic model on Spark.☆105Updated 6 years ago