lintool / bigdata-2016w
CS 489/698 Big Data Infrastructure (Winter 2016) at the University of Waterloo
☆39Updated 9 years ago
Alternatives and similar repositories for bigdata-2016w
Users that are interested in bigdata-2016w are comparing it to the libraries listed below
Sorting:
- Distributed Matrix Library☆71Updated 8 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 8 years ago
- ☆46Updated 7 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- Data Science in Scala - Conf. Talk Repo☆15Updated 9 years ago
- !!!!!DEPRECATED!!!! distributed machine learning benchmark - a public benchmark of distributed ML solvers and frameworks☆40Updated 6 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 7 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- Splash Project for parallel stochastic learning☆94Updated 7 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 9 years ago
- The Musketeer workflow manager.☆41Updated 6 years ago
- Demo of random projections at BerlinBuzzwords 2015☆22Updated 5 years ago
- Lasagne / Theano tutorials for Nvidia Deep Learning Summercamp 2016☆26Updated 8 years ago
- Quick summary: This code implements a spectral (third order tensor decomposition) learning method for learning LDA topic model on Spark.☆105Updated 6 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 10 years ago
- A parallel IRWLS library to solve SVMs and budgeted SVMs☆59Updated 7 years ago
- A curated list of awesome Machine Learning frameworks, libraries and software.☆48Updated 10 years ago
- An implementation of DistBelief using the Akka Actor framework☆83Updated 9 years ago
- Demo code contrasting Google Dataflow (Apache Beam) with Apache Spark☆14Updated 8 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 8 years ago
- Testing framework for Collaborative Filtering☆38Updated 10 years ago
- *Experimental* GraphChi-DB graph database with computational capabilities☆79Updated 9 years ago
- Gaussian Mixture Model Implementation in Pyspark☆32Updated 10 years ago
- Datasets and notebooks☆13Updated 8 years ago
- IPython notebook for training multilayer LSTM and RNN networks with pycaffe☆53Updated 9 years ago
- ☆15Updated 7 years ago
- Deep learning made easy☆116Updated 10 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Notes and Labs for Advanced Topics in Data Processing☆39Updated 10 years ago