lintool / bigdata-2016w
CS 489/698 Big Data Infrastructure (Winter 2016) at the University of Waterloo
☆39Updated 8 years ago
Alternatives and similar repositories for bigdata-2016w:
Users who are interested in bigdata-2016w are comparing it to the libraries listed below
- Pydata NYC 2014 Scikit Learn Tutorial☆64Updated 10 years ago
- Distributed Matrix Library☆70Updated 7 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 6 years ago
- Quickly start YARN cluster on EC2☆30Updated 7 years ago
- Public materials for the Fall 2016 offering of CS145☆35Updated 7 years ago
- Testing framework for Collaborative Filtering☆38Updated 9 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 7 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- Source code for the tutorial series at http://www.thoughtly.co/blog/prototype☆32Updated 9 years ago
- A parallel IRWLS library to solve SVMs and budgeted SVMs☆59Updated 7 years ago
- A collection of efficient utilities for a data scientist.☆41Updated 9 years ago
- ☆23Updated 9 years ago
- Distributed Streaming Quantiles (for PySpark)☆37Updated 10 years ago
- Spark MLlib code optimized to efficiently support sparse data☆50Updated 8 years ago
- Code to create benchmarks for Kaggle's Facebook Recruiting Competition☆86Updated 12 years ago
- My winning solution for Kaggle Higgs Machine Learning Challenge (single classifier, xgboost)☆82Updated 10 years ago
- crumbling large graphs into connected components☆12Updated 7 years ago
- Code and data for bike forecast post☆17Updated 9 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle☆36Updated 9 years ago
- ☆38Updated 8 years ago
- Gaussian Mixture Model Implementation in Pyspark☆32Updated 10 years ago
- Slides for quick intro to machine learning with sklearn☆65Updated 10 years ago
- Predictive analytics using deepLearning4j and Spark☆26Updated 8 years ago
- Data Science in Scala - Conf. Talk Repo☆15Updated 8 years ago
- Common Code Workflow tutorial on Theano☆28Updated 10 years ago
- Backup of papers read☆17Updated 8 years ago
- Logistic regression engine for medium-sized data☆55Updated 9 years ago
- Experiments with distributed matrix factorization. Presented at DataWorks Summit 2017, München.☆10Updated 6 years ago