lintool / bigdata-2016w
CS 489/698 Big Data Infrastructure (Winter 2016) at the University of Waterloo
☆39 Updated 9 years ago
Alternatives and similar repositories for bigdata-2016w
Users who are interested in bigdata-2016w are comparing it to the repositories listed below.
- My winning solution for Kaggle Higgs Machine Learning Challenge (single classifier, xgboost) ☆82 Updated 11 years ago
- ☆154 Updated 8 years ago
- My entry to the Kaggle 2012 Stack Overflow competition. Ranked 10th on the final public leaderboard. ☆45 Updated 9 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for… ☆20 Updated 8 years ago
- Deep learning made easy ☆116 Updated 11 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams. ☆63 Updated 9 years ago
- SDK for Turi's GraphLab Create. ☆148 Updated 7 years ago
- GPU Acceleration for Apache Spark ☆34 Updated 10 years ago
- Distributed Matrix Library ☆72 Updated 8 years ago
- Algorithm's team Jupyter Notebooks ☆113 Updated 4 months ago
- Quick summary: This code implements a spectral (third-order tensor decomposition) method for learning an LDA topic model on Spark. ☆105 Updated 7 years ago
- Spark MLlib code optimized to efficiently support sparse data ☆51 Updated 8 years ago
- Training materials for Strata, AMP Camp, etc ☆149 Updated 9 years ago
- cache-friendly multithread matrix factorization ☆90 Updated 9 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyData Paris 2015 ☆50 Updated 10 years ago
- Public materials for the Fall 2016 offering of CS145 ☆34 Updated 8 years ago
- Defunct ☆241 Updated 8 years ago
- Shell script to scrape Harvard CS109 (Intro to Data Science) lecture videos ☆79 Updated 9 years ago
- Logistic regression engine for medium-sized data ☆55 Updated 10 years ago
- ☆17 Updated 3 years ago
- Data-Intensive Text Processing with MapReduce ☆627 Updated 4 years ago
- A primal-dual framework for distributed L1-regularized optimization ☆36 Updated 9 years ago
- PyData NYC 2014 Scikit-Learn Tutorial ☆65 Updated 10 years ago
- A parallel IRWLS library to solve SVMs and budgeted SVMs ☆59 Updated 8 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark ☆30 Updated 7 years ago
- You wanna learn how to use Hadoop, start here! ☆40 Updated 13 years ago
- ☆20 Updated 4 years ago
- Various neural networks on MNIST data using TensorFlow library ☆16 Updated 9 years ago
- An implementation of DistBelief using the Akka Actor framework ☆83 Updated 9 years ago
- Introduction to Machine Learning, a series of IPython Notebook and accompanying slideshow and video ☆103 Updated 6 years ago