lintool / bigdata-2016w
CS 489/698 Big Data Infrastructure (Winter 2016) at the University of Waterloo
☆39Updated 9 years ago
Alternatives and similar repositories for bigdata-2016w:
Users that are interested in bigdata-2016w are comparing it to the libraries listed below
- Distributed Matrix Library☆71Updated 8 years ago
- Quickly start YARN cluster on EC2☆30Updated 7 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- A parallel IRWLS library to solve SVMs and budgeted SVMs☆59Updated 7 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 6 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Pydata NYC 2014 Scikit Learn Tutorial☆64Updated 10 years ago
- Logistic regression engine for medium-sized data☆55Updated 9 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 8 years ago
- Code and data for bike forecast post☆17Updated 10 years ago
- Data Science in Scala - Conf. Talk Repo☆15Updated 9 years ago
- My winning solution for Kaggle Higgs Machine Learning Challenge (single classifier, xgboost)☆82Updated 10 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- An API for Distributed Machine Learning☆154Updated 8 years ago
- Scikit-learn Tutorial at EuroPython 2014☆43Updated 6 years ago
- ☆46Updated 7 years ago
- Benchmarks of BLAS libraries with Scala interface☆30Updated 9 years ago
- Source code for the tutorial series at http://www.thoughtly.co/blog/prototype☆32Updated 10 years ago
- A Spark-based LexRank extractive summarizer for text documents☆19Updated 9 years ago
- MLSS 2016 material.☆22Updated 8 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 8 years ago
- Code to create benchmarks for Kaggle's Facebook Recruiting Competition☆85Updated 12 years ago
- Quick summary: This code implements a spectral (third order tensor decomposition) learning method for learning LDA topic model on Spark.☆105Updated 6 years ago
- ☆36Updated 9 years ago
- Course homepages for courses that I've taught at the University of Maryland☆55Updated 9 years ago
- ☆24Updated 9 years ago
- Incremental Random Forest☆48Updated 12 years ago