szilard / dataset-sizes-kdnuggetsLinks
Size of datasets used for analytics based on 10 years of surveys by KDnuggets.
☆16Updated 9 years ago
Alternatives and similar repositories for dataset-sizes-kdnuggets
Users that are interested in dataset-sizes-kdnuggets are comparing it to the libraries listed below
Sorting:
- R code to accompany Henrik Brink, Joseph W. Richards, and Mark Fetherolf's book "Real-World Machine Learning"☆61Updated 3 years ago
- Random forests for R for large data sets, optimized with parallel tree-growing and disk-based memory☆91Updated 10 years ago
- R package for exploratory data analysis☆120Updated 7 years ago
- Exploratory and diagnostic machine learning tools for R☆73Updated 4 years ago
- Anomalous time series package for R☆93Updated 7 years ago
- Simple scripts to setup a fresh data science box using an Ubuntu 12.04.* LTS 64-bit server running on an EC2☆163Updated 11 years ago
- A system for blending regression models in R☆76Updated 12 years ago
- 2D Outlier Analysis using Shiny☆48Updated 3 years ago
- Scalable R for Machine Learning☆43Updated 7 years ago
- A browser based R Notebook☆125Updated 12 years ago
- Information Package☆44Updated 9 years ago
- Very concise notes on machine learning and statistics.☆384Updated 13 years ago
- Materials for Nathan and Garrett's tutorial R for Big Data☆17Updated 9 years ago
- Materials for my PyData Seattle talk☆21Updated 10 years ago
- An R package to streamline the training, fine-tuning and predicting processes for deep learning based on 'darch' and 'deepnet'.☆46Updated 10 years ago
- ☆85Updated 8 years ago
- This repository contains my ML scripts in R☆33Updated 8 years ago
- spark backend for dplyr☆48Updated 9 years ago
- Simple employee cost/benefit model with plots. Supports a series of blog entries.☆70Updated 10 years ago
- Ensemble/Blender example in R using Caret (companion code for YouTube video: https://www.youtube.com/watch?v=k7sTiTWWCXM)☆11Updated 11 years ago
- ☆24Updated 7 years ago
- A minimal benchmark of various tools (statistical software, databases etc.) for working with tabular data of moderately large sizes (inte…☆89Updated 8 years ago
- Materials for a workshop on developing undergraduate classes on Bayesian statistics.☆47Updated 9 years ago
- Packing a couple of inspiring Google Analytics visualizations within a R Shiny Dashboard☆35Updated 9 years ago
- R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks☆122Updated 8 years ago
- A collection of data science examples implemented across a variety of languages and libraries.☆33Updated 9 years ago
- wrapper for the yhat API☆16Updated 8 years ago
- Deep neural networks on over 50 classification problems from the UC Irvine Machine Learning Repository☆26Updated 10 years ago
- R Shiny EC2 Bootstrap Guide☆81Updated 5 years ago
- Sparklyr Extensions API☆32Updated 9 years ago