szilard / dataset-sizes-kdnuggetsLinks
Size of datasets used for analytics based on 10 years of surveys by KDnuggets.
☆16Updated 10 years ago
Alternatives and similar repositories for dataset-sizes-kdnuggets
Users that are interested in dataset-sizes-kdnuggets are comparing it to the libraries listed below
Sorting:
- R code to accompany Henrik Brink, Joseph W. Richards, and Mark Fetherolf's book "Real-World Machine Learning"☆61Updated 3 years ago
- Simple scripts to setup a fresh data science box using an Ubuntu 12.04.* LTS 64-bit server running on an EC2☆163Updated 12 years ago
- Random forests for R for large data sets, optimized with parallel tree-growing and disk-based memory☆91Updated 10 years ago
- R package for exploratory data analysis☆119Updated 8 years ago
- Data and code for the Dataists R recommendation system contest☆75Updated 15 years ago
- Anomalous time series package for R☆93Updated 7 years ago
- Sparklyr Extensions API☆32Updated 9 years ago
- Very concise notes on machine learning and statistics.☆384Updated 13 years ago
- Viewable pages from WinVector LLC view at: http://winvector.github.io☆23Updated last year
- Information Package☆44Updated 9 years ago
- Exploratory and diagnostic machine learning tools for R☆74Updated 4 years ago
- Simple employee cost/benefit model with plots. Supports a series of blog entries.☆70Updated 11 years ago
- A simple dataset of Stack Overflow questions and tags☆109Updated 8 years ago
- Statistical Learning & Data Mining IV - H2O Presenation & Tutorial☆26Updated 8 years ago
- Scalable R for Machine Learning☆43Updated 7 years ago
- A browser based R Notebook☆125Updated 12 years ago
- R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks☆123Updated 8 years ago
- Deep Learning for Pugs☆74Updated 8 years ago
- A system for blending regression models in R☆76Updated 13 years ago
- R package for sentiment text analysis☆21Updated 10 years ago
- Materials for a workshop on developing undergraduate classes on Bayesian statistics.☆47Updated 9 years ago
- spark backend for dplyr☆48Updated 10 years ago
- Docker container for Shiny Server☆14Updated 9 years ago
- ☆85Updated 8 years ago
- A minimal benchmark of various tools (statistical software, databases etc.) for working with tabular data of moderately large sizes (inte…☆89Updated 8 years ago
- Deep neural networks on over 50 classification problems from the UC Irvine Machine Learning Repository☆27Updated 10 years ago
- Kaggle R docker image☆149Updated 8 months ago
- A collection of data science examples implemented across a variety of languages and libraries.