szilard / benchm-databases
A minimal benchmark of various tools (statistical software, databases etc.) for working with tabular data of moderately large sizes (interactive data analysis).
☆90Updated 7 years ago
Alternatives and similar repositories for benchm-databases:
Users that are interested in benchm-databases are comparing it to the libraries listed below
- R code to accompany Henrik Brink, Joseph W. Richards, and Mark Fetherolf's book "Real-World Machine Learning"☆61Updated 2 years ago
- Exploratory data analysis for large datasets (10-100 million observations)☆290Updated 9 years ago
- Standard API for Distributed Data Structures in R☆118Updated 7 years ago
- ☆133Updated 7 years ago
- Syberia: The development framework for R☆146Updated 6 years ago
- Statistical computations for visualisation☆70Updated 8 years ago
- Links to slides for talks at the 2016 Joint Statistical Meetings in Chicago☆79Updated 3 years ago
- Anomalous time series package for R☆92Updated 7 years ago
- Slides and code for the 2016 useR! tutorial "Never Tell Me the Odds! Machine Learning with Class Imbalances"☆39Updated 8 years ago
- spark backend for dplyr☆48Updated 9 years ago
- Exploratory and diagnostic machine learning tools for R☆73Updated 3 years ago
- Notes on generalized linear models☆110Updated 6 years ago
- exploratory data analysis using random forests☆69Updated 7 years ago
- 2D Outlier Analysis using Shiny☆48Updated 2 years ago
- animated and interactive web graphics☆147Updated 5 years ago
- Showcase for using H2O and R for churn prediction (inspired by ZhouFang928 examples)☆58Updated 7 years ago
- ☆88Updated 9 years ago
- A simpler ggplot2 syntax, saving half of your typing.☆79Updated 6 years ago
- A script for rapidly sampling a proportion of lines from a file☆19Updated 10 years ago
- ☆78Updated 9 years ago
- R package for exploratory data analysis☆120Updated 7 years ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr…☆285Updated 3 months ago
- ☆85Updated 7 years ago
- Detailed Visualization of Large Complex Data in R☆115Updated 8 years ago
- ☆22Updated 8 years ago
- Random forests for R for large data sets, optimized with parallel tree-growing and disk-based memory☆91Updated 9 years ago
- A companion book for the Coursera Regression Models class☆54Updated 5 years ago
- Get web applications growing in R☆85Updated 7 years ago
- ARCHIVED Accesses the Monkeylearn API for Text Classifiers and Extractors☆93Updated 2 years ago
- R/foreach Redis backend for parallel computing☆71Updated 3 years ago