drewconway / data_science_box
Simple scripts to set up a fresh data science box using an Ubuntu 12.04.* LTS 64-bit server running on EC2
☆163 Updated 11 years ago
Alternatives and similar repositories for data_science_box:
Users who are interested in data_science_box are comparing it to the repositories listed below
- ☆85 Updated 7 years ago
- Course materials for Sta 104 - Summer 2015 semester at Duke University ☆22 Updated 9 years ago
- A compendium of the pitfalls and problems that arise when using standard statistical methods ☆246 Updated 11 years ago
- ☆78 Updated 9 years ago
- A browser based R Notebook ☆125 Updated 11 years ago
- R code to accompany Henrik Brink, Joseph W. Richards, and Mark Fetherolf's book "Real-World Machine Learning" ☆61 Updated 2 years ago
- Links to slides for talks at the 2016 Joint Statistical Meetings in Chicago ☆79 Updated 3 years ago
- Simple employee cost/benefit model with plots. Supports a series of blog entries. ☆70 Updated 10 years ago
- Dev version of access to LinkedIn API via R ☆86 Updated 8 years ago
- Materials for my PyData Seattle talk ☆21 Updated 9 years ago
- Materials accompanying the presentation "Introduction to ggplot2" ☆28 Updated 8 years ago
- Data and Visualization of US ZIP Codes ☆49 Updated 4 years ago
- Showcase for using H2O and R for churn prediction (inspired by ZhouFang928 examples) ☆58 Updated 7 years ago
- More Than Words: Text and Context, Language Analytics in Finance (joint useR2016 tutorial presentation on text mining with Karthik Mokash… ☆13 Updated 6 years ago
- FiveThirtyEight replica ☆17 Updated 9 years ago
- Very concise notes on machine learning and statistics. ☆382 Updated 12 years ago
- Code examples referenced on CodeMine Blog ☆16 Updated 4 years ago
- Viewable pages from WinVector LLC; view at: http://winvector.github.io ☆23 Updated 3 months ago
- A companion book for the Coursera Regression Models class ☆54 Updated 5 years ago
- Statistical computations for visualisation ☆70 Updated 8 years ago
- Random forests for R for large data sets, optimized with parallel tree-growing and disk-based memory ☆91 Updated 9 years ago
- Notes on generalized linear models ☆110 Updated 6 years ago
- exploratory data analysis using random forests ☆69 Updated 7 years ago
- ARCHIVED Accesses the Monkeylearn API for Text Classifiers and Extractors ☆93 Updated 2 years ago
- Piketty in R ☆213 Updated 8 years ago
- Packing a couple of inspiring Google Analytics visualizations within an R Shiny Dashboard ☆34 Updated 8 years ago
- Materials for a workshop on developing undergraduate classes on Bayesian statistics. ☆47 Updated 8 years ago
- Materials for workshop "Data Visualization with R and ggplot2" ☆75 Updated 11 years ago
- cheatsheet for ggplot2 ☆46 Updated 10 years ago
- A package to run unit tests on tabular data ☆141 Updated 8 years ago