bagrow / datatools
Small scripts for quickly plotting and munging data from the command line.
☆37Updated 8 months ago
Related projects: ⓘ
- A set of command-line statistics tools☆29Updated 9 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- Library for GPU-related statistical functions☆84Updated 11 years ago
- ZIA Code Repository☆94Updated 11 years ago
- A web server interface for the R language☆49Updated 12 years ago
- John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm☆57Updated last month
- My IPython startup files.☆109Updated 9 years ago
- Simple comparison of Python and R for a basic OLS analysis☆41Updated 13 years ago
- baby steps in d3.js☆173Updated 12 years ago
- An R wrapper to the infochimps.com APIs☆38Updated 13 years ago
- A Python version (almost a port) of ProPublica's TableFu☆231Updated 11 years ago
- A command-line twitter client with smart filtering and statistical classification☆165Updated 13 years ago
- Code for High Performance Computing tutorial for EuroPython 2011☆100Updated 3 years ago
- An implementation of the gap statistic algorithm to compute the number of clusters in a set of numerical data.☆39Updated 13 years ago
- R driver for MongoDB☆83Updated 11 years ago
- a python port of https://github.com/twitter/twitter-text-rb also available via `pip install twitter_text`☆82Updated 6 years ago
- An analysis of adverse drug event data using Hadoop, R, and Gephi☆44Updated 8 years ago
- ☆21Updated 8 years ago
- This repository contains all code examples in Machine Learning for Email, by Drew Conway and John Myles White.☆101Updated 12 years ago
- trying shingling / resemblance / simhash / sketching to do some data deduping☆98Updated 9 years ago
- A set of convenience functions in R for exploring iPhone and iPad location data☆38Updated 13 years ago
- A script for rapidly sampling a proportion of lines from a file☆19Updated 9 years ago
- A python package for defensive data analysis.☆17Updated 9 years ago
- enable rapid iteration and development of complex data pipelines☆28Updated 6 years ago
- A git-blame viewer, written using PyGTK.☆35Updated 10 years ago
- IMPORTANT: Data Brewery is now Bubbles: https://github.com/stiivi/bubbles This brewery repository is NOT MAINTAINED any more.☆134Updated 11 years ago
- Randomly sample lines from a csv, tsv, or other line-based data file☆122Updated 9 years ago
- Where 2.0 Workshop Code: Spatial Analysis of Tweets using Hadoop, Pig, Python & Mechanical Turk. Slides here: http://www.slideshare.net/…☆134Updated 14 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 12 years ago