bagrow / datatoolsLinks
Small scripts for quickly plotting and munging data from the command line.
☆38Updated last year
Alternatives and similar repositories for datatools
Users that are interested in datatools are comparing it to the libraries listed below
Sorting:
- A set of command-line statistics tools☆29Updated 9 years ago
- An R wrapper to the infochimps.com APIs☆38Updated 14 years ago
- A web server interface for the R language☆52Updated 13 years ago
- This repository contains all code examples in Machine Learning for Email, by Drew Conway and John Myles White.☆101Updated 13 years ago
- A Python version (almost a port) of ProPublica's TableFu☆231Updated 11 years ago
- JSON -> Relational DB Column Types☆63Updated 2 years ago
- A command-line twitter client with smart filtering and statistical classification☆165Updated 14 years ago
- OAuth wrapper for cURL on the command line☆118Updated 8 years ago
- Python Development Emacs Environment☆51Updated last week
- baby steps in d3.js☆172Updated 13 years ago
- a python port of https://github.com/twitter/twitter-text-rb also available via `pip install twitter_text`☆82Updated 7 years ago
- Python client library for controlling Google Refine☆83Updated 8 years ago
- Where 2.0 Workshop Code: Spatial Analysis of Tweets using Hadoop, Pig, Python & Mechanical Turk. Slides here: http://www.slideshare.net/…☆134Updated 15 years ago
- File format conversion tools☆291Updated 4 years ago
- TweeQL is a Query Language for Tweets: SELECT brand(text) AS brand, sentiment(text) AS sentiment FROM twitter_sample;☆194Updated 11 years ago
- A framework (comand line tool + libraries) for creating flexible compute pipelines☆56Updated 4 years ago
- trying shingling / resemblance / simhash / sketching to do some data deduping☆99Updated 9 years ago
- Randomly sample lines from a csv, tsv, or other line-based data file☆125Updated 10 years ago
- ZIA Code Repository☆97Updated 11 years ago
- Parser and standardizer for politician, individual and organization names.☆129Updated 8 years ago
- A script for rapidly sampling a proportion of lines from a file☆19Updated 10 years ago
- vbench: A tool for benchmarking your code through time, for showing performance improvement or regressions☆244Updated 7 years ago
- IPython Notebook + D3☆128Updated 10 years ago
- Run IPython, Pattern, NLTK, Pandas, NumPy, SciPy, Numba, Biopython inside Docker☆47Updated 11 years ago
- DEPRECATED: THIS REPOSITORY IS NO LONGER IN USE: PLEASE SEE swcarpentry/styles INSTEAD.☆22Updated 8 years ago
- Tools for text tokenization and encoding☆84Updated 3 years ago
- R driver for MongoDB☆82Updated 12 years ago
- workflow support for reproducible deduplication and merging☆16Updated 2 years ago
- Proposals for new Jupyter subprojects to enter into incubation☆18Updated 4 years ago
- Code for High Performance Computing tutorial for EuroPython 2011☆104Updated 3 years ago