nierman / thinkstats
code etc for Think Stats book http://greenteapress.com/thinkstats/
☆18Updated 13 years ago
Related projects ⓘ
Alternatives and complementary repositories for thinkstats
- Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.☆243Updated 8 years ago
- Example code for "Web-Scale Computer Vision using MapReduce for Multimedia Data Mining"☆49Updated 14 years ago
- Experimental parallel data analysis toolkit.☆120Updated 3 years ago
- A Python wrapper for Cascading☆222Updated 4 years ago
- Toy single-machine implementation of the Pregel graph-based framework☆114Updated 7 years ago
- C++ native client for Impala and Hive, with Python / pandas bindings☆73Updated 6 years ago
- Example code for running R on Hadoop☆133Updated 12 years ago
- HDFS client for Python☆63Updated 13 years ago
- MILK: Machine Learning Toolkit☆605Updated 9 years ago
- Hadoop (Utilities, Patches and Examples)☆242Updated 8 years ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Updated 3 years ago
- Transactional and indexing extensions for hbase☆73Updated 13 years ago
- Mahout vector encoding for pig☆54Updated last year
- Zohmg is a data store for aggregation of multi-dimensional time series data, built on top of Hadoop, Dumbo and HBase.☆174Updated 12 years ago
- code for my O'Reilly masterclass videos☆312Updated 9 years ago
- Material for talk "Machine Learning 101" https://speakerdeck.com/kastnerkyle/pycon2015 https://us.pycon.org/2015/schedule/presentation/36…☆87Updated 9 years ago
- Chapter-wise code for Agile Data the O'Reilly book☆157Updated 10 years ago
- Python language Plugin for elasticsearch☆103Updated 5 years ago
- ☆75Updated 11 years ago
- Trident-ML : A realtime online machine learning library☆382Updated 10 months ago
- example code for "Large-scale social media analysis with Hadoop" tutorial presented at ICWSM 2010☆42Updated 14 years ago
- Python module that allows one to easily write and run Hadoop programs.☆1,035Updated 6 years ago
- HBase as the backing store for the TF-IDF representations for Lucene☆108Updated 14 years ago
- Oozie - workflow engine for Hadoop☆373Updated 7 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- Tool to help users migrate large relational databases into Hadoop clusters.☆67Updated 12 years ago
- Mirror of Apache HCatalog☆61Updated last year