ptwobrussell / Mining-the-Social-Web
The official online compendium for Mining the Social Web (O'Reilly, 2011)
☆1,209Updated 10 years ago
Related projects: ⓘ
- Adaptations and Extensions of Twitter-Related Examples from Mining the Social Web☆382Updated 11 years ago
- Python module that allows one to easily write and run Hadoop programs.☆1,034Updated 6 years ago
- The official online compendium for Mining the Social Web, 2nd Edition (O'Reilly, 2013)☆2,898Updated 2 years ago
- MILK: Machine Learning Toolkit☆605Updated 9 years ago
- Yahoo!'s topic modelling framework using Latent Dirichlet Allocation☆338Updated 12 years ago
- Create beautiful tag clouds as images or HTML☆396Updated 5 years ago
- Twitter Tools☆216Updated 6 years ago
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs☆1,525Updated 7 years ago
- Train NLTK objects with zero code☆747Updated 4 years ago
- code for my O'Reilly masterclass videos☆312Updated 9 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆584Updated 10 years ago
- Rails app for tracking trends in server logs - powered by the Cloudera Hadoop Distribution on EC2☆356Updated 13 years ago
- Run MapReduce jobs on Hadoop or Amazon Web Services☆2,615Updated last year
- RHadoop☆763Updated 8 years ago
- ☆116Updated 12 years ago
- code for the ml class☆29Updated 13 years ago
- Common Crawl support library to access 2008-2012 crawl archives (ARC files)☆489Updated 6 years ago
- Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.☆243Updated 8 years ago
- ☆473Updated this week
- Indexing engine for IndexTank☆846Updated 12 years ago
- A collection of the best open data sets and open-source tools for data science☆1,124Updated 8 years ago
- Weave (Web-based Analysis and Visualization Environment)☆369Updated 5 years ago
- Where 2.0 Workshop Code: Spatial Analysis of Tweets using Hadoop, Pig, Python & Mechanical Turk. Slides here: http://www.slideshare.net/…☆134Updated 14 years ago
- [abandoned] python port of arc90's readability bookmarklet☆537Updated 13 years ago
- Crab is a flexible, fast recommender engine for Python that integrates classic information filtering recommendation algorithms in the world…☆1,176Updated 3 years ago
- Machine Learning cheat sheet for linear classifiers and clustering algorithms☆189Updated 3 years ago
- Code and slides in support of Data Bootcamp tutorial at Strata Conference 2011☆94Updated 13 years ago
- Timeline visualization application☆449Updated 14 years ago
- Lightning-fast cluster computing in Java, Scala and Python.☆1,426Updated 10 years ago
- A command-line twitter client with smart filtering and statistical classification☆165Updated 13 years ago