ssalevan / cc-helloworldLinks
CommonCrawl Hello World example
☆33Updated 11 years ago
Alternatives and similar repositories for cc-helloworld
Users that are interested in cc-helloworld are comparing it to the libraries listed below
Sorting:
- Common Crawl support library to access 2008-2012 crawl archives (ARC files)☆501Updated 7 years ago
- Some utilities for Lucene☆110Updated 12 years ago
- A scrapy-based Hacker News crawler.☆151Updated 12 years ago
- (glow)$ gci -m 'glow is an application for counting firefox downloads'☆76Updated 14 years ago
- ☆116Updated 13 years ago
- distributed twitter search engine☆78Updated 13 years ago
- playing around with the common crawl dataset☆70Updated 12 years ago
- The API, BackOffice, Storefront, and Nebulizer for IndexTank☆382Updated 12 years ago
- GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework☆293Updated 3 years ago
- This is an educational example of a data mining web application: when is good time to post on HN☆359Updated 12 years ago
- Bulk loading for elastic search☆185Updated last year
- Java implementation of a probabilistic set data structure☆144Updated 8 years ago
- example code for "Large-scale social media analysis with Hadoop" tutorial presented at ICWSM 2010☆42Updated 15 years ago
- Tool to help users migrate large relational databases into Hadoop clusters.☆67Updated 13 years ago
- displays edit activity on wikipedia☆233Updated 5 years ago
- Social sentiment flagger intended to judge given text as: positive, neutral or negative.☆130Updated 13 years ago
- [not maintained] Custom Twitter Search via ElasticSearch&Wicket☆60Updated 4 years ago
- Deploy static sites to App Engine by pushing to GitHub☆224Updated 3 years ago
- Client page for the Glow project☆99Updated 3 years ago
- ☆104Updated last year
- Exception catcher that runs on Google App Engine☆74Updated 12 years ago
- Etsy's little framework for A/B testing, feature ramp up, and more.☆128Updated 9 years ago
- ☆59Updated 13 years ago
- code for the ml class☆29Updated 13 years ago
- A demonstration of how to use BrowserID.☆100Updated 12 years ago
- Boilerplate setup for App Engine with html5-boilerplate 2.0, OpenID, memcache, user preferences, and more☆185Updated 13 years ago
- A graph database for python built on top of cassandra☆49Updated 10 years ago
- A social cookbook based on Facebook's Open Graph☆207Updated 13 years ago
- ElasticSearch Javascript client☆85Updated 14 years ago
- redis monitoring tool built with flask and redis-py☆61Updated 13 years ago