petewarden / common_crawl_types
A simple Ruby example of how to process Common Crawl files using Elastic MapReduce
☆30Updated 12 years ago
Alternatives and similar repositories for common_crawl_types:
Users that are interested in common_crawl_types are comparing it to the libraries listed below
- A pure Ruby interface to the WordNet database☆89Updated 5 years ago
- A Ruby wrapper for Latent Dirichlet Allocation (LDA).☆133Updated 4 years ago
- Web interface for MailyHerald - Ruby on Rails email processing solution.☆20Updated 4 years ago
- Provides access to the Alchemy text mining API - http://www.alchemyapi.com/☆56Updated 4 years ago
- Simple Rails 3.1+ gem for better timezone detection☆50Updated 11 years ago
- Download, unpack from a ZIP/TAR/GZ/BZ2 archive, parse, correct, convert units and import Google Spreadsheets, XLS, ODS, XML, CSV, HTML, e…☆302Updated 10 years ago
- Text classifier in Ruby that uses Hadoop/HBase, Mongo, or Cassandra for storage. New location for http://github.com/livingsocial/ankusa☆99Updated 9 years ago
- Let’s make search a better experience for our users☆40Updated 7 years ago
- Ruby port of UEALite Stemmer - a conservative stemmer for search and indexing☆53Updated 2 years ago
- The gem that swaps out text with a fully-compliant Rails form in one click.☆237Updated 9 years ago
- Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer☆92Updated 10 years ago
- A Ruby gem that provides parsing and output of person names, as well as Gender & Ethnicity matching.☆55Updated 5 months ago
- An opinionated rails drip email engine that depends on ActiveRecord and ActionMailer☆32Updated last year
- Find a lot of kinds of common information in a string. CommonRegex port for Ruby☆79Updated 3 years ago
- allow edit Postgresql hstore values as json tree in ActiveAdmin☆38Updated 6 years ago
- Incoming! helps you receive email in your Rack apps.☆309Updated 6 months ago
- Statistical gender detection for Ruby☆60Updated 5 years ago
- Fast Ruby FFI string edit distance algorithms☆80Updated 11 years ago
- Implementation of the Rapid Automatic Keyword Extraction algorithm in Ruby, a multi-word keywords extraction.☆37Updated 11 years ago
- Clearbit Ruby library☆52Updated last month
- Adds benchmarking methods to Sidekiq workers, keeps metrics and adds tab to Web UI to let you browse them☆144Updated 2 years ago
- A web interface for Clockwork☆39Updated 3 weeks ago
- Message bus via the background queue you're already using.☆62Updated 5 months ago
- resque plugin to add unique jobs☆36Updated last year
- InfluxDB ActiveRecord-style☆117Updated 3 months ago
- Simple kNN Classifier written in Ruby☆60Updated 3 years ago
- Rails-Engine-Gem that offers an admin interface for trusted user☆85Updated last year