petewarden / common_crawl_types
A simple Ruby example of how to process Common Crawl files using Elastic MapReduce
☆30Updated 12 years ago
Related projects: ⓘ
- Text classifier in Ruby that uses Hadoop/HBase, Mongo, or Cassandra for storage. New location for http://github.com/livingsocial/ankusa☆99Updated 8 years ago
- A Ruby gem that provides parsing and output of person names, as well as Gender & Ethnicity matching.☆51Updated last month
- Statistical gender detection for Ruby☆60Updated 4 years ago
- Ruby implementation of the PageRank and TextRank algorithms.☆75Updated 8 years ago
- Ruby port of UEALite Stemmer - a conservative stemmer for search and indexing☆53Updated last year
- A pure Ruby interface to the WordNet database☆89Updated 5 years ago
- A Ruby wrapper for Latent Dirichlet Allocation (LDA).☆133Updated 3 years ago
- Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer☆92Updated 10 years ago
- Provides access to the Alchemy text mining API - http://www.alchemyapi.com/☆56Updated 4 years ago
- A simple tokenizer in Ruby for NLP tasks.☆45Updated 7 years ago
- Fast Ruby FFI string edit distance algorithms☆81Updated 11 years ago
- Categorize is a text categorization library written in Ruby. It prioritizes performance over accuracy and is built to run online in dynam…☆37Updated 11 years ago
- high availability extensions to the Elasticsearch::Rails standard tasks☆21Updated 8 years ago
- Incoming! helps you receive email in your Rack apps.☆309Updated 2 months ago
- A multilingual tokenizer to split a string into tokens☆90Updated last month
- Fast access to database results without the memory overhead of ActiveRecord objects☆39Updated 3 weeks ago
- Simple Rails 3.1+ gem for better timezone detection☆50Updated 11 years ago
- Implementation of the Rapid Automatic Keyword Extraction algorithm in Ruby, a multi-word keywords extraction.☆37Updated 10 years ago
- A foundation of knowledge and libraries for solid analytics☆40Updated 6 years ago
- Web analytics for your rails apps using Redis☆174Updated 4 years ago
- Rails-Engine-Gem that offers an admin interface for trusted user☆85Updated last year
- Ruby gem to semi-automatically redact confidential information from a text☆14Updated 8 years ago
- Use Minuteman easily in your Rails app☆64Updated 10 years ago
- Ruby library for Finance math.☆56Updated 2 months ago
- Easy-to-use anomaly detection for Ruby☆101Updated last month
- *UNMAINTAINED* HTTMultiParty is a thin wrapper around HTTParty to provide multipart uploads.☆137Updated 3 years ago
- Accurate Bayesian sentence tokenizer in Ruby.☆80Updated 10 years ago
- Heroku buildpack with jemalloc☆11Updated 6 years ago
- allow edit Postgresql hstore values as json tree in ActiveAdmin☆38Updated 6 years ago
- Add basecamp style subdomain authentication to devise.☆79Updated 10 years ago