petewarden / common_crawl_types
A simple Ruby example of how to process Common Crawl files using Elastic MapReduce
☆29Updated 13 years ago
Alternatives and similar repositories for common_crawl_types:
Users that are interested in common_crawl_types are comparing it to the libraries listed below
- Web interface for MailyHerald - Ruby on Rails email processing solution.☆20Updated 4 years ago
- Ruby implementation of the PageRank and TextRank algorithms.☆75Updated 9 years ago
- A pure Ruby interface to the WordNet database☆90Updated 5 years ago
- A multilingual tokenizer to split a string into tokens☆91Updated 8 months ago
- Let’s make search a better experience for our users☆40Updated 7 years ago
- A web interface for Clockwork☆40Updated 3 weeks ago
- Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer☆92Updated 10 years ago
- Categorize is a text categorization library written in Ruby. It prioritizes performance over accuracy and is built to run online in dynam…☆37Updated 11 years ago
- Common exception reporting for a variety of services☆86Updated last year
- allow edit Postgresql hstore values as json tree in ActiveAdmin☆38Updated 6 years ago
- Web analytics for your rails apps using Redis☆175Updated 5 years ago
- A data migration and visualization command line gem in Ruby☆245Updated 5 years ago
- A web interface for Notable☆37Updated 2 years ago
- Copy active record models from remote databases☆265Updated last year
- An opinionated rails drip email engine that depends on ActiveRecord and ActionMailer☆32Updated 2 years ago
- Text classifier in Ruby that uses Hadoop/HBase, Mongo, or Cassandra for storage. New location for http://github.com/livingsocial/ankusa☆100Updated 9 years ago
- Incoming! helps you receive email in your Rack apps.☆308Updated 9 months ago
- Self-Imposed Rate Limiting for Ruby☆53Updated 2 years ago
- high availability extensions to the Elasticsearch::Rails standard tasks☆21Updated 9 years ago
- Advanced monitoring for Sidekiq☆230Updated 9 years ago
- Additional sidekiq middleware☆92Updated 8 years ago
- Resumable upload protocol implementation in Ruby☆44Updated 10 years ago
- Find a lot of kinds of common information in a string. CommonRegex port for Ruby☆80Updated 3 years ago
- resque plugin to add unique jobs☆35Updated last year
- A simple tokenizer in Ruby for NLP tasks.☆45Updated 8 years ago
- Official Ruby Gem for Kraken API☆35Updated 9 years ago
- Easy-to-use anomaly detection for Ruby☆103Updated this week
- Statistical gender detection for Ruby☆60Updated 5 years ago
- A redis-backed Bayesian classifier☆38Updated 9 years ago
- Simple Rails 3.1+ gem for better timezone detection☆50Updated 11 years ago