petewarden / common_crawl_types
A simple Ruby example of how to process Common Crawl files using Elastic MapReduce
☆29Updated 12 years ago
Alternatives and similar repositories for common_crawl_types:
Users that are interested in common_crawl_types are comparing it to the libraries listed below
- Web interface for MailyHerald - Ruby on Rails email processing solution.☆20Updated 4 years ago
- Common exception reporting for a variety of services☆86Updated last year
- allow edit Postgresql hstore values as json tree in ActiveAdmin☆38Updated 6 years ago
- Self-Imposed Rate Limiting for Ruby☆53Updated 2 years ago
- A simple tokenizer in Ruby for NLP tasks.☆46Updated 7 years ago
- Ruby wrapper for correcting spelling and grammar mistakes based on the context of complete sentences.☆478Updated 5 years ago
- A Ruby wrapper for Latent Dirichlet Allocation (LDA).☆133Updated 4 years ago
- A pure Ruby interface to the WordNet database☆89Updated 5 years ago
- Rails console history for Heroku, Docker, and more☆80Updated last month
- Categorize is a text categorization library written in Ruby. It prioritizes performance over accuracy and is built to run online in dynam…☆37Updated 11 years ago
- Find a lot of kinds of common information in a string. CommonRegex port for Ruby☆80Updated 3 years ago
- An opinionated rails drip email engine that depends on ActiveRecord and ActionMailer☆32Updated 2 years ago
- A multilingual tokenizer to split a string into tokens☆91Updated 6 months ago
- The gem that swaps out text with a fully-compliant Rails form in one click.☆237Updated 9 years ago
- Ruby wrapper for the Paymill API☆84Updated 4 years ago
- Implementation of the Rapid Automatic Keyword Extraction algorithm in Ruby, a multi-word keywords extraction.☆37Updated 11 years ago
- Simple Rails 3.1+ gem for better timezone detection☆50Updated 11 years ago
- Structured, documented, powerful event emitting library for Mixpanel and other such systems☆75Updated 5 years ago
- Job status and batches for Active Job☆69Updated 6 years ago
- Let’s make search a better experience for our users☆40Updated 7 years ago
- Advanced monitoring for Sidekiq☆230Updated 9 years ago
- resque plugin to add unique jobs☆35Updated last year
- Ruby implementation of the PageRank and TextRank algorithms.☆75Updated 9 years ago
- A Rack middleware for Rails >= 3.1.0 with asset pipeline and asset digest enabled. This middleware is used to redirect any request to sta…☆72Updated 7 years ago
- A rack middleware for throttling and filtering requests☆83Updated 6 years ago
- Ruby gem for GrooveHQ api☆24Updated 6 years ago
- calculate_all method for aggregate functions in Active Record☆125Updated 6 years ago
- Adds benchmarking methods to Sidekiq workers, keeps metrics and adds tab to Web UI to let you browse them☆145Updated 2 years ago
- Incoming! helps you receive email in your Rack apps.☆308Updated 7 months ago
- Additional sidekiq middleware☆92Updated 8 years ago