petewarden / common_crawl_typesLinks
A simple Ruby example of how to process Common Crawl files using Elastic MapReduce
☆29Updated 13 years ago
Alternatives and similar repositories for common_crawl_types
Users that are interested in common_crawl_types are comparing it to the libraries listed below
Sorting:
- Ruby implementation of the PageRank and TextRank algorithms.☆75Updated 6 months ago
- Ruby wrapper for correcting spelling and grammar mistakes based on the context of complete sentences.☆477Updated 6 years ago
- A pure Ruby interface to the WordNet database☆91Updated 6 years ago
- A Ruby wrapper for Latent Dirichlet Allocation (LDA).☆134Updated 5 years ago
- Find a lot of kinds of common information in a string. CommonRegex port for Ruby☆80Updated 3 years ago
- Implementation of the Rapid Automatic Keyword Extraction algorithm in Ruby, a multi-word keywords extraction.☆37Updated 11 years ago
- A ruby gem to convert numbers into English words and vice versa.☆79Updated last year
- Sentiment analysis with Machine Learning☆163Updated 2 years ago
- Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer☆92Updated 11 years ago
- allow edit Postgresql hstore values as json tree in ActiveAdmin☆39Updated 7 years ago
- Fast Ruby FFI string edit distance algorithms☆80Updated 12 years ago
- A multilingual tokenizer to split a string into tokens☆92Updated last year
- Expose libstemmer_c to Ruby☆250Updated 3 years ago
- Web interface for MailyHerald - Ruby on Rails email processing solution.☆20Updated 4 years ago
- Simple Rails 3.1+ gem for better timezone detection☆50Updated 12 years ago
- Advanced monitoring for Sidekiq☆230Updated 10 years ago
- ☆153Updated 8 years ago
- Project for filtering stopwords☆79Updated 2 years ago
- Structured, documented, powerful event emitting library for Mixpanel and other such systems☆75Updated 5 years ago
- Simple, dependency-free Wilson score☆161Updated 4 years ago
- Anomaly detection and forecasting for Ruby☆137Updated 7 months ago
- Categorize is a text categorization library written in Ruby. It prioritizes performance over accuracy and is built to run online in dynam…☆37Updated 12 years ago
- A rack middleware for throttling and filtering requests☆83Updated 7 years ago
- A simple tokenizer in Ruby for NLP tasks.☆46Updated 8 years ago
- Feature Sliders for Rails☆206Updated 2 weeks ago
- Locality Sensitive Hashing in Ruby☆33Updated 12 years ago
- An opinionated rails drip email engine that depends on ActiveRecord and ActionMailer☆32Updated 2 years ago
- Statistical gender detection for Ruby☆60Updated 6 years ago
- Copy active record models from remote databases☆265Updated 2 years ago
- Nickel extracts date, time, and message information from naturally worded text.☆118Updated 8 years ago