petewarden / common_crawl_typesLinks
A simple Ruby example of how to process Common Crawl files using Elastic MapReduce
☆29Updated 13 years ago
Alternatives and similar repositories for common_crawl_types
Users that are interested in common_crawl_types are comparing it to the libraries listed below
Sorting:
- Ruby implementation of the PageRank and TextRank algorithms.☆75Updated 2 months ago
- A pure Ruby interface to the WordNet database☆91Updated 5 years ago
- Web interface for MailyHerald - Ruby on Rails email processing solution.☆20Updated 4 years ago
- allow edit Postgresql hstore values as json tree in ActiveAdmin☆38Updated 7 years ago
- Ruby wrapper for correcting spelling and grammar mistakes based on the context of complete sentences.☆478Updated 5 years ago
- Find a lot of kinds of common information in a string. CommonRegex port for Ruby☆80Updated 3 years ago
- Let’s make search a better experience for our users☆40Updated 8 years ago
- A Ruby wrapper for Latent Dirichlet Allocation (LDA).☆133Updated 4 years ago
- Download, unpack from a ZIP/TAR/GZ/BZ2 archive, parse, correct, convert units and import Google Spreadsheets, XLS, ODS, XML, CSV, HTML, e…☆306Updated 11 years ago
- Structured, documented, powerful event emitting library for Mixpanel and other such systems☆75Updated 5 years ago
- Advanced monitoring for Sidekiq☆230Updated 10 years ago
- Find and rank keywords in text☆207Updated 5 years ago
- Locality Sensitive Hashing in Ruby☆33Updated 11 years ago
- A multilingual tokenizer to split a string into tokens☆91Updated last year
- A simple tokenizer in Ruby for NLP tasks.☆45Updated 8 years ago
- Asynchronous Web Crawler & Scraper☆143Updated 2 years ago
- Incoming! helps you receive email in your Rack apps.☆307Updated last year
- Statistical gender detection for Ruby☆60Updated 5 years ago
- A foundation of knowledge and libraries for solid analytics☆43Updated 7 years ago
- Text readability analyzer using Flesch-Kincaid and others☆68Updated 5 years ago
- Expose libstemmer_c to Ruby☆250Updated 3 years ago
- The gem that swaps out text with a fully-compliant Rails form in one click.☆238Updated 10 years ago
- Full Excel/CSV Import/Export facilities for Rails☆136Updated 3 years ago
- Text classifier in Ruby that uses Hadoop/HBase, Mongo, or Cassandra for storage. New location for http://github.com/livingsocial/ankusa☆100Updated 9 years ago
- Self-Imposed Rate Limiting for Ruby☆54Updated 2 years ago
- Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer☆92Updated 11 years ago
- Web analytics for your rails apps using Redis☆175Updated 5 years ago
- Project for filtering stopwords☆78Updated last year
- Anomaly detection and forecasting for Ruby☆136Updated 4 months ago
- Distributed locks (mutexes & semaphores) using Memcached or Redis☆119Updated 3 years ago