petewarden / common_crawl_types
A simple Ruby example of how to process Common Crawl files using Elastic MapReduce
☆30 Updated 12 years ago
Related projects
Alternatives and complementary repositories for common_crawl_types
- Text classifier in Ruby that uses Hadoop/HBase, Mongo, or Cassandra for storage. New location for http://github.com/livingsocial/ankusa ☆99 Updated 8 years ago
- A web interface for Notable ☆36 Updated last year
- Allows editing PostgreSQL hstore values as a JSON tree in ActiveAdmin ☆38 Updated 6 years ago
- Let’s make search a better experience for our users ☆40 Updated 7 years ago
- Common exception reporting for a variety of services ☆87 Updated last year
- Copy Active Record models from remote databases ☆265 Updated last year
- Add Basecamp-style subdomain authentication to Devise. ☆79 Updated 10 years ago
- Web analytics for your Rails apps using Redis ☆175 Updated 4 years ago
- Fast Ruby FFI string edit distance algorithms ☆81 Updated 11 years ago
- Web interface for MailyHerald, a Ruby on Rails email processing solution. ☆20 Updated 3 years ago
- Easy-to-use anomaly detection for Ruby ☆103 Updated 3 weeks ago
- A simple tokenizer in Ruby for NLP tasks. ☆46 Updated 7 years ago
- Incoming! helps you receive email in your Rack apps. ☆309 Updated 4 months ago
- Additional Sidekiq middleware ☆92 Updated 8 years ago
- High-availability extensions to the Elasticsearch::Rails standard tasks ☆21 Updated 8 years ago
- A pure Ruby interface to the WordNet database ☆89 Updated 5 years ago
- Statistical gender detection for Ruby ☆60 Updated 5 years ago
- Easily add a JSON endpoint to your Rails application that returns useful Sidekiq stats ☆33 Updated 2 years ago
- An opinionated Rails drip email engine that depends on ActiveRecord and ActionMailer ☆32 Updated last year
- Directed Acyclic Graph hierarchy for Rails' ActiveRecord models ☆55 Updated 2 years ago
- The gem that swaps out text with a fully-compliant Rails form in one click. ☆238 Updated 9 years ago
- Loves/is loved by polymorphic belongs_to associations, Ransack, Squeel, MetaSearch... ☆52 Updated 5 years ago
- Activejob Stats is an Activejob addon that collects data samples from your jobs and sends them to various stat servers ☆58 Updated 3 years ago
- Ruby binding for the Stanford POS Tagger and Named Entity Recognizer ☆92 Updated 10 years ago
- 🔬 Microscope adds useful scopes targeting ActiveRecord boolean, date and datetime fields. ☆55 Updated 2 years ago
- A Rack middleware for throttling and filtering requests ☆83 Updated 6 years ago
- Forklift: Moving big databases around. A Ruby ETL tool. ☆137 Updated 2 years ago