stewartmckee / cobweb
Web crawler with very flexible crawling options. It can be used standalone or with Resque to perform clustered crawls.
☆225 Updated 2 years ago
Alternatives and similar repositories for cobweb
Users who are interested in cobweb are comparing it to the libraries listed below.
- Polipus: distributed and scalable web-crawler framework ☆92 Updated 10 years ago
- PredictionIO Ruby SDK ☆191 Updated 6 years ago
- Download, unpack from a ZIP/TAR/GZ/BZ2 archive, parse, correct, convert units and import Google Spreadsheets, XLS, ODS, XML, CSV, HTML, e… ☆306 Updated 11 years ago
- Collection of filters that transform plain text into HTML code. ☆800 Updated 2 months ago
- Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf) ☆502 Updated 2 years ago
- Asynchronous Web Crawler & Scraper ☆143 Updated 2 years ago
- A gem to screencap webpages in Ruby. Uses Phantom.js under the hood. ☆180 Updated 6 years ago
- A quick and easy way to visually test your Rails application's API. ☆726 Updated 10 years ago
- Modular, extensible open-source ecommerce solution for Ruby on Rails. No longer under development. ☆272 Updated 6 years ago
- Ruby Binding for Stanford Pos-Tagger and Named Entity Recognizer ☆92 Updated 11 years ago
- Find and rank keywords in text ☆207 Updated 5 years ago
- Fast and efficient recommendations and predictions using Redis ☆503 Updated 4 years ago
- A simple, flexible, extensible, and liberal RSS and Atom reader for Ruby. It is designed to be backwards compatible with the standard RSS… ☆225 Updated 2 years ago
- A generalized Rack middleware for importing contacts from major email providers. ☆477 Updated 5 years ago
- Unirest in Ruby: Simplified, lightweight HTTP client library. ☆364 Updated 3 months ago
- Tor-privoxy is a Ruby Mechanize wrapper to access the web with Mechanize via Tor/Privoxy. It allows the use of multiple Privoxy instances, swi… ☆84 Updated 8 years ago
- Add group and membership functionality to your Rails models ☆194 Updated 6 years ago
- A Ruby natural language processor. ☆163 Updated 3 years ago
- Multi-fetch Fragments makes rendering and caching a collection of template partials easier and faster. ☆538 Updated 7 years ago
- Expose libstemmer_c to Ruby ☆250 Updated 3 years ago
- Binary uuid keys in Rails ☆339 Updated 5 years ago
- ☆277 Updated 5 years ago
- Ruby regular expressions library ☆125 Updated last year
- Tool for extracting pages from PDF files as images and text as strings. ☆227 Updated last year
- A Ruby C wrapper for Open Text Summarizer ☆205 Updated 13 years ago
- Simple colored logging for Rails 4 and 5 apps ☆69 Updated 7 years ago
- NOTICE: official repository moved to https://github.com/amerine/acts_as_tree ☆285 Updated 17 years ago
- FasterCSV is CSV, but faster, smaller, and cleaner. ☆177 Updated 10 years ago
- Captures a web page as a screenshot. ☆214 Updated last year
- A Ruby library to template Microsoft Word .docx files. Generates new Word .docx files based on a template file. Does templating entirely … ☆147 Updated 3 years ago