stewartmckee / cobwebLinks
Web crawler with very flexible crawling options. Can either use standalone or can be used with resque to perform clustered crawls.
☆225Updated 3 years ago
Alternatives and similar repositories for cobweb
Users that are interested in cobweb are comparing it to the libraries listed below
Sorting:
- Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf)☆502Updated 2 years ago
- Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer☆92Updated 11 years ago
- PredictionIO Ruby SDK☆191Updated 7 years ago
- Collection of filters that transform plain text into HTML code.☆801Updated last month
- Download, unpack from a ZIP/TAR/GZ/BZ2 archive, parse, correct, convert units and import Google Spreadsheets, XLS, ODS, XML, CSV, HTML, e…☆305Updated 11 years ago
- Polipus: distributed and scalable web-crawler framework☆92Updated 10 years ago
- Google Translate (with Bulk translate) in Ruby☆334Updated 3 weeks ago
- Collection of text algorithms. gem install text☆584Updated 10 years ago
- Multi-fetch Fragments makes rendering and caching a collection of template partials easier and faster.☆538Updated 7 years ago
- A quick and easy way to visually test your Rails application's API.☆726Updated 10 years ago
- Find and rank keywords in text☆206Updated 6 years ago
- Add group and membership functionality to your Rails models☆196Updated 6 years ago
- A Ruby natural language processor.☆163Updated 4 years ago
- A language detection library for Ruby that uses bloom filters for speed.☆682Updated 3 years ago
- A gem to screencap webpages in ruby. Uses Phantom.js under the hood.☆180Updated 6 years ago
- Binary uuid keys in Rails☆337Updated 5 years ago
- A ruby library for TTS & ASR document preparation☆101Updated 3 years ago
- Approximate String Matching library☆388Updated 2 months ago
- Ruby language bindings for LIBSVM☆279Updated 2 years ago
- Ruby bindings to the Stanford Core NLP tools (English, French, German).☆436Updated 6 months ago
- Asynchronous Web Crawler & Scraper☆142Updated 2 years ago
- Ruby regular expressions library☆124Updated last year
- Rack middleware for rate-limiting incoming HTTP requests configured to be used with Redis.☆272Updated 5 years ago
- Tool for extracting pages from pdf as images and text as strings.☆230Updated 2 years ago
- Fast and efficient recommendations and predictions using Redis☆502Updated 4 years ago
- Modular, extensible open-source ecommerce solution for Ruby on Rails. No longer under development.☆271Updated 7 years ago
- A simple wrapper for the standard ruby OpenSSL library☆338Updated last month
- carrierwave extension to use ffmpeg to transcode videos to html5-friendly format☆187Updated 7 years ago
- Allows file upload using FTP for CarrierWave uploaders.☆84Updated 2 years ago
- Ruby bindings to the OpenNLP Java toolkit.☆91Updated 6 months ago