Web crawler with very flexible crawling options. It can be used standalone or with Resque to perform clustered crawls.
☆224Dec 1, 2022Updated 3 years ago
Alternatives and similar repositories for cobweb
Users that are interested in cobweb are comparing it to the libraries listed below.
- Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.☆1,360Apr 7, 2026Updated last week
- A mean little DSL'd poltergeist (capybara) based web crawler that stuffs data into your Rails app.☆21Jul 18, 2013Updated 12 years ago
- Slim for Volt framework☆13Jan 5, 2016Updated 10 years ago
- A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fas…☆835Jan 12, 2026Updated 3 months ago
- A simple, fast web crawler written in Ruby using Watir or Typhoeus☆16Jan 26, 2024Updated 2 years ago
- A re-implementation of rmmseg (Chinese word segmentation library for Ruby) in C++ in new rake-compiler☆16Apr 18, 2016Updated 9 years ago
- Ruby gem for web scraping purposes. It scrapes a given URL, and returns you its title, meta description, meta keywords, links, images...☆1,046Apr 9, 2026Updated last week
- A rails tagging gem implementing flickr's machine tags + maybe more (semantic tags)☆44May 1, 2013Updated 12 years ago
- Anemone web-spider framework☆1,606Mar 20, 2020Updated 6 years ago
- Asynchronous Web Crawler & Scraper☆143Mar 23, 2023Updated 3 years ago
- Machine Learning & Data Mining with JRuby☆65Dec 23, 2025Updated 3 months ago
- Ruby library - Fill out PDF form with FDF/XFDF via pdftk☆16Sep 28, 2021Updated 4 years ago
- An add-on gem for spelling suggestions in Thinking Sphinx☆58Mar 3, 2020Updated 6 years ago
- Extracts machine-readable metadata and content from Web pages☆745Apr 10, 2019Updated 7 years ago
- Extract useful data from HTML and XML with ease!☆57Jan 3, 2018Updated 8 years ago
- An Evented Beanstalk Client☆64Jul 27, 2012Updated 13 years ago
- Collection of text algorithms. gem install text☆585Apr 13, 2015Updated 11 years ago
- A ruby gem that can convert HTML to formatted plain text.☆42Feb 14, 2019Updated 7 years ago
- Ruby Microdata parser for RDF.rb☆33Jan 8, 2024Updated 2 years ago
- Struct with keyword arguments support☆18Jan 8, 2023Updated 3 years ago
- A Ruby DSL for structured web crawling, with a robust caching system.☆255Mar 19, 2024Updated 2 years ago
- Partisan is a Ruby library that allows ActiveRecord records to be followers and followables.☆32Dec 21, 2017Updated 8 years ago
- Mechanize is a ruby library that makes automated web interaction easy.☆4,441Feb 20, 2026Updated last month
- Adds Jade support to Brunch☆30Jan 3, 2017Updated 9 years ago
- Demo app showing how you can filter a table in a Ruby on Rails app with StimulusReflex☆17Jan 19, 2023Updated 3 years ago
- Full Excel/CSV Import/Export facilities for Rails☆135Sep 17, 2021Updated 4 years ago
- jQuery Templates for the Rails asset pipeline.☆46Jan 20, 2016Updated 10 years ago
- Rails gem for managing web site agreements (terms, privacy policy, etc).☆40Jan 7, 2025Updated last year
- A simple Ruby example of how to process Common Crawl files using Elastic MapReduce☆29Mar 25, 2012Updated 14 years ago
- postgresql fuzzywuzzy extension☆12Feb 27, 2019Updated 7 years ago
- Micro web app for Text-To-Speech conversion via HTTP powered by Ruby, Roda, lame, espeak and espeak-ruby.☆27Nov 28, 2021Updated 4 years ago
- Web crawler with a Ruby API☆44Feb 17, 2021Updated 5 years ago
- Reduce misspelled email addresses in Ruby.☆40Jul 17, 2021Updated 4 years ago
- A gem to record data about Ruby code execution in Neo4j for analysis☆18Jan 12, 2016Updated 10 years ago
- Rails engine to manage and supervise your batch jobs. Based on sidekiq.☆34May 23, 2024Updated last year
- DEPRECATED - Ruby library for generating text with declarative recursive grammars, a fork of maetl/Calyx☆28Jul 30, 2016Updated 9 years ago
- Ruby wrapper for phantomjs☆53Feb 29, 2020Updated 6 years ago
- Ruby on Rails course☆15Nov 23, 2015Updated 10 years ago
- Wgit enables you to crawl and extract the data you want from the web☆16Aug 19, 2025Updated 7 months ago