hermit-crab / ScrapeMate
Scraping assistant tool. Editing and maintaining CSS/XPath selectors across webpages.
☆101 Updated 6 years ago
Alternatives and similar repositories for ScrapeMate
Users who are interested in ScrapeMate are comparing it to the libraries listed below:
- Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSO… ☆150 Updated 2 years ago
- script that generates an rss feed out of websites that don't have one ☆31 Updated 6 years ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON. ☆69 Updated 3 years ago
- Traditional "Web 2.0" social bookmarking, with small improvements ☆100 Updated 2 years ago
- Tool for real-time scraping of news articles. ☆39 Updated 5 years ago
- Scrapy middleware that allows crawling only new content ☆80 Updated 2 years ago
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. ☆113 Updated last year
- admin ui for scrapy/open source scrapinghub ☆58 Updated 3 years ago
- Save data from Google Takeout to a SQLite database ☆108 Updated last year
- Extract text from HTML ☆134 Updated 4 years ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency. ☆190 Updated 2 years ago
- Table Sorter ☆21 Updated 8 years ago
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support. ☆109 Updated 10 months ago
- Strip non-presentational content out of HTML pages ☆45 Updated 2 years ago
- python api wrapper for https://mercury.postlight.com/web-parser/ ☆23 Updated last year
- Plugin based RSS feed generator for sites that don't offer any. Serves RSS, Atom and JSON Feeds. ☆88 Updated 3 years ago
- DIY Atom feeds in times of social media and paywalls ☆83 Updated 10 months ago
- Comprehensive wrapper and execution manager for the Chrome browser using the Chrome Debugging Protocol. ☆221 Updated 3 months ago
- Offline-first web browser ☆86 Updated 6 years ago
- Nunux Keeper web app ☆107 Updated 3 years ago
- The most boring open source you've ever seen .... ☆128 Updated last year
- Chrome extension to "Create WARC files from any webpage" ☆219 Updated last year
- Screening emails workflow ☆99 Updated 4 months ago
- Tools to easily generate an RSS feed that contains each scraped item using the Scrapy framework. ☆33 Updated 4 months ago
- A Scrapy middleware to bypass CloudFlare's anti-bot protection ☆109 Updated 3 years ago
- Build a search index across content from multiple SQLite database tables and run faceted searches against it using Datasette ☆193 Updated 3 years ago
- 🦛 scrapes websites and generates rss feeds ☆53 Updated last month
- Create a SQLite database containing data from your Pocket account ☆105 Updated last year
- Full text search all your browsing history ☆97 Updated this week
- Notetaking Electron app that can answer your questions and makes summaries for you ☆90 Updated 2 years ago