hermit-crab / ScrapeMate
Scraping assistant tool. Editing and maintaining CSS/XPath selectors across webpages.
☆102 Updated 6 years ago
Alternatives and similar repositories for ScrapeMate:
Users who are interested in ScrapeMate are comparing it to the repositories listed below.
- Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSO… ☆150 Updated 2 years ago
- Script that generates an RSS feed for websites that don't have one ☆31 Updated 6 years ago
- Offline-first web browser ☆87 Updated 6 years ago
- Admin UI for Scrapy / open source Scrapinghub ☆58 Updated 3 years ago
- Traditional "Web 2.0" social bookmarking, with small improvements ☆100 Updated 2 years ago
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. ☆113 Updated last year
- Google/Excel Sheets API Python. ☆70 Updated 7 months ago
- Rodo is a terminal-based todo manager written in Ruby ☆34 Updated 7 months ago
- Tool for real-time scraping of news articles. ☆39 Updated 5 years ago
- A simple Python wrapper for the archive.is capturing service ☆200 Updated 2 months ago
- Chrome extension to "Create WARC files from any webpage" ☆220 Updated last year
- Parse government documents into well-formed JSON ☆68 Updated 2 months ago
- Scrapy rotation proxy package with advanced functions ☆95 Updated 2 years ago
- Build a search index across content from multiple SQLite database tables and run faceted searches against it using Datasette ☆193 Updated 3 years ago
- Strip non-presentational content out of HTML pages ☆45 Updated 2 years ago
- The most boring open source you've ever seen .... ☆128 Updated last year
- Bookmark with a snooze button. Bookmark, buffer and complete your reading list. ☆91 Updated 2 years ago
- Webrecorder Desktop App! ☆205 Updated 4 years ago
- Screening emails workflow ☆100 Updated 5 months ago
- Save data from Google Takeout to a SQLite database ☆108 Updated last year
- Scrapy middleware that allows crawling only new content ☆80 Updated 2 years ago
- Simple podcast downloader (podcatcher) ☆56 Updated last month
- A helper library full of URL-related heuristics. ☆69 Updated 3 weeks ago
- Parses Firefox/Chrome HTML bookmarks files ☆49 Updated last year
- DIY Atom feeds in times of social media and paywalls ☆83 Updated 10 months ago
- API for extracting a table from an image or a PDF ☆91 Updated 7 months ago
- Navigator for Web Archive ☆155 Updated last year
- Chrome Extension for Hacker News and Reddit Links ☆35 Updated last year
- Web clipper browser extension for saving highlights, screenshots, and automatically extracting content from web pages. ☆372 Updated 3 years ago
- Extract text from HTML ☆135 Updated 4 years ago