hermit-crab / ScrapeMate
Scraping assistant tool for editing and maintaining CSS/XPath selectors across webpages.
☆101 · Updated 6 years ago
Related projects
Alternatives and complementary repositories for ScrapeMate
- Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSO… ☆150 · Updated last year
- Traditional "Web 2.0" social bookmarking, with small improvements ☆100 · Updated last year
- Bookmark with a snooze button. Bookmark, buffer and complete your reading list. ☆91 · Updated last year
- Comprehensive wrapper and execution manager for the Chrome browser using the Chrome Debugging Protocol. ☆219 · Updated last year
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. ☆109 · Updated 9 months ago
- Rewriting web proxy and archival tool. At this point, it just tries to download all the things. ☆199 · Updated this week
- Google/Excel Sheets API Python. ☆70 · Updated 3 months ago
- Offline-first web browser ☆84 · Updated 5 years ago
- Script that generates an RSS feed out of websites that don't have one ☆30 · Updated 5 years ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency. ☆187 · Updated 2 years ago
- Admin UI for Scrapy/open-source Scrapinghub ☆58 · Updated 3 years ago
- RSS feed reader for Python 3 ☆85 · Updated last year
- Scrapy middleware that allows crawling only new content ☆79 · Updated 2 years ago
- DIY Atom feeds in times of social media and paywalls ☆81 · Updated 5 months ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head ☆169 · Updated 4 years ago
- A Python desktop application that makes the use of freelancing subreddits easier and faster. ☆35 · Updated 4 years ago
- Software stack with latest Scrapy and updated deps ☆62 · Updated this week
- Parses Firefox/Chrome HTML bookmarks files ☆47 · Updated 7 months ago
- Tool for real-time scraping of news articles. ☆39 · Updated 5 years ago
- Creates a GitHub index for discovering similar repositories ☆193 · Updated 8 years ago
- ☆40 · Updated 3 years ago
- ☆78 · Updated 2 years ago
- Extract text from HTML ☆131 · Updated 4 years ago
- Flask code to deploy an API that pulls structured data from online news articles ☆230 · Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆42 · Updated 5 years ago
- Remote client for distributed automated HTTP(s) content fetching. ☆77 · Updated this week
- Intelligent Bookmarking ☆261 · Updated last year
- Simple podcast downloader (podcatcher) ☆56 · Updated last year
- Simple Web UI for Scrapy spider management via Scrapyd ☆50 · Updated 6 years ago