hermit-crab / ScrapeMate
Scraping assistant tool. Editing and maintaining CSS/XPath selectors across webpages.
☆101Updated 6 years ago
Alternatives and similar repositories for ScrapeMate:
Users that are interested in ScrapeMate are comparing it to the libraries listed below
- Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSO…☆150Updated last year
- Tool for real-time scraping of news articles.☆39Updated 5 years ago
- Build a search index across content from multiple SQLite database tables and run faceted searches against it using Datasette☆191Updated 3 years ago
- Scrapy middleware which allows to crawl only new content☆80Updated 2 years ago
- Offline-first web browser☆85Updated 6 years ago
- Comprehensive wrapper and execution manager for the Chrome browser using the Chrome Debugging Protocol.☆220Updated last month
- script that generates an rss feed out of websites that don't have one☆31Updated 5 years ago
- Save data from Google Takeout to a SQLite database☆107Updated last year
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.☆112Updated 11 months ago
- Scrapy rotation proxy package with advanced functions☆94Updated 2 years ago
- Traditional "Web 2.0" social bookmarking, with small improvements☆100Updated last year
- admin ui for scrapy/open source scrapinghub☆58Updated 3 years ago
- Creates github index for similar repositories discovery☆192Updated 8 years ago
- Screening emails workflow☆99Updated 2 months ago
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives)☆125Updated 2 months ago
- a high-performance, lightweight and human friendly serving engine for scrapy☆29Updated 3 years ago
- Create high-quality images programmatically with easily-hackable templates.☆182Updated 4 months ago
- An algorithm for generating robust XPath locators for web testing.☆180Updated 2 years ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆189Updated 2 years ago
- Privacy-preserving Firefox extension linking to Hacker News discussion; built with Bloom filters and WebAssembly☆85Updated this week
- ☆13Updated 6 years ago
- Record browser actions then replay immediately. Craft your own custom automation workflows.☆65Updated 5 years ago
- Table Sorter☆21Updated 7 years ago
- Flask code to deploy an API that pulls structured data from online news articles☆229Updated 2 years ago
- A complimentary proxy to help to use SPM with headless browsers☆109Updated last year
- Simple podcast downloader (podcatcher)☆56Updated last year
- DIY Atom feeds in times of social media and paywalls☆83Updated 8 months ago
- Extract text from HTML☆133Updated 4 years ago
- A diagram of my personal infrastructure☆45Updated 3 years ago
- The most boring open source you've ever seen ....☆127Updated last year