get-set-fetch / scraperLinks
Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.
☆112Updated 2 years ago
Alternatives and similar repositories for scraper
Users that are interested in scraper are comparing it to the libraries listed below
Sorting:
- House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.☆129Updated last week
- Base Docker images for Apify actors.☆89Updated this week
- A single tab web browser built with puppeteer. Also, no client-side JS. Viewport is streamed with MJPEG. For realz.☆60Updated 2 years ago
- Amazon products scraper with using of rotating proxies and headless Chrome from ScrapingAnt☆88Updated last year
- Web data extraction tool implemented as chrome extension with much more features☆46Updated 7 years ago
- An undetectable browser automation framework 🤖☆35Updated 4 years ago
- A simple puppeteer wrapper to enable useful plugins with ease☆57Updated this week
- Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other …☆31Updated 3 years ago
- Email automation driven by headless chrome.☆167Updated 4 years ago
- A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppetee…☆98Updated 3 years ago
- Automated functional testing via the Chrome DevTools Protocol. Easy to use and open source. Generates unique CSS and Xpath selectors. Out…☆58Updated 4 years ago
- Web data extraction tool implemented as chrome extension☆272Updated this week
- An alternative to sticking that lovely web app into an <iframe> on a corp website☆51Updated 4 years ago
- Hosted web-client for the browserless debugger☆50Updated 3 months ago
- Standalone puppeteer playground in browser's developer tools.☆239Updated 2 years ago
- Extract data from any website right in Chrome☆18Updated 7 years ago
- Node.JS library and cli for scraping websites using Puppeteer (or not) and YAML definitions☆47Updated 2 years ago
- The sheetiest REST API on the block.☆70Updated 5 years ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 4 years ago
- 🏁📑 Static site generator for landing pages, docs, and more☆43Updated 3 years ago
- Building extensible automation. Tideflow is a Realtime, open source workflows execution and monitorization web application.☆114Updated 2 years ago
- A collaborative low code headless CMS and Javascript framework for building collaborative no code platforms, apps and UI's. Build powerfu…☆33Updated 2 months ago
- Simple proxy rotation service☆30Updated 10 years ago
- Use plain HTML to connect your website to Google Sheets☆46Updated 2 years ago
- Chromium Browser Automation (extension for chrome browser automation).☆126Updated last year
- A case management app built with Lowdefy.☆32Updated last year
- You can use this act to monitor any page's content and get a notification when content changes.☆22Updated 3 years ago
- Cloud crawler functions for scrapeulous☆45Updated 4 years ago
- support page for the Chrome extension CSS Selector Capture☆50Updated last year
- ScrapingAnt API client for Python.☆43Updated last year