get-set-fetch / scraper
Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.
☆112Updated 2 years ago
Alternatives and similar repositories for scraper
Users that are interested in scraper are comparing it to the libraries listed below
Sorting:
- web scraping extension☆82Updated last week
- NodeJs package for generating browser-like headers.☆71Updated 2 years ago
- 🧱 A uniform template to use as a foundation for Puppeteer bot construction.☆66Updated 4 years ago
- Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other …☆30Updated 3 years ago
- Hosted web-client for the browserless debugger☆46Updated 6 months ago
- A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppetee…☆94Updated 2 years ago
- Base Docker images for Apify actors.☆76Updated this week
- Utilities and constants shared across Apify projects.☆14Updated this week
- Cloud crawler functions for scrapeulous☆45Updated 4 years ago
- A simple puppeteer wrapper to enable useful plugins with ease☆56Updated this week
- A puppeteer-extra plugin to remotely view and interact with puppeteer sessions. Essentially opening a "portal" to the page.☆53Updated last year
- An undetectable browser automation framework 🤖☆32Updated 3 years ago
- All In One API to easily scrape data from any website, without worrying about captchas and bot detection mecanisms.☆22Updated last year
- The sheetiest REST API on the block.☆70Updated 4 years ago
- Detects the presence of anti-bot and fingerprinting technologies on websites by analyzing requests, headers, cookies, and more. Built on …☆46Updated 7 months ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 3 years ago
- A single tab web browser built with puppeteer. Also, no client-side JS. Viewport is streamed with MJPEG. For realz.☆56Updated last year
- Web data extraction tool implemented as chrome extension with much more features☆47Updated 6 years ago
- House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.☆121Updated 2 years ago
- automatic and extensive scraper for forums☆26Updated 3 weeks ago
- 📡 expose browser devtools port publicly with TLS and authentication.☆17Updated 8 months ago
- DronaHQ offers a low-code platform to build internal tools. Drag-and-drop UI components and connect them to your databases and APIs to bu…☆58Updated 3 weeks ago
- Automated functional testing via the Chrome DevTools Protocol. Easy to use and open source. Generates unique CSS and Xpath selectors. Out…☆55Updated 4 years ago
- command line Google search and save to JSON☆106Updated last year
- ☆26Updated 2 years ago
- ☆115Updated last year
- Amazon products scraper with using of rotating proxies and headless Chrome from ScrapingAnt☆81Updated last year
- A case management app built with Lowdefy.☆32Updated last year
- Home of the Ulixee Open Data Platform☆50Updated 5 months ago
- Generates realistic browser fingerprints☆78Updated 2 years ago