get-set-fetch / scraper
Node.js web scraper. Includes a command-line interface, Docker container, Terraform module, and Ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.
☆109 Updated last year
Alternatives and similar repositories for scraper:
Users who are interested in scraper are comparing it to the libraries listed below:
- Parses OTP messages for a verification code and service provider. ☆24 Updated 2 years ago
- A single tab web browser built with puppeteer. Also, no client-side JS. Viewport is streamed with MJPEG. For realz. ☆54 Updated last year
- web scraping extension ☆78 Updated 4 months ago
- Phantombuster's SDK ☆14 Updated 3 months ago
- Extracts email address from an arbitrary text input. ☆61 Updated this week
- Cloud crawler functions for scrapeulous ☆44 Updated 3 years ago
- All-in-one API to easily scrape data from any website, without worrying about captchas and bot detection mechanisms. ☆21 Updated last year
- Web data extraction tool implemented as a Chrome extension with many more features ☆46 Updated 6 years ago
- Playwright Docker Images. (Ubuntu, Alpine) x (ARMv8, x64) x (Chromium, Firefox, WebKit, Chrome, Edge) ☆46 Updated 3 months ago
- A puppeteer-extra plugin to remotely view and interact with puppeteer sessions. Essentially opening a "portal" to the page. ☆49 Updated last year
- Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other … ☆29 Updated 3 years ago
- House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases. ☆119 Updated last year
- Man in the middle using Playwright ☆26 Updated last year
- Amazon products scraper using rotating proxies and headless Chrome from ScrapingAnt ☆80 Updated 10 months ago
- Aliexpress.com scraper developed for Apify ☆1 Updated this week
- Utilities and constants shared across Apify projects. ☆12 Updated this week
- Create "perfect" snapshots of web pages ☆32 Updated last month
- Hosted web-client for the browserless debugger ☆45 Updated 3 months ago
- Amazon affiliate link storefront powered by a Reddit scraper ☆14 Updated 7 years ago
- ☆14 Updated 2 years ago
- Standalone puppeteer playground in browser's developer tools. ☆217 Updated last year
- KeepLink is a simple bookmark service with tags and archive built with Supabase and Next.js. It doesn't have any social sharing feature a… ☆69 Updated last year
- 📡 expose browser devtools port publicly with TLS and authentication. ☆16 Updated 4 months ago
- ☆11 Updated 7 years ago
- NodeJs package for generating browser-like headers. ☆65 Updated 2 years ago
- Extract data from any website right in Chrome ☆17 Updated 6 years ago
- Google Search SERP Scraper ☆105 Updated last year
- The sheetiest REST API on the block. ☆70 Updated 4 years ago
- Automates the process of repeatedly searching for a website via scraped proxy IP and search keywords ☆43 Updated last year
- Generate a list of keywords from any text. ☆31 Updated 4 years ago