apify / actor-scraper
House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
☆128Updated last week
Alternatives and similar repositories for actor-scraper
Users who are interested in actor-scraper are comparing it to the libraries listed below
- Base Docker images for Apify actors.☆89Updated last week
- Email automation driven by headless chrome.☆167Updated 4 years ago
- Google Search SERP Scraper☆120Updated last month
- A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppetee…☆98Updated 3 years ago
- Cloud crawler functions for scrapeulous☆45Updated 4 years ago
- Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Support…☆112Updated 2 years ago
- Plugin for website-scraper which returns html for dynamic websites using puppeteer☆353Updated this week
- Chromium / Puppeteer site crawler☆48Updated 5 years ago
- Amazon crawler - this configuration will extract items for keywords that you specify in the input, and it will automatically extra…☆77Updated 5 years ago
- Library and CLI for automating captcha verification across multiple providers.☆122Updated 5 years ago
- Instagram automation driven by headless chrome.☆118Updated 2 years ago
- SerpApi client library for Node.js. Previously: Google Search Results Node.js.☆91Updated last year
- Distributed crawling infrastructure running on top of serverless computation, cloud storage (such as S3) and sophisticated queues.☆435Updated 2 years ago
- Apify SDK monorepo☆167Updated this week
- Amazon products scraper using rotating proxies and headless Chrome from ScrapingAnt☆87Updated last year
- A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and con…☆380Updated 2 years ago
- Javascript scraping module based on puppeteer for many different search engines...☆562Updated 2 years ago
- Proxies Puppeteer Page requests.☆214Updated last year
- NodeJs package for generating browser-like headers.☆71Updated 3 years ago
- Scrape Tripadvisor restaurant, hotels, and places.☆50Updated 3 years ago
- Node.JS library and cli for scraping websites using Puppeteer (or not) and YAML definitions☆47Updated 2 years ago
- Nodejs lib to parse Google SERP html pages☆47Updated 2 years ago
- JavaScript SDK for RudderStack - the Customer Data Platform for Developers.☆168Updated this week
- ExpressJs middleware for rendering PWA to bots using Puppeteer.☆121Updated 3 months ago
- Ayakashi.io - The next generation web scraping framework☆216Updated 2 years ago
- Extract data from any website right in Chrome☆18Updated 7 years ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 4 years ago
- Web data extraction tool implemented as chrome extension☆269Updated 2 weeks ago
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.☆132Updated last year
- Utilities and constants shared across Apify projects.☆17Updated this week