website-scraper / node-website-scraperLinks
Download website to local directory (including all css, images, js, etc.)
☆1,638Updated last week
Alternatives and similar repositories for node-website-scraper
Users that are interested in node-website-scraper are comparing it to the libraries listed below
Sorting:
- Plugin for website-scraper which returns html for dynamic websites using puppeteer☆343Updated last week
- Flexible event driven crawler for node.☆2,140Updated 4 years ago
- A complete and versatile web scraper.☆3,720Updated 4 years ago
- A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and con…☆380Updated 2 years ago
- The headless Chrome/Chromium driver on top of Puppeteer.☆1,733Updated last week
- Puppeteer (Headless Chrome Node API)-based rendering solution.☆542Updated 3 years ago
- Puppeteer Pool, run a cluster of instances in parallel☆3,431Updated last month
- Plugin for website-scraper which returns html for dynamic websites using PhantomJS.☆59Updated 3 years ago
- A JavaScript library for generating random user agents with data that's updated daily.☆1,082Updated this week
- A curated list of awesome puppeteer resources.☆2,506Updated last year
- Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)☆500Updated 5 years ago
- Puppeteer example scripts for running Headless Chrome from Node.☆3,054Updated 4 years ago
- Declarative DOM extraction expression evaluator. 👨⚕️☆692Updated 5 years ago
- The next web scraper. See through the <html> noise.☆5,899Updated this week
- Demo app for website-scraper module☆85Updated last year
- Additional module to use with 'puppeteer' for setting proxies per page basis.☆450Updated last year
- Chrome extension that allows easy extraction of CSS and HTML from selected element.☆1,103Updated 5 years ago
- Capture screenshots of websites☆1,984Updated 8 months ago
- Use case-driven examples for using Puppeteer and headless chrome☆2,405Updated last week
- Advanced html to text converter☆1,664Updated last year
- RSS feed generator for Node.☆1,033Updated 4 months ago
- Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.☆933Updated 2 months ago
- Automatically extract body content (and other cool stuff) from an html document☆2,156Updated 2 years ago
- Easily create XML sitemaps for your website.☆438Updated last year
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.☆2,521Updated last month
- Creates an XML-Sitemap by crawling a given site.☆333Updated 2 years ago
- Distributed crawler powered by Headless Chrome☆5,586Updated 2 years ago
- A chrome extension to record your actions into a nightmare or puppeteer script☆2,766Updated 9 months ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.☆344Updated 7 years ago
- Run Puppeteer code in the cloud☆736Updated last year