website-scraper / node-website-scraper
Download website to local directory (including all css, images, js, etc.)
☆1,609Updated last month
Alternatives and similar repositories for node-website-scraper
Users that are interested in node-website-scraper are comparing it to the libraries listed below
Sorting:
- Plugin for website-scraper which returns html for dynamic websites using puppeteer☆334Updated last month
- Web scraper for NodeJS☆4,111Updated last year
- A complete and versatile web scraper.☆3,717Updated 4 years ago
- 🔮 A Node.js scraper for humans.☆4,053Updated 2 weeks ago
- Flexible event driven crawler for node.☆2,142Updated 4 years ago
- A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and con…☆381Updated 2 years ago
- Declarative DOM extraction expression evaluator. 👨⚕️☆696Updated 4 years ago
- Puppeteer (Headless Chrome Node API)-based rendering solution.☆540Updated 2 years ago
- The headless Chrome/Chromium driver on top of Puppeteer.☆1,706Updated this week
- RSS feed generator for Node.☆1,029Updated last month
- Web Crawler/Spider for NodeJS + server-side jQuery ;-)☆6,757Updated 5 months ago
- An isomorphic JavaScript client for the WordPress REST API☆1,682Updated last year
- Node.js module and CLI tool to get proxies from publicly available proxy lists.☆625Updated 3 years ago
- --DEPRECATED -- 🛑 🛑 Node.js library to bypass cloudflare's anti-ddos page☆604Updated 5 years ago
- Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)☆502Updated 4 years ago
- The next web scraper. See through the <html> noise.☆5,898Updated 2 weeks ago
- Use case-driven examples for using Puppeteer and headless chrome☆2,392Updated last month
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.☆2,444Updated this week
- artoo.js - the client-side scraping companion.☆1,110Updated 4 years ago
- Easy website screenshots in Node.js☆2,120Updated 5 years ago
- Puppeteer example scripts for running Headless Chrome from Node.☆3,055Updated 4 years ago
- Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.☆910Updated 2 weeks ago
- Additional module to use with 'puppeteer' for setting proxies per page basis.☆444Updated 11 months ago
- Examples and supplementary documentation for Nightmare☆250Updated 7 years ago
- 🖥🔋Web Extension starter to build "Write Once Run on Any Browser" extension☆2,096Updated last year
- A curated list of awesome puppeteer resources.☆2,474Updated 9 months ago
- A website to find long-tail keywords using search suggestions☆211Updated 2 years ago
- Automatically extract body content (and other cool stuff) from an html document☆2,158Updated last year
- 📕 Barebones boilerplate with Parcel 2, options handler and auto-publishing☆808Updated 3 months ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.☆344Updated 6 years ago