website-scraper / node-website-scraper
Download website to local directory (including all css, images, js, etc.)
☆1,657 Updated last week
Alternatives and similar repositories for node-website-scraper
Users who are interested in node-website-scraper are comparing it to the libraries listed below.
- A complete and versatile web scraper. ☆3,719 Updated 5 years ago
- Flexible event driven crawler for node. ☆2,134 Updated 4 years ago
- Puppeteer (Headless Chrome Node API)-based rendering solution. ☆545 Updated 3 years ago
- Declarative DOM extraction expression evaluator. 👨‍⚕️ ☆689 Updated 5 years ago
- Automatically extract body content (and other cool stuff) from an html document ☆2,161 Updated 2 years ago
- A JavaScript library for generating random user agents with data that's updated daily. ☆1,131 Updated 2 months ago
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more. ☆2,590 Updated last week
- The headless Chrome/Chromium driver on top of Puppeteer. ☆1,760 Updated this week
- A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and con… ☆381 Updated 2 years ago
- 🔮 A Node.js scraper for humans. ☆4,064 Updated 2 months ago
- Capture screenshots of websites ☆1,996 Updated last month
- The next web scraper. See through the <html> noise. ☆5,906 Updated 3 weeks ago
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more! ☆1,688 Updated 3 years ago
- A curated list of awesome puppeteer resources. ☆2,527 Updated last year
- Web Crawler/Spider for NodeJS + server-side jQuery ;-) ☆6,785 Updated 6 months ago
- Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining. ☆957 Updated this week
- Sitemap-generating framework for node.js ☆1,692 Updated 2 weeks ago
- Run Puppeteer code in the cloud ☆735 Updated last year
- Web crawler for Node.JS ☆256 Updated 7 years ago
- Easily create XML sitemaps for your website. ☆447 Updated last year
- Advanced html to text converter ☆1,679 Updated 2 years ago
- Node.js module and CLI tool to get proxies from publicly available proxy lists. ☆627 Updated 4 years ago
- artoo.js - the client-side scraping companion. ☆1,117 Updated 4 years ago
- Distributed crawler powered by Headless Chrome ☆5,673 Updated 2 years ago
- Javascript scraping module based on puppeteer for many different search engines... ☆565 Updated 2 years ago
- Easy website screenshots in Node.js ☆2,119 Updated 6 years ago
- Web data extraction tool implemented as chrome extension ☆1,348 Updated 7 years ago
- Plugin for website-scraper which returns html for dynamic websites using PhantomJS. ☆58 Updated 3 years ago
- Puppeteer Pool, run a cluster of instances in parallel ☆3,494 Updated 2 weeks ago
- RSS feed generator for Node. ☆1,040 Updated 3 weeks ago