website-scraper/node-website-scraper

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/website-scraper/node-website-scraper)

website-scraper / node-website-scraper

Download website to local directory (including all css, images, js, etc.)

☆1,739

Alternatives and similar repositories for node-website-scraper

Users that are interested in node-website-scraper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

IonicaBizau / scrape-it
View on GitHub
🔮 A Node.js scraper for humans.
☆4,074Jul 7, 2026Updated 2 weeks ago
matthewmueller / x-ray
View on GitHub
The next web scraper. See through the <html> noise.
☆5,904May 6, 2026Updated 2 months ago
puppeteer / puppeteer
View on GitHub
JavaScript API for Chrome and Firefox
☆95,334Updated this week
rchipka / node-osmosis
View on GitHub
Web scraper for NodeJS
☆4,107Dec 13, 2023Updated 2 years ago
cheeriojs / cheerio
View on GitHub
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
☆30,425Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bda-research / node-crawler
View on GitHub
Web Crawler/Spider for NodeJS + server-side jQuery ;-)
☆6,797Jun 18, 2026Updated last month
lovell / sharp
View on GitHub
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
☆32,491Updated this week
vercel / pkg
View on GitHub
Package your Node.js project into an executable
☆24,365Jan 3, 2024Updated 2 years ago
microlinkhq / metascraper
View on GitHub
Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.
☆2,714Updated this week
apify / crawlee
View on GitHub
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data …
☆24,916Updated this week
checkly / headless-recorder
View on GitHub
Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script.
☆15,289Dec 16, 2022Updated 3 years ago
ruipgil / scraperjs
View on GitHub
A complete and versatile web scraper.
☆3,716Oct 18, 2020Updated 5 years ago
simplecrawler / simplecrawler
View on GitHub
Flexible event driven crawler for node.
☆2,134Mar 7, 2021Updated 5 years ago
yujiosaka / headless-chrome-crawler
View on GitHub
Distributed crawler powered by Headless Chrome
☆5,642Apr 29, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jsdom / jsdom
View on GitHub
A JavaScript implementation of various web standards, for use with Node.js
☆21,615Updated this week
naptha / tesseract.js
View on GitHub
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
☆38,557May 17, 2026Updated 2 months ago
typicode / lowdb
View on GitHub
Simple and fast JSON database
☆22,566Mar 27, 2026Updated 3 months ago
nextapps-de / flexsearch
View on GitHub
Next-generation full-text search library for Browser and Node.js
☆13,759Jun 28, 2026Updated 3 weeks ago
strapi / strapi
View on GitHub
🚀 Strapi is the leading open-source headless CMS. It’s 100% JavaScript/TypeScript, fully customizable, and developer-first.
☆72,711Updated this week
fastify / fastify
View on GitHub
Fast and low overhead web framework, for Node.js
☆36,777Updated this week
transitive-bullshit / awesome-puppeteer
View on GitHub
A curated list of awesome puppeteer resources.
☆2,560Jul 19, 2024Updated 2 years ago
jprichardson / node-fs-extra
View on GitHub
Node.js: extra methods for the fs object like copy(), remove(), mkdirs()
☆9,604Jun 29, 2026Updated 3 weeks ago
OptimalBits / bull
View on GitHub
Premium Queue package for handling distributed jobs and messages in NodeJS.
☆16,245Updated this week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
request / request
View on GitHub
🏊🏾 Simplified HTTP request client.
☆25,532Aug 14, 2024Updated last year
transloadit / uppy
View on GitHub
The next open source file uploader for web browsers
☆30,888Updated this week
fb55 / htmlparser2
View on GitHub
The fast & forgiving HTML and XML parser
☆4,785Updated this week
nexe / nexe
View on GitHub
🎉 create a single executable out of your node.js apps
☆13,570Mar 5, 2026Updated 4 months ago
krisk / Fuse
View on GitHub
Lightweight fuzzy-search, in JavaScript
☆20,410Jul 13, 2026Updated last week
jimp-dev / jimp
View on GitHub
An image processing library written entirely in JavaScript for Node, with zero external or native dependencies.
☆14,656Apr 7, 2026Updated 3 months ago
gatsbyjs / gatsby
View on GitHub
React-based framework with performance, scalability, and security built in.
☆55,949Updated this week
cassidoo / scrapers
View on GitHub
A list of scrapers from around the web.
☆728Feb 7, 2025Updated last year
microsoft / playwright
View on GitHub
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
☆93,320Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tj / commander.js
View on GitHub
node.js command-line interfaces made easy
☆28,328Updated this week
pinojs / pino
View on GitHub
🌲 super fast, all natural json logger
☆18,070Updated this week
motdotla / dotenv
View on GitHub
Loads environment variables from .env for nodejs projects.
☆20,498Updated this week
Unitech / pm2
View on GitHub
Node.js/Bun Production Process Manager with a built-in Load Balancer.
☆43,240Jul 2, 2026Updated 3 weeks ago
ai / nanoid
View on GitHub
A tiny (118 bytes), secure, URL-friendly, unique string ID generator for JavaScript
☆26,899Updated this week
pubkey / rxdb
View on GitHub
The local-first database that runs on every JS runtime and replicates with your existing backend - no vendor, no lock-in - https://rxdb.i…
☆23,285Updated this week
sindresorhus / got
View on GitHub
🌐 Human-friendly and powerful HTTP request library for Node.js
☆14,927Updated this week