ReedD / crawler
Chromium / Puppeteer site crawler
☆48Updated 5 years ago
Alternatives and similar repositories for crawler:
Users who are interested in crawler are comparing it to the libraries listed below.
- Robust text renderer using headless chrome.☆66Updated last year
- GitHub automation driven by headless chrome.☆18Updated 4 years ago
- Convenience functions for Puppeteer☆25Updated 2 years ago
- Node.js library and CLI for scraping websites using Puppeteer (or not) and YAML definitions☆44Updated 2 years ago
- HTML5 Canvas implementation for NodeJS backed by Puppeteer☆65Updated last year
- Floodesh is a distributed web spider written with Node.js.☆13Updated 4 years ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 3 years ago
- the mega scraper - scrape a website's content☆27Updated 4 years ago
- A `htmlparser2` handler for parsing rich metadata from HTML. Includes HTML metadata, JSON-LD, RDFa, microdata, OEmbed, Twitter cards and …☆54Updated last year
- Get Instagram images 🌄 (hashtags, accounts, locations) with Puppeteer☆75Updated 3 weeks ago
- Add NextJS to Moleculer! 🎉☆11Updated 6 years ago
- A node.js module to help identify browser sessions☆59Updated 2 weeks ago
- Extracts email address from an arbitrary text input.☆62Updated 2 months ago
- Live query. Mirror part of a DB on the client.☆12Updated 8 years ago
- ☆21Updated 5 years ago
- Auto Create Sequelize ORM Models from PostgreSQL.☆22Updated 6 years ago
- Google Search SERP Scraper☆109Updated last year
- The Chrome extension for rrweb, which helps run rrweb on any website out of the box☆19Updated 2 years ago
- Extracts prices from an arbitrary text input.☆16Updated 6 years ago
- HTML template editor for quickly working with handlebars and liquid templates.☆16Updated 2 years ago
- Capture website thumbnails using the urlbox screenshot as a service API in node☆25Updated 6 months ago
- Highly scalable crawler with best features.☆11Updated 8 years ago
- A Better Scraper, with Puppeteer☆43Updated 5 months ago
- Extracts time from an arbitrary text input.☆18Updated 5 years ago
- ⚡️ Next-generation data transformation framework for TypeScript that puts developer experience first☆53Updated 3 years ago
- Koa 2 CRUD middleware for Mongoose models.☆11Updated 2 years ago
- A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and con…☆380Updated 2 years ago
- Email automation driven by headless chrome.☆166Updated 4 years ago
- a puppeteer walker 🕷 🕸☆79Updated 4 years ago
- A Node.js library to easily handle all your notification templates.☆31Updated 7 years ago