ReedD / crawlerLinks
Chromium / Puppeteer site crawler
☆49Updated 5 years ago
Alternatives and similar repositories for crawler
Users that are interested in crawler are comparing it to the libraries listed below
Sorting:
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 3 years ago
- A Better Scraper, with Puppeteer☆43Updated 6 months ago
- Floodesh is a distributed web spider written with Nodejs.☆13Updated 4 years ago
- ExpressJs middleware for rendering PWA to bots using Puppeteer.☆121Updated 3 months ago
- A `htmlparser2` handler for parsing rich metadata from HTML. Includes HTML metadata, JSON-LD, RDFa, microdata, OEmbed, Twitter cards and …☆55Updated last year
- Convenience functions for the Puppeteer☆25Updated 2 years ago
- House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.☆121Updated 2 years ago
- Language agnostic named entity recognizer☆39Updated 2 years ago
- Wish your facebook friends from command line, never miss the wish! http://npm.im/facebook-birthday-cli☆58Updated 5 years ago
- Instagram get images 🌄 (hashtags, account, locations) with puppeteer☆75Updated 2 months ago
- A simple component to check the status of a domain (whois, availability, expired, PR, TrustFlow, ...)☆33Updated 8 years ago
- Nodejs lib to parse Google SERP html pages☆47Updated last year
- Automatically extracts structured information from webpages☆109Updated 2 years ago
- HTML5 Canvas implementation for NodeJS backed by Puppeteer☆65Updated 2 years ago
- Example project demonstrating Headless Chrome + Puppeteer running in their own individual containers.☆70Updated 2 years ago
- Node.JS library and cli for scraping websites using Puppeteer (or not) and YAML definitions☆44Updated 2 years ago
- JS samples using Puppeteer☆19Updated 7 years ago
- 🕸️ Scrape facebook group post permalinks☆37Updated 2 years ago
- Cloud crawler functions for scrapeulous☆45Updated 4 years ago
- Build and evaluate JsonLogic with React components☆25Updated 8 years ago
- Generate an object for testing if a request is sent, request is Mikeal's request.☆44Updated 4 years ago
- the mega scraper - scrape a website's content☆27Updated 4 years ago
- Higher level client for Elasticsearch written in Node.js oriented on facets and simplicity☆20Updated 3 months ago
- Auto Create Sequelize ORM Models from PostgreSQL.☆22Updated 6 years ago
- A node.js module to help identify browser sessions☆59Updated last month
- Gets a consistent xpath for a single DOM element.☆63Updated 10 years ago
- Helps to extract shortest optimal css-selector and multi-selector.☆26Updated 7 years ago
- ☆33Updated 7 years ago
- Create vertical search web application in minutes with generator (based on ItemsAPI)☆20Updated 7 years ago
- ⛏ A versatile Web scraper for Node.js☆45Updated 2 months ago