ReedD / crawler
Chromium / Puppeteer site crawler
☆48 · Updated 4 years ago
Alternatives and similar repositories for crawler:
Users who are interested in crawler are comparing it to the libraries listed below.
- GitHub automation driven by headless Chrome. ☆18 · Updated 4 years ago
- ☆78 · Updated 10 years ago
- Robust text renderer using headless Chrome. ☆65 · Updated last year
- 📐 A fast, general-purpose JSON Rules Engine. ☆54 · Updated last year
- Base environment image for Puppeteer (Headless Chrome Node API) ☆48 · Updated last year
- House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases. ☆120 · Updated last year
- Easily generate animated GIFs from websites ☆105 · Updated last year
- Floodesh is a distributed web spider written with Node.js. ☆13 · Updated 4 years ago
- ⚡️ Next-generation data transformation framework for TypeScript that puts developer experience first ☆53 · Updated 2 years ago
- Higher level client for Elasticsearch written in Node.js oriented on facets and simplicity ☆20 · Updated last month
- A Node.js library to easily handle all your notification templates. ☆31 · Updated 6 years ago
- Module that extracts structured information from a rendered HTML site and outputs JSON. HTML to JSON. ☆69 · Updated 3 years ago
- Convenience functions for Puppeteer ☆25 · Updated 2 years ago
- Node.js library and CLI for scraping websites using Puppeteer (or not) and YAML definitions ☆44 · Updated 2 years ago
- HTML5 Canvas implementation for Node.js backed by Puppeteer ☆65 · Updated last year
- Watch, fake or block requests from Puppeteer matching patterns ☆49 · Updated 4 years ago
- Extracts email address from an arbitrary text input. ☆62 · Updated last month
- Email automation driven by headless Chrome. ☆165 · Updated 4 years ago
- Generate HAR file with Puppeteer ☆162 · Updated last year
- Example project demonstrating Headless Chrome + Puppeteer running in their own individual containers. ☆71 · Updated 2 years ago
- HyperTrack Placeline web application sample using NextJS, Ant-Design, Styled-Components, and Heroku ☆92 · Updated 2 years ago
- 🎠 Run headless Chrome (aka Puppeteer) as a service. ☆49 · Updated 7 years ago
- Extracts time from an arbitrary text input. ☆18 · Updated 5 years ago
- Tool to run Lighthouse cron jobs on multiple URLs and ship results ☆44 · Updated 2 years ago
- HTML template editor for quickly working with handlebars and liquid templates. ☆16 · Updated 2 years ago
- A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppetee… ☆94 · Updated 2 years ago
- Evaluate objects against a set of JSON rules supporting nested ALL, NONE and ANY predicates with standard operators ☆19 · Updated 5 years ago
- Wish your Facebook friends from the command line, never miss the wish! http://npm.im/facebook-birthday-cli ☆58 · Updated 5 years ago
- 🖨️ Printer: Productivity Focused Next.js CLI Tool ☆10 · Updated last year
- ☆33 · Updated 7 years ago