ReedD / crawlerLinks
Chromium / Puppeteer site crawler
☆48Updated 5 years ago
Alternatives and similar repositories for crawler
Users that are interested in crawler are comparing it to the libraries listed below
Sorting:
- Email automation driven by headless chrome.☆167Updated 5 years ago
- A Better Scraper, with Puppeteer☆43Updated last month
- House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.☆128Updated last week
- Example project demonstrating Headless Chrome + Puppeteer running in their own individual containers.☆70Updated 3 years ago
- A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and con…☆382Updated 3 years ago
- Easily generate animated GIFs from websites☆106Updated 2 years ago
- ⛏ A versatile Web scraper for Node.js☆46Updated 2 weeks ago
- Convenience functions for the Puppeteer☆25Updated 2 years ago
- Library and CLI for automating captcha verification across multiple providers.☆121Updated 5 years ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 4 years ago
- Robust text renderer using headless chrome.☆66Updated 2 years ago
- Puppeteer (Headless Chrome Node API)-based rendering solution.☆548Updated 3 years ago
- Query multiple APIs and DBs and join them in a single query☆136Updated 2 years ago
- Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Support…☆113Updated 2 years ago
- Node.JS library and cli for scraping websites using Puppeteer (or not) and YAML definitions☆49Updated 3 years ago
- ☆33Updated 8 years ago
- Cloud crawler functions for scrapeulous☆45Updated 4 years ago
- Instagram get images 🌄 (hashtags, account, locations) with puppeteer☆79Updated 2 months ago
- ProxyCrawl Node library for scraping and crawling☆23Updated 2 years ago
- Nodejs lib to parse Google SERP html pages☆46Updated 2 years ago
- A simple component to check the status of a domain (whois, availability, expired, PR, TrustFlow, ...)☆33Updated 9 years ago
- 🎠 Run headless Chrome (aka Puppeteer) as a service.☆50Updated 7 years ago
- HTML5 Canvas implementation for NodeJS backed by Puppeteer☆65Updated 2 years ago
- Capture website thumbnails using the urlbox screenshot as a service API in node☆26Updated last year
- Automagical summarization for webpages and articles. 🔥☆45Updated 3 years ago
- ⚡️ Next-generation data transformation framework for TypeScript that puts developer experience first☆53Updated 3 years ago
- Language agnostic named entity recognizer☆40Updated 2 years ago
- Declarative Messenger chatbot framework☆55Updated 2 years ago
- Wish your facebook friends from command line, never miss the wish! http://npm.im/facebook-birthday-cli☆58Updated 6 years ago
- Real-Time Proxy & Web Scraping API☆24Updated 6 years ago