christian-fei / mega-scraper
the mega scraper - scrape a website's content
☆28 · Updated 5 years ago
Alternatives and similar repositories for mega-scraper
Users interested in mega-scraper are comparing it to the libraries listed below.
- Robust text renderer using headless chrome. ☆66 · Updated last year
- Scrape subreddits based on search criteria or get the X latest from 'hot' or 'new' categories ☆27 · Updated 4 years ago
- An experimental distributed JWT token cracker built using Node.js and ZeroMQ ☆57 · Updated last year
- A plugin for puppeteer-extra to add proxy support ☆18 · Updated 2 years ago
- Chrome binary compatible with AWS Lambda. ☆55 · Updated 6 years ago
- Simple proxy rotation service ☆30 · Updated 9 years ago
- Extracts prices from an arbitrary text input. ☆16 · Updated 6 years ago
- An undetectable browser automation framework 🤖 ☆34 · Updated 3 years ago
- Extracts email address from an arbitrary text input. ☆64 · Updated 7 months ago
- Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Support… ☆114 · Updated 2 years ago
- A zero-boilerplate solution for using ES7 async functions in Express and other middleware-based web frameworks. ☆24 · Updated 7 years ago
- Refresh, monitor and balance your proxies ☆16 · Updated 3 months ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON. ☆70 · Updated 4 years ago
- Naive Bayes Classifier in JavaScript ☆32 · Updated 8 years ago
- ⛏ A versatile Web scraper for Node.js ☆45 · Updated last week
- Create HTML snippets/embeds from URLs using info from oEmbed, Open Graph, meta tags. ☆66 · Updated 2 years ago
- A `htmlparser2` handler for parsing rich metadata from HTML. Includes HTML metadata, JSON-LD, RDFa, microdata, OEmbed, Twitter cards and … ☆57 · Updated last year
- Convert a URL to a valid filename ☆78 · Updated last year
- Decorator to memoize the results of async functions via lru-cache. ☆24 · Updated 5 years ago
- A tool to show mouse position and status for screenshots in automation test such as Puppeteer or Playwright ☆24 · Updated 2 years ago
- Naive Bayes Text Classifier ☆40 · Updated 6 months ago
- 🌃 Start and control a Tor instance. ☆13 · Updated 3 years ago
- Language agnostic named entity recognizer ☆39 · Updated 2 years ago
- Convenience functions for the Puppeteer ☆25 · Updated 2 years ago
- Email automation driven by headless chrome. ☆168 · Updated 4 years ago
- a puppeteer walker 🕷 🕸 ☆79 · Updated 5 years ago
- 🏴 A straightforward forward-proxy written in Node.js. ☆84 · Updated last year
- Identifies and extracts phone numbers from arbitrary text ☆39 · Updated 8 years ago
- Create a stream of Sequelize create, update, and destroy events. ☆11 · Updated 5 years ago
- Automatic SSL renewal for NodeJS ☆52 · Updated 5 years ago