christian-fei / mega-scraperLinks
the mega scraper - scrape a website's content
☆28Updated 5 years ago
Alternatives and similar repositories for mega-scraper
Users that are interested in mega-scraper are comparing it to the libraries listed below
Sorting:
- Robust text renderer using headless chrome.☆66Updated 2 years ago
- Scrape subreddits based on search criteria or get the X latest from 'hot' or 'new' categories☆28Updated 4 years ago
- Simple proxy rotation service☆30Updated 10 years ago
- 🌃 Start and control a Tor instance.☆13Updated 3 years ago
- A plugin for puppeteer-extra to add proxy support☆18Updated 2 years ago
- Convenience functions for the Puppeteer☆25Updated 2 years ago
- A `htmlparser2` handler for parsing rich metadata from HTML. Includes HTML metadata, JSON-LD, RDFa, microdata, OEmbed, Twitter cards and …☆57Updated last year
- Naive Bayes Classifier in JavaScript☆32Updated 8 years ago
- a puppeteer walker 🕷 🕸☆79Updated 5 years ago
- Gather meta information from chrome web store.☆16Updated 4 years ago
- Chrome binary compatible with AWS Lambda.☆55Updated 6 years ago
- CLI for rendering text with headless chrome.☆11Updated 5 years ago
- Extracts prices from an arbitrary text input.☆16Updated 6 years ago
- Naive Bayes Text Classifier☆40Updated 8 months ago
- Technologies I've learned☆66Updated last week
- Convert a URL to a valid filename☆79Updated 2 months ago
- Chromium / Puppeteer site crawler☆48Updated 5 years ago
- A sparse array optimised for low memory whilst still being fast☆32Updated 2 years ago
- ⛏ A versatile Web scraper for Node.js☆46Updated last month
- Multi-client, multi-threaded reverse shell handler written in Node.js☆74Updated 3 years ago
- plugin to transform from HTML (rehype) to prose (retext)☆19Updated 7 months ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 4 years ago
- Simularity identification in JS☆37Updated last year
- Language agnostic named entity recognizer☆39Updated 2 years ago
- An A/B testing engine☆37Updated last year
- Auto installs npm dependencies from the script you want to run and runs the script☆47Updated last year
- A proxy that sits in between a chromium devtools frontend and the remote chromium being debugged and logs requests, responses and websock…☆21Updated 5 years ago
- Extracts all JSON objects from an arbitrary text document.☆30Updated 5 years ago
- List of words for making random mnemonic sentences☆85Updated last year
- Easily generate correct user-agent strings for popular browsers☆73Updated 3 years ago