christian-fei / mega-scraperLinks
the mega scraper - scrape a website's content
☆28Updated 5 years ago
Alternatives and similar repositories for mega-scraper
Users that are interested in mega-scraper are comparing it to the libraries listed below
Sorting:
- Robust text renderer using headless chrome.☆66Updated last year
- A plugin for puppeteer-extra to add proxy support☆18Updated 2 years ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 4 years ago
- Simple proxy rotation service☆30Updated 9 years ago
- Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Support…☆114Updated 2 years ago
- a puppeteer walker 🕷 🕸☆79Updated 4 years ago
- Make your functions resilient and fail-fast to failures or delays☆13Updated last year
- Refresh, monitor and balance your proxies☆16Updated 2 months ago
- Small module wrapper for the AWS sdk that allows you to easily use s3 or the local file system☆79Updated 5 years ago
- Accurate and fast sentiment scoring of phrases with #hashtags, emoticons :) & emojis 🎉☆62Updated 2 years ago
- 🌃 Start and control a Tor instance.☆13Updated 3 years ago
- Scrape subreddits based on search criteria or get the X latest from 'hot' or 'new' categories☆26Updated 4 years ago
- HTML template editor for quickly working with handlebars and liquid templates.☆16Updated 2 years ago
- Extracts all JSON objects from an arbitrary text document.☆30Updated 5 years ago
- Extracts prices from an arbitrary text input.☆16Updated 6 years ago
- A `htmlparser2` handler for parsing rich metadata from HTML. Includes HTML metadata, JSON-LD, RDFa, microdata, OEmbed, Twitter cards and …☆55Updated last year
- Efficient (de)compression package for AWS Lambda☆26Updated 4 months ago
- Bypass CORS (Cross-Origin Resource Sharing) get HTML from external domains and make your own API☆14Updated 7 years ago
- Extracts email address from an arbitrary text input.☆64Updated 6 months ago
- RPC calls via postMessage☆17Updated last year
- Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other …☆31Updated 3 years ago
- Get a JSON array of a Twitter user's latest tweets -- no Twitter API required!☆16Updated 2 years ago
- Language agnostic named entity recognizer☆39Updated 2 years ago
- A zero-boilerplate solution for using ES7 async functions in Express and other middleware-based web frameworks.☆24Updated 7 years ago
- Run an array of functions in series, each passing its results to the next function☆91Updated 4 years ago
- Chop a single stream of data into a series of readable streams☆29Updated 6 years ago
- an ftp client that expose the node fs API☆37Updated 5 years ago
- A context aware debug logger☆42Updated 5 years ago
- A proxy that sits in between a chromium devtools frontend and the remote chromium being debugged and logs requests, responses and websock…☆20Updated 4 years ago
- ⛏ A versatile Web scraper for Node.js☆45Updated 3 weeks ago