christian-fei / mega-scraper
the mega scraper - scrape a website's content
☆28 Updated 5 years ago
Alternatives and similar repositories for mega-scraper
Users that are interested in mega-scraper are comparing it to the libraries listed below
- Robust text renderer using headless chrome. ☆66 Updated last year
- Scrape subreddits based on search criteria or get the X latest from 'hot' or 'new' categories ☆27 Updated 4 years ago
- A plugin for puppeteer-extra to add proxy support ☆18 Updated 2 years ago
- Simple proxy rotation service ☆30 Updated 9 years ago
- Create a stream of Sequelize create, update, and destroy events. ☆11 Updated 5 years ago
- Extracts email address from an arbitrary text input. ☆64 Updated 8 months ago
- a puppeteer walker 🕷 🕸 ☆79 Updated 5 years ago
- A `htmlparser2` handler for parsing rich metadata from HTML. Includes HTML metadata, JSON-LD, RDFa, microdata, OEmbed, Twitter cards and … ☆57 Updated last year
- Extracts all JSON objects from an arbitrary text document. ☆30 Updated 5 years ago
- Have a stray console.log but too lazy to find it? ☆68 Updated 5 years ago
- Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other … ☆31 Updated 3 years ago
- Get a JSON array of a Twitter user's latest tweets -- no Twitter API required! ☆16 Updated 2 years ago
- Nice PG SQL toolkit. Loves SQL. Not an ORM. Can do migrations. ☆10 Updated 2 years ago
- Language agnostic named entity recognizer ☆39 Updated 2 years ago
- A list of common eMail providers. ☆47 Updated last year
- Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Support… ☆112 Updated 2 years ago
- A context aware debug logger ☆42 Updated 6 years ago
- Technologies I've learned ☆66 Updated last week
- A "top" like module for your Node.js process. Collects CPU usage etc. ☆85 Updated 6 months ago
- ⚡️ Extremely stable HTTP request module built on top of libcurl with retries, timeouts, async/await/promise and callback APIs ☆24 Updated 3 months ago
- Efficient (de)compression package for AWS Lambda ☆26 Updated 6 months ago
- A sparse array optimised for low memory whilst still being fast ☆32 Updated 2 years ago
- Naive Bayes Text Classifier ☆40 Updated 7 months ago
- 🌃 Start and control a Tor instance. ☆13 Updated 3 years ago
- A light weight logger with a status bar on the bottom that does not disappear with scrolling ☆48 Updated last year
- Extracts prices from an arbitrary text input. ☆16 Updated 6 years ago
- Elasticsearch storage adapter for Gun DB ☆29 Updated last year
- Simple way to send messages to slack. Works on both the client and server. ☆19 Updated 9 years ago
- Small module wrapper for the AWS sdk that allows you to easily use s3 or the local file system ☆79 Updated 5 years ago
- Auto installs npm dependencies from the script you want to run and runs the script ☆47 Updated last year