get-set-fetch / scraperLinks
Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.
☆113Updated 2 years ago
Alternatives and similar repositories for scraper
Users that are interested in scraper are comparing it to the libraries listed below
Sorting:
- House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.☆128Updated this week
- web scraping extension☆85Updated 6 months ago
- Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other …☆31Updated 4 years ago
- A single tab web browser built with puppeteer. Also, no client-side JS. Viewport is streamed with MJPEG. For realz.☆59Updated 2 years ago
- Base Docker images for Apify actors.☆90Updated this week
- Email automation driven by headless chrome.☆167Updated 5 years ago
- Node.JS library and cli for scraping websites using Puppeteer (or not) and YAML definitions☆49Updated 3 years ago
- Parses OTP messages for a verification code and service provider.☆23Updated 3 years ago
- Web data extraction tool implemented as chrome extension☆274Updated 2 weeks ago
- An alternative to sticking that lovely web app into an <iframe> on a corp website☆51Updated 4 years ago
- Chromium Browser Automation (extension for chrome browser automation).☆126Updated last year
- Cloud crawler functions for scrapeulous☆45Updated 4 years ago
- Simple proxy rotation service☆30Updated 10 years ago
- DronaHQ offers a low-code platform to build internal tools. Drag-and-drop UI components and connect them to your databases and APIs to bu…☆69Updated 9 months ago
- Mail Bot: IMAP client sorting entering emails and triggering operations of your choice☆33Updated 8 years ago
- Amazon products scraper with using of rotating proxies and headless Chrome from ScrapingAnt☆88Updated last year
- Refresh, monitor and balance your proxies☆16Updated 8 months ago
- Hosted web-client for the browserless debugger☆50Updated 4 months ago
- Chromium / Puppeteer site crawler☆48Updated 5 years ago
- Amazon affiliate link storefront powered by a Reddit scraper☆13Updated 8 years ago
- Extract data from any website right in Chrome☆19Updated 7 years ago
- Web data extraction tool implemented as chrome extension with much more features☆47Updated 7 years ago
- Grammarify is a npm package that safely cleans up text that has mispellings, improper capitalization, lexical illusions, among other thin…☆73Updated 3 years ago
- KeepLink is a simple bookmark service with tags and archive build with Supabase and Next.js. It doesn't have any social sharing featrue a…☆68Updated 2 years ago
- Automagically generates summaries from html or text.☆69Updated 2 years ago
- A plugin for puppeteer-extra to add proxy support☆18Updated 3 years ago
- Instagram get images 🌄 (hashtags, account, locations) with puppeteer☆79Updated 2 months ago
- Wish your facebook friends from command line, never miss the wish! http://npm.im/facebook-birthday-cli☆58Updated 6 years ago
- Building extensible automation. Tideflow is a Realtime, open source workflows execution and monitorization web application.☆114Updated 2 years ago
- Tools and Images to Build a Raspberry Pi n8n server☆79Updated 4 years ago