get-set-fetch / scraperLinks
Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.
☆114Updated 2 years ago
Alternatives and similar repositories for scraper
Users that are interested in scraper are comparing it to the libraries listed below
Sorting:
- web scraping extension☆84Updated 2 months ago
- Base Docker images for Apify actors.☆80Updated this week
- House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.☆121Updated 2 years ago
- Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other …☆30Updated 3 years ago
- Cloud crawler functions for scrapeulous☆45Updated 4 years ago
- Amazon products scraper with using of rotating proxies and headless Chrome from ScrapingAnt☆83Updated last year
- A single tab web browser built with puppeteer. Also, no client-side JS. Viewport is streamed with MJPEG. For realz.☆56Updated last year
- Parses OTP messages for a verification code and service provider.☆24Updated 2 years ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 4 years ago
- A case management app built with Lowdefy.☆32Updated last year
- Building extensible automation. Tideflow is a Realtime, open source workflows execution and monitorization web application.☆116Updated 2 years ago
- The ultimate tool for obtaining free proxies from multiple sources and storing them in a MongoDB database.☆21Updated 2 years ago
- Playwright Docker Images. (Ubuntu, Alpine) x (ARMv8, x64) x (Chromium, Firefox, WebKit, Chrome, Edge)☆60Updated 4 months ago
- Phantombuster's SDK☆14Updated 9 months ago
- Web data extraction tool implemented as chrome extension with much more features☆47Updated 6 years ago
- All In One API to easily scrape data from any website, without worrying about captchas and bot detection mecanisms.☆22Updated last year
- A collaborative low code headless CMS and Javascript framework for building collaborative no code platforms, apps and UI's. Build powerfu…☆32Updated this week
- An alternative to sticking that lovely web app into an <iframe> on a corp website☆50Updated 3 years ago
- Web data extraction tool implemented as chrome extension☆259Updated this week
- PixieBrix browser extension☆85Updated 7 months ago
- Google Search SERP Scraper☆114Updated 2 years ago
- Node.JS library and cli for scraping websites using Puppeteer (or not) and YAML definitions☆45Updated 2 years ago
- A example survey app built with Lowdefy.☆17Updated 2 years ago
- DronaHQ offers a low-code platform to build internal tools. Drag-and-drop UI components and connect them to your databases and APIs to bu…☆59Updated 2 months ago
- The high resilient queue for processing URLs.☆13Updated last month
- 🚀A Go program to schedule posts for reddit.☆18Updated 5 years ago
- Send emails from Google Sheets☆71Updated 4 years ago
- KeepLink is a simple bookmark service with tags and archive build with Supabase and Next.js. It doesn't have any social sharing featrue a…☆70Updated 2 years ago
- Email automation driven by headless chrome.☆167Updated 4 years ago
- Automated functional testing via the Chrome DevTools Protocol. Easy to use and open source. Generates unique CSS and Xpath selectors. Out…☆57Updated 4 years ago