teticio / lambda-scraperLinks
Use AWS Lambda functions as a proxy pool to scrape web pages.
☆132Updated last year
Alternatives and similar repositories for lambda-scraper
Users that are interested in lambda-scraper are comparing it to the libraries listed below
Sorting:
- The Web Scraping Club Free Repository☆144Updated 3 weeks ago
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆280Updated 2 weeks ago
- Minimal set of tools to conduct stealthy scraping.☆156Updated 2 years ago
- Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.☆233Updated 11 months ago
- estela, an elastic web scraping cluster 🕸☆181Updated last week
- create your rotating proxy server with docker. self hosted rotating proxy service.☆175Updated 2 years ago
- Python wrapper for google people-alos-ask☆107Updated 8 months ago
- Cloud crawler functions for scrapeulous☆45Updated 4 years ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆138Updated 5 months ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 3 years ago
- A drop-in replacement for playwright-python patched with rebrowser-patches. It allows to pass modern automation detection tests.☆72Updated 3 weeks ago
- A fork of https://github.com/AtuboDad/playwright_stealth☆95Updated 2 weeks ago
- Super Fast, Super Anti-Detect, and Super Intuitive Web Driver☆67Updated last month
- Patching CDP (Chrome DevTools Protocol) leaks on OS level. Easy to use with Playwright, Selenium, and other web automation tools.☆121Updated 9 months ago
- ☆131Updated last year
- Introducing AmazonMe, a Python-based web scraper designed to extract data from amazon.com using the requests and beautifulSoup libraries.…☆64Updated last year
- Get structured JSON data from any page.☆175Updated last year
- Create on demand free HTTPS/SOCKS5 proxy servers using AWS Free Tier EC2 instances automatically with Terraform☆92Updated 2 years ago
- Common crawl extractor☆75Updated last year
- Scrapy download handler that can impersonate browser' TLS signatures or JA3 fingerprints.☆165Updated 3 weeks ago
- ☆23Updated 6 months ago
- Library that helps use puppeteer in scrapy.☆52Updated last month
- Anonymous automation via playwright with fingerprint replacement technology.☆214Updated last month
- A Puppeteer Browser that acts like a human. For when you really, really, REALLY need to prove you are definitely not a robot.☆27Updated last year
- Detects the presence of anti-bot and fingerprinting technologies on websites by analyzing requests, headers, cookies, and more. Built on …☆47Updated 7 months ago
- TypeScript library for Google search scraping using http requests with proxy support, pagination, and regional customization. Built for w…☆36Updated 3 months ago
- Home of the Ulixee Open Data Platform☆52Updated this week
- A python package for finding e-mails, checking deliverability and more.☆65Updated last year
- Serverless Amazon Search and Product API, runs on Cloudflare worker. Supports multiple country's amazon version.☆113Updated 3 weeks ago
- 🎭 Intelligent browser header & fingerprint generator☆567Updated 2 months ago