teticio / lambda-scraper
Use AWS Lambda functions as a proxy pool to scrape web pages.
☆139 Updated last year
Alternatives and similar repositories for lambda-scraper
Users who are interested in lambda-scraper are comparing it to the libraries listed below.
- The Web Scraping Club Free Repository ☆151 Updated last week
- Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements. ☆234 Updated last year
- A fork of Dragnet that also extracts author, headline, date, keywords from context, as well as built-in metadata extraction all in one pac… ☆295 Updated 5 months ago
- Minimal set of tools to conduct stealthy scraping. ☆160 Updated 2 years ago
- A python package for finding e-mails, checking deliverability and more. ☆74 Updated last year
- TypeScript library for Google search scraping using http requests with proxy support, pagination, and regional customization. Built for w… ☆56 Updated 8 months ago
- Staff fetcher library for LinkedIn - obtain experiences, schools, skills & contact info ☆204 Updated 4 months ago
- AI article writer to automatically generate articles with 1,500-7,000+ words to boost your website's SEO and make it more alive ☆28 Updated last year
- Library that helps use puppeteer in scrapy. ☆52 Updated 2 months ago
- Get structured JSON data from any page. ☆178 Updated 2 years ago
- This bot mass DMs Reddit users (from a list) a specified message. ☆58 Updated 6 months ago
- Super Fast, Super Anti-Detect, and Super Intuitive Web Driver ☆84 Updated 4 months ago
- Generic REST API for scraping websites. Drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source! ☆32 Updated 11 months ago
- Open Source LinkedIn Scraper ☆112 Updated 9 months ago
- AutoBrowse is an autonomous AI agent that can perform web browsing tasks. ☆94 Updated last year
- Common crawl extractor ☆80 Updated last year
- Distributed crawling infrastructure running on top of serverless computation, cloud storage (such as S3) and sophisticated queues. ☆436 Updated 2 years ago
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular… ☆75 Updated last week
- Browser automation engine benchmark - Test bypass rates, performance & stealth against Cloudflare, DataDome, reCAPTCHA and other bot dete… ☆120 Updated last week
- Detects the presence of anti-bot and fingerprinting technologies on websites by analyzing requests, headers, cookies, and more. Built on … ☆53 Updated last year
- Self-hosted version of Microsoft's OmniParser Image-to-text model ☆78 Updated 5 months ago
- A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex. ☆50 Updated last year
- Web scraper with a simple REST API living in Docker and using a Headless browser and Readability.js for parsing. ☆281 Updated 6 months ago
- Introducing AmazonMe, a Python-based web scraper designed to extract data from amazon.com using the requests and BeautifulSoup libraries.… ☆65 Updated last year
- Web scraping API for building AI applications. ☆39 Updated last year
- Extract structured data from Shopify websites. ☆97 Updated last year
- A Python module for automating interactions to mimic human behavior in standalone apps or browsers when using Selenium, Pyppeteer, or Pla… ☆89 Updated 8 months ago
- Home of the Ulixee Open Data Platform ☆55 Updated last month
- Python SEO keywords suggestion tool. Google Autocomplete, People Also Ask and Related Searches. ☆135 Updated 2 years ago
- Curated list of everything related to captchas, including libraries, solvers and scoring ☆44 Updated last month