oxylabs / playwright-web-scrapingLinks
A tutorial for web scraping using Playwright headless browser
☆143Updated 4 months ago
Alternatives and similar repositories for playwright-web-scraping
Users that are interested in playwright-web-scraping are comparing it to the libraries listed below
Sorting:
- Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Support…☆113Updated 2 years ago
- ☆123Updated 3 months ago
- Use AWS Lambda functions as a proxy pool to scrape web pages.☆139Updated 2 years ago
- ScrapingAnt API client for Python.☆43Updated last year
- Amazon products scraper with using of rotating proxies and headless Chrome from ScrapingAnt☆88Updated last year
- A python script to loop through urls in a csv and look for specific keywords on the scraped homepage.☆16Updated 3 years ago
- A full-featured, hackable Next.js AI chatbot built by Vercel but running solely on a VPS, no outside APIs except for LLMs☆12Updated last year
- Semantic Search + Keyword Search + Hybrid Search + Filtering + Faceting on 300K HN Comments☆55Updated last year
- House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.☆127Updated last week
- ScraperAI is an open-source, AI-powered tool designed to simplify web scraping for users of all skill levels.☆241Updated 4 months ago
- Python Wrapper on top of Unofficial Medium API to quickly extract data from Medium's website.☆61Updated 6 months ago
- Spider ported to Python☆103Updated last week
- Professional scrapers that provide full control to the users. Crawlee One builds on top of Crawlee and Apify and extends them with featur…☆35Updated last year
- AI article writer to automatically generate articles with 1,500-7,000+ words to boost your website's SEO and make it more alive☆29Updated 2 years ago
- Scalable Visual-ChatGPT deployment on Kubernetes - Distributed multi-model inference graph powered by BentoML☆16Updated 2 years ago
- Type-complete Python wrapper for the Reddit API.☆52Updated last year
- A dead simple REST API to use Playwright to scrape the text contents from any URL.☆29Updated 2 years ago
- Common crawl extractor☆84Updated last year
- Python, Javascript, and Rust libraries for the Spider Cloud API.☆22Updated last week
- OpenAI Web Search RAG LLM API with BUN.js☆23Updated 2 years ago
- Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.☆238Updated last year
- Offline Youtube with advanced vector searching☆33Updated last year
- G2 Scraper helps you collect G2 product data, including names, product descriptions, reviews, ratings, comparisons, alternatives, and mor…☆55Updated 3 months ago
- Some words that LLM regularly uses☆88Updated last year
- OpenCopilot flows editor☆11Updated 2 years ago
- Web data extraction tool implemented as chrome extension☆274Updated last week
- FastHTML app that makes other FastHTML apps with LLMs☆19Updated last year
- Scrapfly Python SDK for headless browsers and proxy rotation☆50Updated 3 weeks ago
- ☆32Updated 2 months ago
- Introducing AmazonMe, a Python-based web scraper designed to extract data from amazon.com using the requests and beautifulSoup libraries.…☆67Updated last year