amerkurev / scrapper
Web scraper with a simple REST API living in Docker and using a Headless browser and Readability.js for parsing.
☆252Updated last month
Alternatives and similar repositories for scrapper
Users that are interested in scrapper are comparing it to the libraries listed below
- The Web Scraping Club Free Repository☆143Updated last week
- Unflare helps you to bypass Cloudflare protection☆105Updated last month
- AI web agent to find answers to any question☆32Updated 3 months ago
- Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extr…☆176Updated last month
- estela, an elastic web scraping cluster 🕸☆180Updated 2 months ago
- Detects the presence of anti-bot and fingerprinting technologies on websites by analyzing requests, headers, cookies, and more. Built on …☆46Updated 7 months ago
- A fork of Dragnet that also extracts author, headline, date, keywords from context, as well as built-in metadata extraction all in one pac…☆276Updated last year
- Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.☆233Updated 11 months ago
- Open source AI Agent evaluation framework for web tasks 🐒🍌☆294Updated 4 months ago
- Use AWS Lambda functions as a proxy pool to scrape web pages.☆131Updated last year
- A multi-arch image provides one HTTP proxy endpoint with many concurrent tunnels to the Tor network.☆171Updated 3 weeks ago
- Free IP proxy rotator library for Python☆239Updated last month
- 🎭 Intelligent browser header & fingerprint generator☆538Updated last month
- Web application that converts audio and video to text using AI, supporting various formats and self-hosting.☆93Updated last month
- TypeScript library for Google search scraping using HTTP requests with proxy support, pagination, and regional customization. Built for w…☆35Updated 3 months ago
- Undetected Python version of the Playwright testing and automation library.☆509Updated last week
- ☆40Updated last week
- ☆23Updated 5 months ago
- Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https…☆402Updated this week
- A low-code data extractor for websites with built-in proxy and parsing capabilities. Great for testing and debugging CSS selectors☆183Updated 8 months ago
- n8n node to interact with browserless instance☆155Updated 7 months ago
- Scrape Reddit for marketing pain points using GPT — built to fuel Cronlytic's growth.☆23Updated last week
- 🔎 Search from DuckDuckGo and utilize its spice APIs in Node☆193Updated last month
- ScraperAI is an open-source, AI-powered tool designed to simplify web scraping for users of all skill levels.☆156Updated 8 months ago
- A simple worker for extracting page content for a given URL☆111Updated last year
- Super Fast, Super Anti-Detect, and Super Intuitive Web Driver☆66Updated last month
- Undetected NodeJS version of the Playwright testing and automation library.☆254Updated last week
- Viral Factory is a highly modular Gradio app that automates the production of various forms of social media content. Thanks to its comp…☆49Updated this week
- converts url content into JSON with a simple prefix☆68Updated last year
- A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker supp…☆431Updated last week