amerkurev / scrapper
Web scraper with a simple REST API living in Docker and using a headless browser and Readability.js for parsing.
☆269 Updated 3 months ago
Alternatives and similar repositories for scrapper
Users who are interested in scrapper are comparing it to the repositories listed below
- A low-code data extractor for websites with built-in proxy and parsing capabilities. Great for testing and debugging CSS selectors. ☆187 Updated 10 months ago
- Get structured JSON data from any page. ☆177 Updated last year
- Unflare helps you to bypass Cloudflare protection ☆135 Updated 3 weeks ago
- Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extr… ☆206 Updated last month
- A fork of Dragnet that also extracts author, headline, date, keywords from context, as well as built-in metadata extraction all in one pac… ☆291 Updated 2 months ago
- ScrapeGPT is a RAG-based Telegram bot designed to scrape and analyze websites, then answer questions based on the scraped content. The bo… ☆83 Updated last year
- Free IP proxy rotator library for Python ☆263 Updated 2 weeks ago
- AI web agent to find answers to any question ☆33 Updated 2 months ago
- n8n node to interact with browserless instance ☆179 Updated 10 months ago
- Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of Playwright-based spiders with minimal man… ☆82 Updated last month
- Self-hosted version of Microsoft's OmniParser Image-to-text model ☆71 Updated 2 months ago
- Undetected web-scraping & seamless HTML parsing in Python! ☆279 Updated 3 weeks ago
- Spider ported to Python ☆89 Updated 6 months ago
- Scrape Reddit for marketing pain points using GPT, built to fuel Cronlytic's growth. ☆60 Updated 2 months ago
- Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https… ☆467 Updated last week
- Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agent workflows in real time. ☆180 Updated 8 months ago
- The Architecture of a Web Crawler: Building a Google-Inspired Distributed Web Crawler ☆121 Updated 7 months ago
- Some words that LLMs regularly use ☆85 Updated last year
- Get Google, Yandex, Baidu search engine results via API or CLI for free 🎉 ☆472 Updated 2 weeks ago
- The PyVisionAI Official Repo ☆103 Updated 2 weeks ago
- n8n node for browser automation using Puppeteer ☆356 Updated this week
- Yet Another Reddit Scrapper (without API keys) | Scrape search results, posts and images from subreddits filtered by hot, new etc and bulk… ☆95 Updated last month
- A Python package for finding e-mails, checking deliverability and more. ☆70 Updated last year
- Detects the presence of anti-bot and fingerprinting technologies on websites by analyzing requests, headers, cookies, and more. Built on … ☆49 Updated 9 months ago
- A powerful starter template for building undetectable web scrapers and browser automation bots. ☆54 Updated 3 months ago
- Use AWS Lambda functions as a proxy pool to scrape web pages. ☆135 Updated last year
- TypeScript library for Google search scraping using http requests with proxy support, pagination, and regional customization. Built for w… ☆44 Updated 5 months ago
- Generic REST API for scraping websites. Drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source! ☆31 Updated 8 months ago
- The Web Scraping Club Free Repository ☆147 Updated 2 months ago
- Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements. ☆235 Updated last year