amerkurev / scrapperLinks
Web scraper with a simple REST API living in Docker and using a Headless browser and Readability.js for parsing.
☆274Updated 4 months ago
Alternatives and similar repositories for scrapper
Users that are interested in scrapper are comparing it to the libraries listed below
Sorting:
- A low-code data extractor for websites with built in proxy and parsing capabilities. Great for testing and debugging css selectors☆189Updated 11 months ago
- Get structured JSON data from any page.☆177Updated last year
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆292Updated 3 months ago
- Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extr…☆209Updated last week
- Unflare helps you to bypass Cloudflare protection☆153Updated last month
- AI web agent to find answers to any question☆33Updated 3 months ago
- Free IP Proxy rotator library for python☆268Updated last month
- Get Google, Yandex, Baidu search engine results via API or CLI for free 🎉☆485Updated last week
- Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https…☆473Updated this week
- ScrapeGPT is a RAG-based Telegram bot designed to scrape and analyze websites, then answer questions based on the scraped content. The bo…☆85Updated last year
- Self-hosted version of Microsoft's OmniParser Image-to-text model☆73Updated 3 months ago
- The Web Scraping Club Free Repository☆151Updated 3 months ago
- Undetected web-scraping & seamless HTML parsing in Python!☆284Updated last month
- Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of Playwright-based spiders with minimal man…☆82Updated this week
- Spider ported to Python☆89Updated 7 months ago
- Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.☆236Updated last year
- ☆372Updated 5 months ago
- OLLama IMage CAtegorizer☆68Updated 7 months ago
- A powerful starter template for building undetectable web scrapers and browser automation bots.☆56Updated 3 months ago
- This tool allows you scrape and re-write hundreds of articles in an original way using ChatGPT. Leverage the power of chatGPT in SEO, and…☆168Updated 2 years ago
- Super Fast, Super Anti-Detect, and Super Intuitive Web Driver☆80Updated 2 months ago
- Use AWS Lambda functions as a proxy pool to scrape web pages.☆137Updated last year
- A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker supp…☆688Updated this week
- TypeScript library for Google search scraping using http requests with proxy support, pagination, and regional customization. Built for w…☆53Updated 6 months ago
- AI Coding assistant for large and complex codebases.☆153Updated 6 months ago
- Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents workflow in real-time.☆179Updated 9 months ago
- EmailGenius: AI-Driven Email Categorization☆27Updated last year
- Some words that LLM regularly uses☆85Updated last year
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆145Updated 8 months ago
- A python package for finding e-mails, checking deliverability and more.☆72Updated last year