amerkurev / scrapperLinks
Web scraper with a simple REST API living in Docker and using a Headless browser and Readability.js for parsing.
☆282Updated 6 months ago
Alternatives and similar repositories for scrapper
Users that are interested in scrapper are comparing it to the libraries listed below
Sorting:
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆295Updated 5 months ago
 - A low-code data extractor for websites with built in proxy and parsing capabilities. Great for testing and debugging css selectors☆189Updated last year
 - Get structured JSON data from any page.☆178Updated 2 years ago
 - Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extr…☆227Updated last week
 - Get [Google, Yandex, Baidu, Bing, DuckDuckGo] search results via API for free 🎉☆534Updated 3 weeks ago
 - Undetected web-scraping & seamless HTML parsing in Python!☆307Updated 3 months ago
 - Unflare helps you to bypass Cloudflare protection☆161Updated 3 months ago
 - The Web Scraping Club Free Repository☆151Updated 2 weeks ago
 - Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.☆234Updated last year
 - Spider ported to Python☆96Updated 9 months ago
 - TypeScript library for Google search scraping using http requests with proxy support, pagination, and regional customization. Built for w…☆56Updated 8 months ago
 - Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆149Updated 10 months ago
 - Mixpost Installation with Docker Containers☆12Updated 2 years ago
 - Use AWS Lambda functions as a proxy pool to scrape web pages.☆139Updated last year
 - Open source AI Agent evaluation framework for web tasks 🐒🍌☆311Updated 10 months ago
 - n8n node to interact with browserless instance☆194Updated last year
 - ☆380Updated 7 months ago
 - An open source ChatGPT UI, powered by Window.AI. - omnimodel.chat☆33Updated 2 years ago
 - AI web agent to find answers to any question☆35Updated 5 months ago
 - Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of Playwright-based spiders with minimal man…☆94Updated 2 months ago
 - ScrapeGPT is a RAG-based Telegram bot designed to scrape and analyze websites, then answer questions based on the scraped content. The bo…☆87Updated last year
 - 📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.☆912Updated 7 months ago
 - Data Encoding and Representation Analysis☆40Updated last year
 - Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agents workflow in real-time.☆184Updated 11 months ago
 - Minimalist web-searching platform with an AI assistant that runs directly from your browser. Uses WebLLM, Wllama and SearXNG. Demo: https…☆488Updated this week
 - The GPT-based Universal Web Scraper MVP is a solution that leverages GPT models and web scraping libraries to generate scraper code based…☆270Updated last year
 - A lightweight Amazon scraper library.☆73Updated 6 months ago
 - Common crawl extractor☆80Updated last year
 - A ChatGPT plugin to create Spotify playlists based on user description☆23Updated 2 years ago
 - The SEO Data Platform automates SEO analysis, aggregating data from Google Analytics 4, Search Console, Page Speed Insights, and rendered…☆30Updated last year