amerkurev / scrapper
Web scraper with a simple REST API living in Docker and using a Headless browser and Readability.js for parsing.
☆248Updated 2 weeks ago
Alternatives and similar repositories for scrapper:
Users that are interested in scrapper are comparing it to the libraries listed below
- Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extr…☆163Updated last week
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆137Updated 3 months ago
- ☆116Updated 6 months ago
- Unflare helps you to bypass Cloudflare protection☆97Updated last week
- An open-source AI assistant for your email records.☆91Updated last year
- Spider ported to Python☆77Updated 2 months ago
- Common crawl extractor☆75Updated 11 months ago
- ScrapeGPT is a RAG-based Telegram bot designed to scrape and analyze websites, then answer questions based on the scraped content. The bo…☆83Updated last year
- AI web agent to find answers to any question☆32Updated 2 months ago
- 🔊 Kokoro Web: Free AI text-to-speech, online or self-hosted, OpenAI compatible!☆261Updated last month
- Get structured JSON data from any page.☆176Updated last year
- Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of Playwright-based spiders with minimal man…☆58Updated last week
- EmailGenius: AI-Driven Email Categorization☆26Updated last year
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆275Updated last year
- OLLama IMage CAtegorizer☆66Updated 3 months ago
- A curated list of AI copilots and assistants that enhance productivity across various domains, with a focus on coding and software develo…☆182Updated this week
- ScriptGPT turns your ideas into JS/TS functional code with the power of GPT4☆22Updated last year
- Open source AI Agent evaluation framework for web tasks 🐒🍌☆293Updated 3 months ago
- This tool allows you scrape and re-write hundreds of articles in an original way using ChatGPT. Leverage the power of chatGPT in SEO, and…☆170Updated last year
- n8n node to interact with browserless instance☆148Updated 6 months ago
- AutoBrowse is an autonomous AI agent that can perform web browsing tasks.☆85Updated last year
- ☆26Updated 6 months ago
- An open source ChatGPT UI, powered by Window.AI. - omnimodel.chat☆32Updated last year
- ChatGPT powered SEO content creator plugin for WordPress using the GPT-3, GPT-3.5 and GPT-4 models☆44Updated last year
- Staff fetcher library for LinkedIn - obtain experiences, schools, skills & contact info☆137Updated 3 weeks ago
- ☆147Updated last year
- react + next.js dashboard for R2R: The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a REST…☆147Updated last week
- TypeScript library for Google search scraping using http requests with proxy support, pagination, and regional customization. Built for w…☆32Updated 2 months ago
- Web service for web page to Markdown conversion☆197Updated 2 months ago
- n8n node for browser automation using Puppeteer☆226Updated this week