mishushakov / llm-scraper
Turn any webpage into structured data using LLMs
★6,131 · Updated last week
Alternatives and similar repositories for llm-scraper
Users that are interested in llm-scraper are comparing it to the libraries listed below
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit… ★6,038 · Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ★7,243 · Updated 9 months ago
- Crawlee - A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow… ★7,290 · Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. … ★11,187 · Updated last month
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web ★2,327 · Updated 6 months ago
- Automate browser based workflows with AI ★19,707 · Updated last week
- Stay on top of trending topics on social media and the web with AI ★3,913 · Updated 10 months ago
- A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.ai ★1,759 · Updated last year
- Large Action Model framework to develop AI Web Agents ★6,215 · Updated 10 months ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ★9,490 · Updated 7 months ago
- An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl ★6,119 · Updated 7 months ago
- An AI-powered search engine with a generative UI ★8,408 · Updated 2 weeks ago
- Document to Markdown OCR library with Llama 3.2 vision ★2,413 · Updated 10 months ago
- Open Source Machine Learning Research Platform designed for frontier AI/ML workflows. Local, on-prem, or in the cloud. Open source. ★4,593 · Updated this week
- A better UX for chat, writing content, and coding with LLMs. ★5,210 · Updated 4 months ago
- AI search engine - self-host with local or cloud LLMs ★3,492 · Updated last year
- The fastest way to build robust AI agents ★1,993 · Updated 5 months ago
- ★3,509 · Updated last year
- Lightpanda: the headless browser designed for AI and automation ★11,104 · Updated this week
- The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai ★5,508 · Updated this week
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision. ★4,087 · Updated this week
- Open source Claude Artifacts - built with Llama 3.1 405B ★6,791 · Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ★2,918 · Updated 2 months ago
- A template for building web agents with Stagehand on Browserbase ★1,875 · Updated 2 weeks ago
- Laminar - open-source all-in-one platform for engineering AI products. Create data flywheel for your AI app. Traces, Evals, Datasets, Lab… ★2,477 · Updated this week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights. ★7,145 · Updated 3 months ago
- Local-first, open-source tools for automating everyday work. ★4,281 · Updated this week
- ✨ AI-powered markdown editor - leverage LLMs with your documents - 100% local or in the cloud ★1,281 · Updated 3 weeks ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents ★6,007 · Updated this week
- Keep searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget) ★5,022 · Updated last week