mishushakov / llm-scraperLinks
Turn any webpage into structured data using LLMs
β5,923Updated 2 months ago
Alternatives and similar repositories for llm-scraper
Users that are interested in llm-scraper are comparing it to the libraries listed below
Sorting:
- π₯ Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wiβ¦β4,894Updated this week
- File Parser optimised for LLM Ingestion with no loss π§ Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.β7,064Updated 5 months ago
- The AI Browser Automation Frameworkβ15,797Updated this week
- Python scraper based on AIβ20,954Updated last month
- A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.aiβ1,613Updated last year
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documentsβ5,606Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,711Updated last week
- Sim is an open-source AI agent workflow builder. Sim Studio's interface is a lightweight, intuitive way to quickly build and deploy LLMs β¦β6,302Updated this week
- Lightpanda: the headless browser designed for AI and automationβ9,466Updated this week
- CrawleeβA web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dowβ¦β6,129Updated this week
- Large Action Model framework to develop AI Web Agentsβ6,106Updated 6 months ago
- Automate browser-based workflows with LLMs and Computer Visionβ14,015Updated this week
- An AI-powered search engine with a generative UIβ7,885Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. β¦β10,259Updated this week
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the webβ2,315Updated last month
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/β9,030Updated 3 months ago
- Lightweight library for scraping web-sites with LLMsβ1,175Updated 2 months ago
- Document to Markdown OCR library with Llama 3.2 visionβ2,374Updated 6 months ago
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chβ¦β5,947Updated 3 months ago
- A template for building web agents with Stagehand on Browserbaseβ1,771Updated 2 months ago
- Latitude is the open-source prompt engineering platform to build, evaluate, and refine your prompts with AIβ3,205Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serperβ4,911Updated last month
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.β7,140Updated last month
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simplβ¦β5,262Updated this week
- c/ua is the Docker Container for Computer-Use AI Agents.β9,112Updated last week
- The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs.β4,426Updated 5 months ago
- A visual playground for agentic workflows: Iterate over your agents 10x fasterβ5,327Updated 2 weeks ago
- The easiest way to use Agentic RAG in any enterpriseβ4,297Updated 6 months ago
- A framework for Claude Opus to intelligently orchestrate subagents.β4,269Updated last year
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.β3,943Updated this week