mishushakov / llm-scraper
Turn any webpage into structured data using LLMs
★4,628 Updated 6 months ago
Alternatives and similar repositories for llm-scraper:
Users who are interested in llm-scraper are comparing it to the libraries listed below.
- Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi… ★4,003 Updated last week
- Automate browser-based workflows with LLMs and Computer Vision ★12,710 Updated this week
- File Parser optimised for LLM Ingestion with no loss. Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ★5,884 Updated last month
- An AI web browsing framework focused on simplicity and extensibility. ★9,062 Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel… ★7,392 Updated last week
- Open source Claude Artifacts - built with Llama 3.1 405B ★5,715 Updated 2 months ago
- Crawlee: A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow… ★5,430 Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster ★3,736 Updated this week
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including O… ★4,083 Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper ★4,870 Updated 5 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ★3,961 Updated last month
- Large Action Model framework to develop AI Web Agents ★5,965 Updated 2 months ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid… ★2,465 Updated this week
- The open source Cursor for Designers. Design directly in your live React app and publish your changes to code. ★8,882 Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ★8,312 Updated last week
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents ★4,938 Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework ★3,002 Updated this week
- Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ★31,990 Updated this week
- Fully local web research and report writing assistant ★6,214 Updated this week
- The easiest way to use Agentic RAG in any enterprise ★4,157 Updated 2 months ago
- Lightpanda: the headless browser designed for AI and automation ★7,508 Updated this week
- Open-source Next.js template for building apps that are fully generated by AI. By E2B. ★4,679 Updated last week
- Document to Markdown OCR library with Llama 3.2 vision ★2,224 Updated 2 months ago
- Automatable GenAI Scripting ★2,372 Updated this week
- ★3,424 Updated 4 months ago
- Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 ★3,431 Updated this week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations ★4,425 Updated last week
- AI search engine - self-host with local or cloud LLMs ★3,233 Updated 5 months ago
- Official Firecrawl MCP Server - Adds powerful web scraping to Cursor, Claude and any other LLM clients. ★1,650 Updated this week
- A fast multimodal LLM for real-time voice ★3,757 Updated last month