mishushakov / llm-scraper
Turn any webpage into structured data using LLMs
☆4,739 · Updated 7 months ago
Alternatives and similar repositories for llm-scraper:
Users who are interested in llm-scraper are comparing it to the libraries listed below.
- An AI web browsing framework focused on simplicity and extensibility. ☆10,288 · Updated this week
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi… ☆4,149 · Updated this week
- Scira (formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel… ☆7,621 · Updated this week
- File Parser optimised for LLM Ingestion with no loss. Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆5,958 · Updated last month
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆8,548 · Updated this week
- Lightpanda: the headless browser designed for AI and automation ☆8,267 · Updated this week
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents ☆5,050 · Updated this week
- Automate browser-based workflows with LLMs and Computer Vision ☆13,016 · Updated this week
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision. ☆3,724 · Updated this week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations ☆4,701 · Updated last week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆35,829 · Updated this week
- The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs. ☆4,269 · Updated 2 months ago
- Let AI be your browser operator. ☆7,917 · Updated this week
- A fast multimodal LLM for real-time voice ☆3,844 · Updated 2 months ago
- Large Action Model framework to develop AI Web Agents ☆6,014 · Updated 2 months ago
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Co… ☆3,564 · Updated last month
- A powerful framework for building realtime voice AI agents ☆5,617 · Updated this week
- Crawlee: a web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow… ☆5,519 · Updated this week
- Build Real-Time Knowledge Graphs for AI Agents ☆3,961 · Updated this week
- A better UX for chat, writing content, and coding with LLMs. ☆4,387 · Updated this week
- An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl ☆5,358 · Updated last month
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API. ☆6,415 · Updated this week
- Open-source Next.js template for building apps that are fully generated by AI. By E2B. ☆5,229 · Updated last week
- The easiest way to use Agentic RAG in any enterprise ☆4,189 · Updated 2 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆4,007 · Updated last month
- ☆3,467 · Updated 5 months ago
- Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 ☆3,589 · Updated last week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper ☆4,881 · Updated 6 months ago
- A template for building web agents with Stagehand on Browserbase ☆1,534 · Updated last month
- Open source Claude Artifacts, built with Llama 3.1 405B ☆5,903 · Updated last week