ScrapeGraphAI / Scrapegraph-aiLinks
Python scraper based on AI
β21,985Updated this week
Alternatives and similar repositories for Scrapegraph-ai
Users that are interested in Scrapegraph-ai are comparing it to the libraries listed below
Sorting:
- An open-source RAG-based tool for chatting with your documents.β24,745Updated 5 months ago
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ57,111Updated this week
- CrawleeβA web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dowβ¦β7,269Updated last week
- Universal memory layer for AI Agentsβ44,227Updated this week
- Automate browser based workflows with AIβ19,707Updated this week
- π₯ The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured dataβ69,537Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/β9,490Updated 7 months ago
- Get your documents ready for gen AIβ46,599Updated this week
- Convert PDF to markdown + JSON quickly with high accuracyβ30,314Updated 3 weeks ago
- Build Real-Time Knowledge Graphs for AI Agentsβ21,046Updated last week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.β27,697Updated 2 months ago
- Turn any webpage into structured data using LLMsβ6,128Updated last week
- Rapidly build AI apps in Pythonβ6,497Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ18,978Updated last month
- The first real AI developerβ33,701Updated last month
- The unified stack for multi-agent systems.β35,906Updated last week
- OpenUI let's you describe UI using your imagination, then see it rendered live.β21,859Updated 2 weeks ago
- File Parser optimised for LLM Ingestion with no loss π§ Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.β7,241Updated 9 months ago
- An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.β24,455Updated 2 weeks ago
- Toolkit for linearizing PDFs for LLM datasets/trainingβ16,203Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing aβ¦β32,402Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviateβ7,470Updated 5 months ago
- Structured data extraction and instruction calling with ML, LLM and Vision LLMβ5,062Updated last week
- Large Action Model framework to develop AI Web Agentsβ6,215Updated 10 months ago
- Ingest, parse, and optimize any data format β‘οΈ from documents to multimedia β‘οΈ for enhanced compatibility with GenAI frameworksβ6,748Updated 6 months ago
- Turn any website into clean data pipelines & structured APIs in minutes!β14,052Updated this week
- Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.β19,465Updated 2 weeks ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.β20,699Updated 9 months ago
- Crawl a site to generate knowledge files to create your own custom GPT from a URLβ22,054Updated 5 months ago
- Python tool for converting files and office documents to Markdown.β84,096Updated 2 weeks ago