apify / crawlee-pythonLinks
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
☆5,683Updated this week
Alternatives and similar repositories for crawlee-python
Users that are interested in crawlee-python are comparing it to the libraries listed below
Sorting:
- Rapidly build AI apps in Python☆6,285Updated 2 weeks ago
- Python scraper based on AI☆19,817Updated this week
- Build Real-Time Knowledge Graphs for AI Agents☆9,878Updated this week
- Automate browser-based workflows with LLMs and Computer Vision☆13,452Updated this week
- Agent Framework / shim to use Pydantic with LLMs☆9,915Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆6,449Updated 3 months ago
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi…☆4,491Updated this week
- Turn any webpage into structured data using LLMs☆4,895Updated last week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆24,399Updated 3 weeks ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,284Updated this week
- OCR & Document Extraction using vision models☆11,232Updated last week
- Turns Data and AI algorithms into production-ready web applications in no time.☆18,101Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,806Updated 3 weeks ago
- Open-source Next.js template for building apps that are fully generated by AI. By E2B.☆5,426Updated last week
- Uncomplicated Observability for Python and beyond! 🪵🔥☆3,153Updated this week
- Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.☆3,577Updated this week
- A powerful framework for building realtime voice AI agents 🤖🎙️📹☆6,140Updated this week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights.☆6,729Updated 2 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,084Updated 3 months ago
- Task-Aware Agent-driven Prompt Optimization Framework☆3,265Updated last week
- The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs.☆4,351Updated 3 months ago
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,675Updated this week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆14,834Updated this week
- A language model programming library.☆5,766Updated 3 months ago
- The TypeScript framework for automating browsers with AI☆12,041Updated this week
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simpl…☆4,266Updated this week
- Agno is a lightweight, high-performance library for building Agents.☆27,218Updated this week
- A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.☆7,936Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …☆8,165Updated this week
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.☆3,815Updated this week