D4Vinci / ScraplingLinks
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
☆6,480Updated this week
Alternatives and similar repositories for Scrapling
Users that are interested in Scrapling are comparing it to the libraries listed below
Sorting:
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi…☆4,988Updated this week
- Swiss-army tool for scraping and extracting data from online assets, made for hackers☆3,495Updated 10 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents …☆2,815Updated last month
- AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/B…☆2,102Updated this week
- 🦊 Anti-detect browser☆3,070Updated 5 months ago
- Turn any webpage into structured data using LLMs☆5,975Updated 3 months ago
- Document to Markdown OCR library with Llama 3.2 vision☆2,381Updated 7 months ago
- A free + OSS logo generator powered by Flux on Together AI☆5,940Updated 7 months ago
- Fetch an entire site and save it as a text file (to be used with AI models).☆1,623Updated 7 months ago
- Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase☆11,963Updated last week
- Lightweight coding agent that runs in your terminal☆1,950Updated 3 months ago
- Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.☆8,304Updated 2 months ago
- Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow…☆6,228Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,817Updated 2 weeks ago
- Lightweight library for scraping web-sites with LLMs☆1,212Updated last week
- Open-source, vision-first browser agent☆3,563Updated 3 weeks ago
- A community driven list of open source alternatives to proprietary software and applications.☆5,042Updated 2 months ago
- LinkedIn -> personal site generator☆2,308Updated 3 months ago
- Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more.…☆2,314Updated last week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights.☆7,031Updated last week
- Lightpanda: the headless browser designed for AI and automation☆9,625Updated this week
- The AI Browser Automation Framework☆16,707Updated this week
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.☆3,981Updated this week
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 https://x.com/githubprojects/statu…☆570Updated 5 months ago
- An Open Source implementation of Notebook LM with more flexibility and features☆3,930Updated last month
- 🚀 Curated list of open-source, self-hosted projects deployable with Docker and docker-compose. Your go-to resource for amazing self-host…☆3,429Updated 3 months ago
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆5,719Updated this week
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your …☆4,164Updated this week
- ☆1,994Updated 5 months ago
- Create architecture diagrams from code automatically using large language models (LLMs).☆1,108Updated 5 months ago