D4Vinci / Scrapling
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
★ 8,084 · Updated 2 weeks ago
Alternatives and similar repositories for Scrapling
Users who are interested in Scrapling are comparing it to the libraries listed below.
- Swiss-army tool for scraping and extracting data from online assets, made for hackers ★ 3,998 · Updated last year
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit… ★ 5,945 · Updated last week
- Fetch an entire site and save it as a text file (to be used with AI models). ★ 1,636 · Updated 9 months ago
- Crawlee: A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow… ★ 7,113 · Updated last week
- The AI Browser Automation Framework ★ 18,994 · Updated this week
- Stay on top of trending topics on social media and the web with AI ★ 3,886 · Updated 8 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents… ★ 2,926 · Updated last month
- Lightpanda: the headless browser designed for AI and automation ★ 10,285 · Updated this week
- Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed. ★ 10,346 · Updated 3 weeks ago
- Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions. ★ 5,958 · Updated last week
- Turn any webpage into structured data using LLMs ★ 6,083 · Updated 3 weeks ago
- Curated list of open-source, self-hosted projects deployable with Docker and docker-compose. Your go-to resource for amazing self-host… ★ 3,537 · Updated 5 months ago
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 https://x.com/githubprojects/statu… ★ 578 · Updated 7 months ago
- AnyCrawl: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/B… ★ 2,372 · Updated 3 weeks ago
- Lightweight coding agent that runs in your terminal ★ 2,138 · Updated 6 months ago
- A community driven list of open source alternatives to proprietary software and applications. ★ 5,113 · Updated 4 months ago
- Self-hosted, multi-user API that drops bots into Google Meet for real-time transcripts. ★ 1,513 · Updated last week
- BillionMail gives you open-source MailServer, NewsLetter, Email Marketing: fully self-hosted, dev-friendly, and free from monthly fees.… ★ 12,097 · Updated this week
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your … ★ 4,479 · Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ★ 7,216 · Updated 8 months ago
- OCR model that handles complex tables, forms, handwriting with full layout. ★ 1,864 · Updated this week
- LinkedIn -> personal site generator ★ 2,415 · Updated 5 months ago
- ★ 2,061 · Updated 7 months ago
- An invoice generator app built using Next.js, Typescript, and Shadcn ★ 6,027 · Updated last week
- The All in One Framework to Build Undefeatable Scrapers ★ 3,178 · Updated 2 weeks ago
- Python GUI builder. GUI builder for Tkinter, CustomTkinter, Kivy and PySide (upcoming) ★ 1,873 · Updated last month
- ContextGem: Effortless LLM extraction from documents ★ 1,703 · Updated last month
- Open-source platform to build and deploy AI agent workflows. ★ 17,923 · Updated this week
- Convert a Docker image to an executable ★ 1,709 · Updated 6 months ago
- Open-source, vision-first browser agent ★ 3,749 · Updated 3 weeks ago