alirezamika/autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

☆7,653

Alternatives and similar repositories for autoscraper

Users who are interested in autoscraper are comparing it to the libraries listed below.

lorien / awesome-web-scraping
List of libraries, tools and APIs for web scraping and data processing.
☆7,983 · Jul 12, 2026 · Updated last week
twintproject / twint
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, fo…
☆16,393 · Feb 23, 2023 · Updated 3 years ago
LetsUpgrade / Python-Essentials
☆398 · Jan 31, 2026 · Updated 5 months ago
PrefectHQ / prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
☆23,431 · Updated this week
bee-san / Ciphey
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
☆21,531 · Updated this week
Textualize / rich
Rich is a Python library for rich text and beautiful formatting in the terminal.
☆56,891 · Jun 23, 2026 · Updated 3 weeks ago
jbesomi / texthero
Text preprocessing, representation and visualization from zero to hero.
☆2,910 · Aug 29, 2023 · Updated 2 years ago
ScrapeGraphAI / Scrapegraph-ai
Python scraper based on AI
☆28,478 · Updated this week
apify / crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data …
☆24,805 · Updated this week
scrapy / scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
☆63,234 · Jul 13, 2026 · Updated last week
JaidedAI / EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …
☆29,786 · Dec 5, 2025 · Updated 7 months ago
codelucas / newspaper
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
☆15,109 · Jul 8, 2026 · Updated last week
streamlit / streamlit
Streamlit — A faster way to build and share data apps.
☆45,277 · Updated this week
adbar / trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…
☆6,315 · Updated this week
deepset-ai / haystack
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…
☆25,943 · Updated this week
neuml / txtai
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
☆12,733 · Updated this week
microsoft / playwright-python
Python version of the Playwright testing and automation library.
☆14,834 · Updated this week
learning-at-home / hivemind
Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.
☆2,495 · Jan 11, 2026 · Updated 6 months ago
jina-ai / serve
☁️ Build multimodal AI applications with cloud-native stack
☆21,859 · Mar 24, 2025 · Updated last year
schollz / croc
Easily and securely send things from one computer to another
☆35,614 · Updated this week
BruceDone / awesome-crawler
A collection of awesome web crawlers and spiders in different languages
☆7,256 · Jun 16, 2024 · Updated 2 years ago
chriskiehl / Gooey
Turn (almost) any Python command line program into a full GUI application with one line
☆21,906 · Mar 23, 2026 · Updated 3 months ago
ml-tooling / opyrator
🪄 Turns your machine learning code into microservices with web API, interactive GUI, and more.
☆3,134 · Updated this week
CorentinJ / Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆60,047 · Mar 9, 2026 · Updated 4 months ago
lux-org / lux
Automatically visualize your pandas dataframe via a single print! 📊 💡
☆5,379 · Mar 20, 2024 · Updated 2 years ago
mingrammer / diagrams
Diagram as Code for prototyping cloud system architectures
☆42,448 · Updated this week
scrapinghub / portia
Visual scraping for Scrapy
☆9,508 · Jun 26, 2024 · Updated 2 years ago
fastapi / fastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for production
☆100,665 · Updated this week
huginn / huginn
Create agents that monitor and act on your behalf. Your agents are standing by!
☆49,644 · Updated this week
mherrmann / helium
Lighter web automation with Python
☆8,313 · Jul 7, 2026 · Updated last week
huangsam / ultimate-python
Ultimate Python study guide 🐍 🐍 🐍
☆5,899 · Updated this week
explosion / spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
☆33,756 · May 19, 2026 · Updated 2 months ago
h2oai / wave
Realtime Web Apps and Dashboards for Python and R
☆4,247 · Updated this week
sherlock-project / sherlock
Hunt down social media accounts by username across social networks
☆86,795 · Updated this week
psf / requests-html
Pythonic HTML Parsing for Humans™
☆13,827 · Apr 16, 2024 · Updated 2 years ago
assafelovic / gpt-researcher
An autonomous agent that conducts deep research on any data using any LLM providers
☆28,446 · Updated this week
tradytics / surpriver
Find big moving stocks before they move using machine learning and anomaly detection
☆1,865 · Aug 13, 2021 · Updated 4 years ago
unclecode / crawl4ai
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
☆73,292 · Updated this week
mxrch / GHunt
🕵️‍♂️ Offensive Google framework.
☆19,234 · Apr 10, 2026 · Updated 3 months ago