A Smart, Automatic, Fast and Lightweight Web Scraper for Python
β7,157Jun 9, 2025Updated 11 months ago
Alternatives and similar repositories for autoscraper
Users that are interested in autoscraper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π€ Scrape data from HTML websites automatically by just providing examplesβ1,389Mar 17, 2024Updated 2 years ago
- List of libraries, tools and APIs for web scraping and data processing.β7,896Apr 17, 2026Updated 3 weeks ago
- An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, foβ¦β16,368Feb 23, 2023Updated 3 years ago
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β22,329Updated this week
- β‘ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes β‘β21,371May 4, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python scraper based on AIβ23,444May 4, 2026Updated last week
- Rich is a Python library for rich text and beautiful formatting in the terminal.β56,294Apr 12, 2026Updated 3 weeks ago
- Text preprocessing, representation and visualization from zero to hero.β2,911Aug 29, 2023Updated 2 years ago
- β398Jan 31, 2026Updated 3 months ago
- CrawleeβA web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data β¦β23,051Apr 30, 2026Updated last week
- Scrapy, a fast high-level web crawling & scraping framework for Python.β61,573Updated this week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and β¦β29,396Dec 5, 2025Updated 5 months ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:β15,036Apr 16, 2026Updated 3 weeks ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XMβ¦β5,866Sep 12, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Streamlit β A faster way to build and share data apps.β44,447Updated this week
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and aβ¦β25,140Updated this week
- π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflowsβ12,478Updated this week
- Python version of the Playwright testing and automation library.β14,576Apr 30, 2026Updated last week
- Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.β2,428Jan 11, 2026Updated 4 months ago
- βοΈ Build multimodal AI applications with cloud-native stackβ21,872Mar 24, 2025Updated last year
- Easily and securely send things from one computer to anotherβ34,913Apr 26, 2026Updated 2 weeks ago
- A collection of awesome web crawler,spider in different languagesβ7,191Jun 16, 2024Updated last year
- πͺ Turns your machine learning code into microservices with web API, interactive GUI, and more.β3,137Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Turn (almost) any Python command line program into a full GUI application with one lineβ21,904Mar 23, 2026Updated last month
- Clone a voice in 5 seconds to generate arbitrary speech in real-timeβ59,711Mar 9, 2026Updated 2 months ago
- Diagram as Code for prototyping cloud system architecturesβ42,239Apr 13, 2026Updated 3 weeks ago
- Automatically visualize your pandas dataframe via a single print! π π‘β5,383Mar 20, 2024Updated 2 years ago
- Visual scraping for Scrapyβ9,492Jun 26, 2024Updated last year
- Create agents that monitor and act on your behalf. Your agents are standing by!β49,236May 2, 2026Updated last week
- Lighter web automation with Pythonβ8,286Apr 22, 2026Updated 2 weeks ago
- FastAPI framework, high performance, easy to learn, fast to code, ready for productionβ97,863May 3, 2026Updated last week
- Ultimate Python study guide π π πβ5,845Apr 30, 2026Updated last week
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- π« Industrial-strength Natural Language Processing (NLP) in Pythonβ33,544Mar 28, 2026Updated last month
- Realtime Web Apps and Dashboards for Python and Rβ4,234Apr 28, 2026Updated last week
- Hunt down social media accounts by username across social networksβ83,157Updated this week
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ64,964Apr 30, 2026Updated last week
- Pythonic HTML Parsing for Humansβ’β13,850Apr 16, 2024Updated 2 years ago
- An autonomous agent that conducts deep research on any data using any LLM providersβ26,934Apr 16, 2026Updated 3 weeks ago
- Find big moving stocks before they move using machine learning and anomaly detectionβ1,865Aug 13, 2021Updated 4 years ago