A Smart, Automatic, Fast and Lightweight Web Scraper for Python
β7,178Jun 9, 2025Updated last year
Alternatives and similar repositories for autoscraper
Users that are interested in autoscraper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π€ Scrape data from HTML websites automatically by just providing examplesβ1,385Mar 17, 2024Updated 2 years ago
- List of libraries, tools and APIs for web scraping and data processing.β7,919May 28, 2026Updated last week
- An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, foβ¦β16,375Feb 23, 2023Updated 3 years ago
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β22,560Updated this week
- β‘ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes β‘β21,437Updated this week
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Python scraper based on AIβ26,731Jun 2, 2026Updated last week
- Rich is a Python library for rich text and beautiful formatting in the terminal.β56,577Apr 12, 2026Updated last month
- Text preprocessing, representation and visualization from zero to hero.β2,912Aug 29, 2023Updated 2 years ago
- β398Jan 31, 2026Updated 4 months ago
- CrawleeβA web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data β¦β23,640Jun 2, 2026Updated last week
- Scrapy, a fast high-level web crawling & scraping framework for Python.β62,120Updated this week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and β¦β29,547Dec 5, 2025Updated 6 months ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:β15,069May 13, 2026Updated 3 weeks ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XMβ¦β6,037Jun 3, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Streamlit β A faster way to build and share data apps.β44,829Updated this week
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and aβ¦β25,502Updated this week
- π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflowsβ12,642Updated this week
- Python version of the Playwright testing and automation library.β14,700May 18, 2026Updated 3 weeks ago
- Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.β2,466Jan 11, 2026Updated 4 months ago
- βοΈ Build multimodal AI applications with cloud-native stackβ21,862Mar 24, 2025Updated last year
- Easily and securely send things from one computer to anotherβ35,215Updated this week
- A collection of awesome web crawler,spider in different languagesβ7,223Jun 16, 2024Updated last year
- πͺ Turns your machine learning code into microservices with web API, interactive GUI, and more.β3,136Updated this week
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Turn (almost) any Python command line program into a full GUI application with one lineβ21,892Mar 23, 2026Updated 2 months ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-timeβ59,868Mar 9, 2026Updated 3 months ago
- Diagram as Code for prototyping cloud system architecturesβ42,314Updated this week
- Automatically visualize your pandas dataframe via a single print! π π‘β5,379Mar 20, 2024Updated 2 years ago
- Visual scraping for Scrapyβ9,505Jun 26, 2024Updated last year
- Create agents that monitor and act on your behalf. Your agents are standing by!β49,404Updated this week
- Lighter web automation with Pythonβ8,297Jun 3, 2026Updated last week
- FastAPI framework, high performance, easy to learn, fast to code, ready for productionβ98,868Updated this week
- Ultimate Python study guide π π πβ5,855May 30, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- π« Industrial-strength Natural Language Processing (NLP) in Pythonβ33,634May 19, 2026Updated 3 weeks ago
- Realtime Web Apps and Dashboards for Python and Rβ4,235Jun 3, 2026Updated last week
- Hunt down social media accounts by username across social networksβ84,749Updated this week
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ67,725Updated this week
- Pythonic HTML Parsing for Humansβ’β13,828Apr 16, 2024Updated 2 years ago
- An autonomous agent that conducts deep research on any data using any LLM providersβ27,545May 28, 2026Updated last week
- Find big moving stocks before they move using machine learning and anomaly detectionβ1,867Aug 13, 2021Updated 4 years ago