A Smart, Automatic, Fast and Lightweight Web Scraper for Python
β7,297Jun 9, 2025Updated last year
Alternatives and similar repositories for autoscraper
Users that are interested in autoscraper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π€ Scrape data from HTML websites automatically by just providing examplesβ1,385Mar 17, 2024Updated 2 years ago
- List of libraries, tools and APIs for web scraping and data processing.β7,948May 28, 2026Updated last month
- An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, foβ¦β16,387Feb 23, 2023Updated 3 years ago
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β22,701Updated this week
- β‘ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes β‘β21,461Updated this week
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Python scraper based on AIβ27,473Jun 23, 2026Updated last week
- Rich is a Python library for rich text and beautiful formatting in the terminal.β56,713Jun 23, 2026Updated last week
- Text preprocessing, representation and visualization from zero to hero.β2,912Aug 29, 2023Updated 2 years ago
- CrawleeβA web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data β¦β24,227Updated this week
- Scrapy, a fast high-level web crawling & scraping framework for Python.β62,529Updated this week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and β¦β29,685Dec 5, 2025Updated 6 months ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:β15,081May 13, 2026Updated last month
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XMβ¦β6,203Updated this week
- Streamlit β A faster way to build and share data apps.β45,050Updated this week
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and aβ¦β25,764Updated this week
- π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflowsβ12,683Jun 22, 2026Updated last week
- Python version of the Playwright testing and automation library.β14,776Updated this week
- Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.β2,478Jan 11, 2026Updated 5 months ago
- βοΈ Build multimodal AI applications with cloud-native stackβ21,863Mar 24, 2025Updated last year
- Easily and securely send things from one computer to anotherβ35,328Jun 19, 2026Updated last week
- A collection of awesome web crawler,spider in different languagesβ7,239Jun 16, 2024Updated 2 years ago
- πͺ Turns your machine learning code into microservices with web API, interactive GUI, and more.β3,136Jun 20, 2026Updated last week
- Turn (almost) any Python command line program into a full GUI application with one lineβ21,897Mar 23, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Clone a voice in 5 seconds to generate arbitrary speech in real-timeβ59,961Mar 9, 2026Updated 3 months ago
- Diagram as Code for prototyping cloud system architecturesβ42,397Jun 9, 2026Updated 3 weeks ago
- Automatically visualize your pandas dataframe via a single print! π π‘β5,380Mar 20, 2024Updated 2 years ago
- Visual scraping for Scrapyβ9,506Jun 26, 2024Updated 2 years ago
- Create agents that monitor and act on your behalf. Your agents are standing by!β49,514Jun 20, 2026Updated last week
- Lighter web automation with Pythonβ8,309Jun 3, 2026Updated 3 weeks ago
- FastAPI framework, high performance, easy to learn, fast to code, ready for productionβ99,556Jun 21, 2026Updated last week
- Ultimate Python study guide π π πβ5,881Updated this week
- π« Industrial-strength Natural Language Processing (NLP) in Pythonβ33,697May 19, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Realtime Web Apps and Dashboards for Python and Rβ4,242Jun 11, 2026Updated 2 weeks ago
- Hunt down social media accounts by username across social networksβ85,724Updated this week
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ70,185Updated this week
- Pythonic HTML Parsing for Humansβ’β13,829Apr 16, 2024Updated 2 years ago
- An autonomous agent that conducts deep research on any data using any LLM providersβ27,929Updated this week
- Find big moving stocks before they move using machine learning and anomaly detectionβ1,866Aug 13, 2021Updated 4 years ago
- CrawleeβA web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dowβ¦β9,266Updated this week