A Smart, Automatic, Fast and Lightweight Web Scraper for Python
★7,126 · Jun 9, 2025 · Updated 9 months ago
Alternatives and similar repositories for autoscraper
Users that are interested in autoscraper are comparing it to the libraries listed below
- 🤖 Scrape data from HTML websites automatically by just providing examples ★1,379 · Mar 17, 2024 · Updated 2 years ago
- List of libraries, tools and APIs for web scraping and data processing. ★7,811 · Updated this week
- An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, fo… ★16,345 · Feb 23, 2023 · Updated 3 years ago
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. ★21,910 · Updated this week
- ⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡ ★21,254 · Mar 5, 2025 · Updated last year
- Python scraper based on AI ★23,032 · Updated this week
- Rich is a Python library for rich text and beautiful formatting in the terminal. ★55,777 · Feb 26, 2026 · Updated 3 weeks ago
- Text preprocessing, representation and visualization from zero to hero. ★2,910 · Aug 29, 2023 · Updated 2 years ago
- ★399 · Jan 31, 2026 · Updated last month
- Crawlee: A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data … ★22,366 · Updated this week
- Scrapy, a fast high-level web crawling & scraping framework for Python. ★60,886 · Updated this week
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM… ★5,517 · Sep 12, 2025 · Updated 6 months ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs: ★15,010 · Dec 6, 2025 · Updated 3 months ago
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and … ★29,093 · Dec 5, 2025 · Updated 3 months ago
- Streamlit: A faster way to build and share data apps. ★43,928 · Updated this week
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a… ★24,519 · Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows ★12,291 · Mar 14, 2026 · Updated last week
- Python version of the Playwright testing and automation library. ★14,394 · Feb 11, 2026 · Updated last month
- ☁️ Build multimodal AI applications with cloud-native stack ★21,849 · Mar 24, 2025 · Updated 11 months ago
- Easily and securely send things from one computer to another ★34,401 · Mar 12, 2026 · Updated last week
- A collection of awesome web crawler, spider in different languages ★7,148 · Jun 16, 2024 · Updated last year
- 🪄 Turns your machine learning code into microservices with web API, interactive GUI, and more. ★3,137 · Mar 11, 2026 · Updated last week
- Turn (almost) any Python command line program into a full GUI application with one line ★22,025 · Mar 12, 2026 · Updated last week
- Clone a voice in 5 seconds to generate arbitrary speech in real-time ★59,525 · Mar 9, 2026 · Updated last week
- Diagram as Code for prototyping cloud system architectures ★42,082 · Feb 7, 2026 · Updated last month
- Automatically visualize your pandas dataframe via a single print! 💡 ★5,370 · Mar 20, 2024 · Updated 2 years ago
- Visual scraping for Scrapy ★9,497 · Jun 26, 2024 · Updated last year
- Create agents that monitor and act on your behalf. Your agents are standing by! ★48,895 · Updated this week
- Lighter web automation with Python ★8,259 · Feb 4, 2026 · Updated last month
- FastAPI framework, high performance, easy to learn, fast to code, ready for production ★96,291 · Updated this week
- Ultimate Python study guide ★5,817 · Updated this week
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ★33,352 · Updated this week
- Hunt down social media accounts by username across social networks ★73,755 · Updated this week
- Realtime Web Apps and Dashboards for Python and R ★4,225 · Updated this week
- 🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ★62,080 · Updated this week
- Pythonic HTML Parsing for Humans™ ★13,869 · Apr 16, 2024 · Updated last year
- An autonomous agent that conducts deep research on any data using any LLM providers ★25,875 · Mar 14, 2026 · Updated last week
- Find big moving stocks before they move using machine learning and anomaly detection ★1,856 · Aug 13, 2021 · Updated 4 years ago
- 🕵️‍♂️ Offensive Google framework. ★18,535 · Mar 14, 2026 · Updated last week