alirezamika / autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
★6,960 · Updated 3 months ago
Alternatives and similar repositories for autoscraper
Users who are interested in autoscraper are comparing it to the libraries listed below.
- 🤖 Scrape data from HTML websites automatically by just providing examples · ★1,363 · Updated last year
- The web scraping open project repository aims to share knowledge and experiences about web scraping with Python · ★1,674 · Updated last year
- Headless chrome/chromium automation library (unofficial port of puppeteer) · ★3,905 · Updated last year
- List of libraries, tools and APIs for web scraping and data processing. · ★7,339 · Updated 9 months ago
- 🪄 Turns your machine learning code into microservices with web API, interactive GUI, and more. · ★3,133 · Updated last week
- A curated list of awesome packages, articles, and other cool resources from the Scrapy community. · ★550 · Updated 2 years ago
- Python version of the Playwright testing and automation library. · ★13,714 · Updated last week
- Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / Cloudflare IUAM) · ★11,799 · Updated 2 months ago
- A Python module to bypass Cloudflare's anti-bot page. · ★5,597 · Updated 3 months ago
- Lighter web automation with Python · ★8,040 · Updated 5 months ago
- If Google News had a Python library · ★1,368 · Updated 9 months ago
- Crawlee: A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow… · ★6,322 · Updated this week
- A service daemon to run Scrapy spiders · ★3,061 · Updated 3 weeks ago
- Async Python 3.6+ web scraping micro-framework based on asyncio · ★1,750 · Updated 2 years ago
- Headless chrome/chromium automation library (unofficial port of puppeteer) · ★3,569 · Updated 4 years ago
- Visual scraping for Scrapy · ★9,455 · Updated last year
- Realtime Web Apps and Dashboards for Python and R · ★4,190 · Updated this week
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html · ★887 · Updated 2 weeks ago
- The free Zapier/IFTTT alternative for developers to automate your workflows based on GitHub Actions · ★3,293 · Updated 3 months ago
- Up-to-date simple useragent faker with real world database · ★3,985 · Updated this week
- Text preprocessing, representation and visualization from zero to hero. · ★2,908 · Updated 2 years ago
- Intelligent proxy pool for Humans™ to extract content from the internet and build your own Large Language Models in this new AI era · ★4,015 · Updated 3 months ago
- 🎭 Playwright integration for Scrapy · ★1,271 · Updated last month
- An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, fo… · ★16,227 · Updated 2 years ago
- Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world. · ★2,261 · Updated last month
- Lightweight, scriptable browser as a service with an HTTP API · ★4,174 · Updated last year
- Deploy a ML inference service on a budget in less than 10 lines of code. · ★1,346 · Updated last year
- A delightful machine learning tool that allows you to train, test, and use models without writing code · ★3,131 · Updated 2 years ago
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. · ★20,480 · Updated this week
- Extract embedded metadata from HTML markup · ★933 · Updated 3 weeks ago