alirezamika / autoscraperLinks
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
β7,066Updated 7 months ago
Alternatives and similar repositories for autoscraper
Users that are interested in autoscraper are comparing it to the libraries listed below
Sorting:
- π€ Scrape data from HTML websites automatically by just providing examplesβ1,369Updated last year
- Python version of the Playwright testing and automation library.β14,127Updated last month
- Programmatically collect normalized news from (almost) any website.β2,971Updated 5 years ago
- Lighter web automation with Pythonβ8,193Updated 2 months ago
- Realtime Web Apps and Dashboards for Python and Rβ4,218Updated 3 weeks ago
- Headless chrome/chromium automation library (unofficial port of puppeteer)β3,932Updated last year
- The web scraping open project repository aims to share knowledge and experiences about web scraping with Pythonβ1,696Updated last year
- List of libraries, tools and APIs for web scraping and data processing.β7,714Updated 3 months ago
- π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflowsβ12,003Updated last week
- borb is a library for reading, creating and manipulating PDF files in python.β3,551Updated 2 weeks ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XMβ¦β5,151Updated 4 months ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:β14,927Updated last month
- π Playwright integration for Scrapyβ1,341Updated last month
- Archivy is a self-hostable knowledge repository that allows you to learn and retain information in your own personal and extensible wiki.β3,252Updated 2 years ago
- A Python library for automating interaction with websites.β4,836Updated last month
- a delightful machine learning tool that allows you to train, test, and use models without writing codeβ3,134Updated last month
- Scrape job websites into a single spreadsheet with no duplicates.β2,115Updated last month
- πΈ Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell β¦β7,123Updated 2 years ago
- Analysis of Bot Protection systems with available countermeasures πΏ. How to defeat anti-bot system π» and get around browser fingerprintβ¦β4,934Updated last year
- Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI.β¦β3,401Updated 10 months ago
- Hide your scrapers IP behind the cloud. Provision proxy servers across different cloud providers to improve your scraping success.β1,652Updated 3 weeks ago
- Transforms PDF, Documents and Images into Enriched Structured Dataβ6,149Updated 2 years ago
- Headless chrome/chromium automation library (unofficial port of puppeteer)β3,561Updated 4 years ago
- Visual scraping for Scrapyβ9,478Updated last year
- Python tool for grabbing text via screenshotβ1,775Updated last year
- π π€ AI for medical and scientific papersβ1,678Updated 6 months ago
- Scrapoxy is a super proxies manager that orchestrates all your proxies into one place, rather than spreading management across multiple sβ¦β2,420Updated 5 months ago
- An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, foβ¦β16,291Updated 2 years ago
- Python APIs for web automation, testing, and bypassing bot-detection with ease.β12,090Updated this week
- The free Zapier/IFTTT alternative for developers to automate your workflows based on Github actionsβ3,316Updated 2 months ago