lorien/awesome-web-scraping

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lorien/awesome-web-scraping)

lorien / awesome-web-scraping

List of libraries, tools and APIs for web scraping and data processing.

☆7,985

Alternatives and similar repositories for awesome-web-scraping

Users that are interested in awesome-web-scraping are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BruceDone / awesome-crawler
View on GitHub
A collection of awesome web crawler,spider in different languages
☆7,257Jun 16, 2024Updated 2 years ago
lorien / grab
View on GitHub
Web Scraping Framework
☆2,461Sep 19, 2025Updated 10 months ago
scrapy / scrapy
View on GitHub
Scrapy, a fast high-level web crawling & scraping framework for Python.
☆63,405Updated this week
alirezamika / autoscraper
View on GitHub
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
☆7,725Jun 9, 2025Updated last year
scrapinghub / portia
View on GitHub
Visual scraping for Scrapy
☆9,505Jun 26, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
jnv / lists
View on GitHub
The definitive list of lists (of lists) curated on GitHub and elsewhere
☆11,355Mar 23, 2026Updated 4 months ago
apify / crawlee
View on GitHub
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data …
☆24,984Updated this week
DopplerHQ / awesome-bots
View on GitHub
The most awesome list about bots ⭐️🤖
☆4,169Jul 3, 2024Updated 2 years ago
veggiemonk / awesome-docker
View on GitHub
A curated list of Docker resources and projects
☆36,524Updated this week
huginn / huginn
View on GitHub
Create agents that monitor and act on your behalf. Your agents are standing by!
☆49,685Updated this week
alebcay / awesome-shell
View on GitHub
A curated list of awesome command-line frameworks, toolkits, guides and gizmos. Inspired by awesome-php.
☆37,327Aug 28, 2025Updated 10 months ago
MontFerret / ferret
View on GitHub
Declarative data automation language and Go runtime for structured extraction workflows.
☆6,010Updated this week
neutraltone / awesome-stock-resources
View on GitHub
A collection of links for free stock photography, video and Illustration websites
☆14,388Feb 11, 2026Updated 5 months ago
oxnr / awesome-bigdata
View on GitHub
A curated list of awesome big data frameworks, ressources and other awesomeness.
☆14,508May 19, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
TheWebScrapingClub / webscraping-from-0-to-hero
View on GitHub
The web scraping open project repository aims to share knowledge and experiences about web scraping with Python
☆1,728May 27, 2024Updated 2 years ago
agarrharr / awesome-cli-apps
View on GitHub
🖥 📊 🕹 🛠 A curated list of command line apps
☆20,022Updated this week
gztchan / awesome-design
View on GitHub
🌟 Curated design resources from all over the world.
☆17,323Jul 4, 2024Updated 2 years ago
dhamaniasad / HeadlessBrowsers
View on GitHub
A list of (almost) all headless web browsers in existence
☆6,663Oct 12, 2025Updated 9 months ago
detailyang / awesome-cheatsheet
View on GitHub
awesome cheatsheet
☆8,509Mar 26, 2026Updated 4 months ago
ChromeDevTools / awesome-chrome-devtools
View on GitHub
Awesome tooling and resources in the Chrome DevTools & DevTools Protocol ecosystem
☆7,080Mar 27, 2026Updated 3 months ago
codelucas / newspaper
View on GitHub
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
☆15,123Updated this week
inputsh / awesome-linux
View on GitHub
A list of awesome projects and resources that make Linux even more awesome.
☆5,090Feb 4, 2023Updated 3 years ago
luong-komorebi / Awesome-Linux-Software
View on GitHub
🐧 A list of awesome Linux softwares
☆25,495May 3, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
viatsko / awesome-vscode
View on GitHub
🎨 A curated list of delightful VS Code packages and resources.
☆28,909Jun 21, 2026Updated last month
kahun / awesome-sysadmin
View on GitHub
A curated list of amazingly awesome open source sysadmin resources inspired by Awesome PHP.
☆24,329Mar 26, 2024Updated 2 years ago
unicodeveloper / awesome-opensource-apps
View on GitHub
Curated list of awesome open source crafted web & mobile applications - Learn, Fork, Contribute & Most Importantly Enjoy!
☆3,867Mar 3, 2026Updated 4 months ago
awesomedata / awesome-public-datasets
View on GitHub
A topic-centric list of HQ open datasets.
☆77,686Jul 13, 2026Updated last week
scrapinghub / splash
View on GitHub
Lightweight, scriptable browser as a service with an HTTP API
☆4,190Aug 2, 2024Updated last year
lukasz-madon / awesome-remote-job
View on GitHub
A curated list of awesome remote jobs and resources. Inspired by https://github.com/vinta/awesome-python
☆47,076May 8, 2026Updated 2 months ago
herrbischoff / awesome-command-line-apps
View on GitHub
Use your terminal shell to do awesome things.
☆4,199Mar 4, 2026Updated 4 months ago
carpedm20 / awesome-hacking
View on GitHub
A curated list of awesome Hacking tutorials, tools and resources
☆16,763Jun 2, 2024Updated 2 years ago
vinta / awesome-python
View on GitHub
An opinionated list of Python frameworks, libraries, tools, and resources
☆310,276Updated this week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
gocolly / colly
View on GitHub
Elegant Scraper and Crawler Framework for Golang
☆25,390Jun 18, 2026Updated last month
sindresorhus / awesome
View on GitHub
😎 Awesome lists about all kinds of interesting topics
☆488,891Jun 30, 2026Updated 3 weeks ago
psf / requests-html
View on GitHub
Pythonic HTML Parsing for Humans™
☆13,826Apr 16, 2024Updated 2 years ago
josephmisiti / awesome-machine-learning
View on GitHub
A curated list of awesome Machine Learning frameworks, libraries and software.
☆73,714Updated this week
awesome-foss / awesome-sysadmin
View on GitHub
A curated list of amazingly awesome open-source sysadmin resources.
☆34,699Jun 20, 2026Updated last month
moul / awesome-ssh
View on GitHub
A curated list of SSH resources.
☆2,816Aug 10, 2023Updated 2 years ago
bayandin / awesome-awesomeness
View on GitHub
A curated list of awesome awesomeness
☆33,565Jun 2, 2024Updated 2 years ago