A collection of awesome web scrapers and crawlers.
☆288Apr 4, 2024Updated 2 years ago
Alternatives and similar repositories for awesome-web-scraper
Users that are interested in awesome-web-scraper are comparing it to the libraries listed below.
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆12Oct 5, 2024Updated last year
- A curated list of awesome WordPress plugins for developers.☆39Nov 14, 2014Updated 11 years ago
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.☆28Oct 5, 2024Updated last year
- 📚 A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivity☆99Sep 27, 2018Updated 7 years ago
- An Awesome List for getting started with web archiving☆2,551Apr 27, 2026Updated last month
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆15Oct 19, 2020Updated 5 years ago
- Awesome Reddit subreddits☆187Jun 24, 2020Updated 5 years ago
- A list of scrapers from around the web.☆721Feb 7, 2025Updated last year
- List of libraries, tools and APIs for web scraping and data processing.☆7,910Apr 17, 2026Updated last month
- A website crawler implementation written in PHP. Highly extensible, indexes PDFs, and is very memory efficient.☆10Apr 18, 2026Updated last month
- brozzler - distributed browser-based web crawler☆796May 19, 2026Updated last week
- Awesome Research Papers☆330Jul 30, 2020Updated 5 years ago
- A curated list of FOSS tools to improve the Hacker News experience.☆182Apr 8, 2024Updated 2 years ago
- 🤖 A curated list of in-browser bookmarklets, tools, and resources for modern full-stack software engineers.☆524Apr 24, 2026Updated last month
- Delightful Markdown stuff.☆923Aug 21, 2024Updated last year
- A curated list of awesome packages, articles, and other cool resources from the Scrapy community.☆560Dec 28, 2022Updated 3 years ago
- Search all awesome lists in seconds.☆648Mar 22, 2026Updated 2 months ago
- Send starred github repos to pinboard☆43May 22, 2023Updated 3 years ago
- Interact with ArchiveBox to automatically archive all your saved reddit posts and comments.☆20Nov 26, 2022Updated 3 years ago
- permanently forked from Win10-Initial-Setup-Script☆11May 22, 2018Updated 8 years ago
- Awesome Podcasts☆93Apr 7, 2023Updated 3 years ago
- Awesome Chrome Extensions☆477Mar 3, 2026Updated 2 months ago
- 💡Limiting personal data leaks on the internet☆1,018Jan 23, 2024Updated 2 years ago
- A collection of helper functions written in PHP for PHP☆13Jun 28, 2017Updated 8 years ago
- Search the awesome curated list without browser☆284Dec 8, 2022Updated 3 years ago
- Awesome Command Line Utilities☆485May 19, 2026Updated last week
- Export your Github activity: events, repositories, stars, etc.☆57Jan 31, 2026Updated 3 months ago
- Mirrored from https://gitea.zoemp.be/sansguidon/bookmarks ! +5K awesome resources for geeks and software crafters☆535Nov 12, 2025Updated 6 months ago
- Offline-first web browser☆92Jan 14, 2019Updated 7 years ago
- Awesome Privacy - A curated list of services and alternatives that respect your privacy because PRIVACY MATTERS. With repository stars⭐ a…☆47May 20, 2026Updated last week
- 🕹 A curated list of awesome things on Discord.☆525May 11, 2026Updated 2 weeks ago
- An Alfred Workflow for your Readwise account☆16Apr 22, 2026Updated last month
- A curated list of open source, high-quality, popular and well maintained "zero-configuration" (#0CJS) toolkits☆548Dec 22, 2019Updated 6 years ago
- 📄 A curated list of awesome developer personal websites☆264Jun 8, 2021Updated 4 years ago
- A Curated Collection of the Best Twitter Bots 🤖☆120Feb 26, 2019Updated 7 years ago
- A curated list of awesome lists of awesome lists.☆224Jan 24, 2021Updated 5 years ago
- A collection of awesome scripts from developers around the globe.☆224Oct 5, 2023Updated 2 years ago
- A reddit bot that finds original publish dates on linked articles.☆10Nov 30, 2024Updated last year
- A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.☆365Dec 9, 2025Updated 5 months ago