A collection of awesome web scrapers and crawlers.
☆286 · Apr 4, 2024 · Updated 2 years ago
Alternatives and similar repositories for awesome-web-scraper
Users that are interested in awesome-web-scraper are comparing it to the libraries listed below.
- A collection of awesome web crawler, spider in different languages ☆7,184 · Jun 16, 2024 · Updated last year
- Official Python package for ArchiveBox, the self-hosted internet archiving solution. ☆12 · Oct 5, 2024 · Updated last year
- 📚 A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivity ☆99 · Sep 27, 2018 · Updated 7 years ago
- An Awesome List for getting started with web archiving ☆2,537 · Apr 27, 2026 · Updated last week
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces. ☆15 · Oct 19, 2020 · Updated 5 years ago
- Awesome Reddit subreddits ☆188 · Jun 24, 2020 · Updated 5 years ago
- A list of scrapers from around the web. ☆721 · Feb 7, 2025 · Updated last year
- Awesome Firefox Extensions ☆64 · May 19, 2022 · Updated 3 years ago
- List of libraries, tools and APIs for web scraping and data processing. ☆7,871 · Apr 17, 2026 · Updated 3 weeks ago
- A social media open post web archiving tool ☆26 · Feb 4, 2026 · Updated 3 months ago
- All the tools, processes and resources you need to create an awesome API & Project documentation ☆217 · Mar 11, 2026 · Updated last month
- brozzler - distributed browser-based web crawler ☆796 · Apr 27, 2026 · Updated last week
- Privacy resources for the layperson. Highlights resources, tools, VPNs, search engines, articles, books, and dark patterns. ☆69 · Apr 27, 2026 · Updated last week
- Awesome Research Papers ☆329 · Jul 30, 2020 · Updated 5 years ago
- A curated list of FOSS tools to improve the Hacker News experience. ☆182 · Apr 8, 2024 · Updated 2 years ago
- 🤖 A curated list of in-browser bookmarklets, tools, and resources for modern full-stack software engineers. ☆523 · Apr 24, 2026 · Updated 2 weeks ago
- Search all awesome lists in seconds. ☆647 · Mar 22, 2026 · Updated last month
- Send starred github repos to pinboard ☆43 · May 22, 2023 · Updated 2 years ago
- Interact with ArchiveBox to automatically archive all your saved reddit posts and comments. ☆20 · Nov 26, 2022 · Updated 3 years ago
- Awesome Podcasts ☆93 · Apr 7, 2023 · Updated 3 years ago
- 💡Limiting personal data leaks on the internet ☆1,013 · Jan 23, 2024 · Updated 2 years ago
- Awesome Chrome Extensions ☆477 · Mar 3, 2026 · Updated 2 months ago
- Unofficial Anna's Archive API written in JS. ☆59 · Jul 20, 2023 · Updated 2 years ago
- OSINT Bookmarks for Firefox / Chrome / Edge / Safari ☆66 · May 24, 2020 · Updated 5 years ago
- Awesome Command Line Utilities ☆484 · Oct 12, 2024 · Updated last year
- Mirrored from https://gitea.zoemp.be/sansguidon/bookmarks ! +5K awesome resources for geeks and software crafters ☆533 · Nov 12, 2025 · Updated 5 months ago
- Export your Github activity: events, repositories, stars, etc. ☆57 · Jan 31, 2026 · Updated 3 months ago
- Offline-first web browser ☆92 · Jan 14, 2019 · Updated 7 years ago
- Awesome Privacy - A curated list of services and alternatives that respect your privacy because PRIVACY MATTERS. With repository stars⭐ a… ☆47 · Updated this week
- 🕹 A curated list of awesome things on Discord. ☆525 · Aug 17, 2024 · Updated last year
- An Alfred Workflow for your Readwise account ☆16 · Apr 22, 2026 · Updated 2 weeks ago
- An #OSINT Framework to perform various recon techniques on Companies, People, Phone Number, Bitcoin Addresses, etc., aggregate all the r… ☆13 · Jan 3, 2018 · Updated 8 years ago
- A curated list of open source, high-quality, popular and well maintained "zero-configuration" (#0CJS) toolkits ☆548 · Dec 22, 2019 · Updated 6 years ago
- 📄 A curated list of awesome developer personal websites ☆261 · Jun 8, 2021 · Updated 4 years ago
- A curated list of awesome lists of awesome lists. ☆219 · Jan 24, 2021 · Updated 5 years ago
- Node scraper that checks for links in different places and creates a json file with a direct reference to any mega.co.nz links posted in … ☆16 · Jul 23, 2017 · Updated 8 years ago
- A collection of awesome scripts from developers around the globe. ☆223 · Oct 5, 2023 · Updated 2 years ago
- A collection of useful things regarding Actions on Google. ☆108 · Oct 31, 2020 · Updated 5 years ago
- A reddit bot that finds original publish dates on linked articles. ☆10 · Nov 30, 2024 · Updated last year