lorien / awesome-web-scraping
List of libraries, tools and APIs for web scraping and data processing.
☆6,831Updated 3 weeks ago
Alternatives and similar repositories for awesome-web-scraping:
Users that are interested in awesome-web-scraping are comparing it to the libraries listed below
- A collection of awesome web crawler,spider in different languages☆6,575Updated 7 months ago
- Web Scraping Framework☆2,400Updated 10 months ago
- A list of (almost) all headless web browsers in existence☆6,282Updated 6 months ago
- 🔍 A helpful checklist/collection of Search Engine Optimization (SEO) tips and techniques.☆2,484Updated 8 months ago
- A curated list of awesome packages, articles, and other cool resources from the Scrapy community.☆541Updated 2 years ago
- Visual scraping for Scrapy☆9,338Updated 6 months ago
- Lightweight, scriptable browser as a service with an HTTP API☆4,112Updated 5 months ago
- A curated list of awesome puppeteer resources.☆2,425Updated 5 months ago
- admin ui for scrapy/open source scrapinghub☆2,748Updated last year
- A scalable frontier for web crawlers☆1,304Updated last year
- A pure-python HTML screen-scraping library☆1,869Updated 2 years ago
- The most awesome list about bots ⭐️🤖☆3,862Updated 6 months ago
- A collection of services with great free tiers for developers on a budget. Sponsored by Mockoon, the best mock API tool. https://mockoon.…☆12,177Updated 2 months ago
- Compiled list of links from "Ask HN: Where can I post my startup to get beta users?"☆6,066Updated 2 weeks ago
- Declarative web scraping☆5,768Updated last month
- A service daemon to run Scrapy spiders☆2,984Updated 3 weeks ago
- The definitive list of lists (of lists) curated on GitHub and elsewhere☆10,116Updated 3 months ago
- ☆3,704Updated 4 years ago
- Distributed crawler powered by Headless Chrome☆5,540Updated last year
- A Python library for automating interaction with websites.☆4,694Updated 2 months ago
- Scrapy+Splash for JavaScript integration☆3,171Updated last year
- Scrapoxy is a super proxies manager that orchestrates all your proxies into one place, rather than spreading management across multiple s…☆2,113Updated this week
- Web data extraction tool implemented as chrome extension☆1,325Updated 6 years ago
- Random proxy middleware for Scrapy☆1,659Updated 5 years ago
- Use your terminal shell to do awesome things.☆3,984Updated 3 years ago
- Resources for independent developers to make money☆10,050Updated 7 months ago
- Cool open source projects. Choose your project and get involved in Open Source development now.☆9,471Updated 10 months ago
- This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.☆1,184Updated last year
- A curated list of awesome big data frameworks, ressources and other awesomeness.☆13,382Updated 8 months ago
- Scrapy middleware to handle javascript pages using selenium☆932Updated 6 months ago