scrapy / scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
☆57,524 Updated this week
Alternatives and similar repositories for scrapy
Users who are interested in scrapy are comparing it to the libraries listed below.
- A Powerful Spider(Web Crawler) System in Python. ☆16,704 Updated last year
- Visual scraping for Scrapy ☆9,429 Updated last year
- Python Development Workflow for Humans. ☆25,078 Updated this week
- The Python micro framework for building web applications. ☆69,919 Updated last month
- A simple, yet elegant, HTTP library. ☆53,015 Updated 3 weeks ago
- The Web framework for perfectionists with deadlines. ☆84,168 Updated last week
- Lightweight, scriptable browser as a service with an HTTP API ☆4,161 Updated 11 months ago
- A collection of design patterns/idioms in Python ☆41,638 Updated 2 months ago
- Scrapy+Splash for JavaScript integration ☆3,214 Updated 5 months ago
- Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects,… ☆45,889 Updated this week
- Simple job queues for Python ☆10,251 Updated 3 weeks ago
- Distributed Task Queue (development branch) ☆26,775 Updated this week
- The world’s fastest framework for building websites. ☆82,155 Updated this week
- Tesseract Open Source OCR Engine (main repository) ☆68,025 Updated this week
- Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS ☆18,468 Updated last week
- Pythonic HTML Parsing for Humans™ ☆13,825 Updated last year
- matplotlib: plotting with Python ☆21,423 Updated this week
- Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate eve… ☆65,546 Updated last week
- scikit-learn: machine learning in Python ☆62,629 Updated this week
- Project documentation with Markdown. ☆20,732 Updated 4 months ago
- A service daemon to run Scrapy spiders ☆3,043 Updated this week
- Faker is a Python package that generates fake data for you. ☆18,537 Updated last month
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆31,935 Updated last month
- Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed. ☆22,038 Updated last week
- 🦍 The Cloud-Native API Gateway and AI Gateway. ☆41,229 Updated last week
- JavaScript API for Chrome and Firefox ☆91,138 Updated this week
- Python version of the Playwright testing and automation library. ☆13,335 Updated this week
- Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization. ☆8,823 Updated last year
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆91,472 Updated this week
- Python composable command line interface toolkit ☆16,616 Updated last week