scrapy / parsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
☆1,205Updated 2 weeks ago
Alternatives and similar repositories for parsel:
Users that are interested in parsel are comparing it to the libraries listed below
- Integration layer between Requests and Selenium for automation of web actions.☆1,835Updated 2 months ago
- A service daemon to run Scrapy spiders☆3,014Updated last month
- Scrapy Extension for monitoring spiders execution.☆539Updated 3 months ago
- Scrapy middleware to handle javascript pages using selenium☆939Updated 8 months ago
- Python library of web-related functions☆400Updated last month
- Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).☆1,231Updated last month
- HTTP API for Scrapy spiders☆850Updated 8 months ago
- Run JavaScript code from Python (EOL: https://gist.github.com/doloopwhile/8c6ec7dd4703e8a44e559411cb2ea221)☆715Updated 4 years ago
- A toolbelt of useful classes and functions to be used with python-requests☆1,006Updated 2 months ago
- Extract embedded metadata from HTML markup☆895Updated last month
- Useful extensions to the standard Python datetime features☆2,433Updated 3 weeks ago
- python parser for human readable dates☆2,629Updated this week
- A Python library for automating interaction with websites.☆4,729Updated last month
- Command line client for Scrapyd server☆773Updated 2 weeks ago
- Standards-compliant library for parsing and serializing HTML documents and fragments in Python☆1,175Updated last year
- Lightweight, scriptable browser as a service with an HTTP API☆4,133Updated 7 months ago
- 🎭 Playwright integration for Scrapy☆1,128Updated last month
- Persistent HTTP cache for python requests☆1,391Updated last month
- Yet another URL library☆1,386Updated this week
- 🌐 URL parsing and manipulation made easy.☆2,667Updated 2 weeks ago
- A scalable frontier for web crawlers☆1,307Updated last month
- A jquery-like library for python☆2,341Updated 6 months ago
- Scrapy+Splash for JavaScript integration☆3,193Updated last month
- PyMiniRacer is a V8 bridge in Python.☆736Updated 9 months ago
- File support for asyncio☆2,986Updated last month
- A pure-python HTML screen-scraping library☆1,870Updated 2 years ago
- Extends Selenium WebDriver classes to include the request function from the Requests library, while doing all the needed cookie and reque…☆493Updated last year
- Web Content Retrieval for Humans™☆618Updated 2 years ago
- Web Scraping Framework☆2,401Updated last year
- Asynchronous Python HTTP Requests for Humans using Futures☆2,113Updated 2 months ago