kennethreitz / requests-html
Pythonic HTML Parsing for Humans™
☆314 Updated 8 months ago
Alternatives and similar repositories for requests-html:
Users who are interested in requests-html are comparing it to the libraries listed below.
- Parsing JavaScript objects into Python data structures ☆202 Updated last month
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors ☆1,188 Updated 3 weeks ago
- A Scrapy middleware to bypass Cloudflare's anti-bot protection ☆106 Updated 3 years ago
- Generator of User-Agent header ☆338 Updated 8 months ago
- Proxy (HTTP, SOCKS) connector for aiohttp ☆230 Updated last month
- Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors). ☆1,215 Updated this week
- Pure Python Tor client implementation ☆415 Updated last year
- Faster requests on Python 3 ☆1,111 Updated last week
- A forward proxy server in Python ☆132 Updated last year
- A Python library to sanitize/validate a string such as filenames/file-paths/etc. ☆243 Updated this week
- A crawler demo to illustrate web crawling. ☆28 Updated 4 years ago
- Proxy (HTTP, SOCKS) transports for httpx ☆79 Updated 2 months ago
- ☆121 Updated this week
- Truly universal encoding detector in pure Python ☆619 Updated 3 weeks ago
- Web grep: search all rendered resources used by a URI ☆85 Updated 7 months ago
- Extends Selenium WebDriver classes to include the request function from the Requests library, while doing all the needed cookie and reque… ☆492 Updated 11 months ago
- ☆392 Updated last week
- A simple, yet elegant HTTP library. ☆285 Updated 10 months ago
- Scrapy middleware to handle JavaScript pages using Selenium ☆933 Updated 7 months ago
- Common interface for data container classes ☆66 Updated 2 weeks ago
- Locally saves webpages to your hard disk with images, CSS, JS & links as is. ☆573 Updated 6 months ago
- Headless Chrome/Chromium automation library (unofficial port of puppeteer) ☆3,776 Updated 7 months ago
- Python library for scraping Google search results ☆115 Updated 2 months ago
- Requests + Gevent = <3 ☆4,519 Updated 6 months ago
- 🎭 Playwright integration for Scrapy ☆1,111 Updated this week
- Simple retry client for aiohttp. ☆253 Updated 3 months ago
- A modern CSS selector implementation for BeautifulSoup ☆229 Updated this week
- A simple, Qt-WebEngine powered web browser with built-in functionality for basic Scrapy web scraping support. ☆109 Updated 9 months ago
- Advanced email sending for Python ☆401 Updated 10 months ago
- A human-readable regular expression module for Python. ☆406 Updated last year