scrapy/parsel

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

☆1,345

Alternatives and similar repositories for parsel

Users who are interested in parsel are comparing it to the libraries listed below.

scrapy / w3lib
Python library of web-related functions
☆419 · Updated this week
scrapy / itemloaders
Library to populate items using XPath and CSS with a convenient API
☆49 · Updated this week
scrapy / cssselect
CSS Selectors for Python
☆309 · Updated this week
scrapinghub / scrapy-poet
Page Object pattern for Scrapy
☆127 · Jun 8, 2026 · Updated last month
scrapinghub / extruct
Extract embedded metadata from HTML markup
☆967 · Apr 1, 2026 · Updated 3 months ago
encode / httpx
A next generation HTTP client for Python. 🦋
☆15,371 · Mar 29, 2026 · Updated 3 months ago
scrapinghub / dateparser
python parser for human readable dates
☆2,845 · Updated this week
scrapinghub / scrapyrt
HTTP API for Scrapy spiders
☆882 · Jun 29, 2026 · Updated 3 weeks ago
Granitosaurus / parsel-cli
cli for evaluating css and xpath selectors
☆29 · Jul 4, 2023 · Updated 3 years ago
scrapy / itemadapter
Common interface for data container classes
☆70 · Updated this week
scrapinghub / shub
Scrapinghub Command Line Client
☆129 · Updated this week
scrapinghub / splash
Lightweight, scriptable browser as a service with an HTTP API
☆4,190 · Aug 2, 2024 · Updated last year
tryolabs / requestium
Integration layer between Requests and Selenium for automation of web actions.
☆1,834 · Updated this week
scrapy / scrapely
A pure-python HTML screen-scraping library
☆1,884 · Apr 4, 2022 · Updated 4 years ago
psf / requests-html
Pythonic HTML Parsing for Humans™
☆13,826 · Apr 16, 2024 · Updated 2 years ago
scrapy / scrapyd
A service daemon to run Scrapy spiders
☆3,097 · Updated this week
scrapinghub / js2xml
Convert Javascript code to an XML document
☆188 · Mar 14, 2022 · Updated 4 years ago
scrapy / queuelib
Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python
☆299 · Updated this week
rushter / selectolax
Python binding to Modest and Lexbor engines. Fast HTML5 parser with CSS selectors for Python.
☆1,658 · Jul 15, 2026 · Updated last week
scrapy-plugins / scrapy-splash
Scrapy+Splash for JavaScript integration
☆3,229 · Feb 11, 2025 · Updated last year
jd / tenacity
Retrying library for Python
☆8,731 · Jul 15, 2026 · Updated last week
scrapy / protego
A pure-Python robots.txt parser with support for modern conventions.
☆90 · Updated this week
scrapinghub / spidermon
Scrapy Extension for monitoring spiders execution.
☆561 · May 28, 2026 · Updated last month
jmespath / jmespath.py
JMESPath is a query language for JSON.
☆2,451 · Apr 20, 2026 · Updated 3 months ago
Tinche / aiofiles
File support for asyncio
☆3,252 · Jul 18, 2026 · Updated last week
python-pendulum / pendulum
Python datetimes made easy
☆6,672 · Jul 6, 2026 · Updated 2 weeks ago
aio-libs / aiohttp
Asynchronous HTTP client/server framework for asyncio and Python
☆16,504 · Updated this week
gawel / pyquery
A jquery-like library for python
☆2,381 · Updated this week
python-attrs / attrs
Python Classes Without Boilerplate
☆5,818 · Updated this week
scrapy-plugins / scrapy-playwright
🎭 Playwright integration for Scrapy
☆1,436 · Updated this week
scrapy-plugins / scrapy-pagestorage
A scrapy extension to store requests and responses information in storage service
☆27 · Mar 11, 2022 · Updated 4 years ago
lxml / lxml
The lxml XML toolkit for Python
☆3,046 · Updated this week
scrapinghub / portia
Visual scraping for Scrapy
☆9,505 · Jun 26, 2024 · Updated 2 years ago
ijl / orjson
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy
☆8,174 · Updated this week
scrapinghub / price-parser
Extract price amount and currency symbol from a raw text string
☆346 · Mar 19, 2026 · Updated 4 months ago
redapple / parslepy
Python implementation of the Parsley language for extracting structured data from web pages
☆92 · Oct 26, 2017 · Updated 8 years ago
dateutil / dateutil
Useful extensions to the standard Python datetime features
☆2,631 · May 19, 2026 · Updated 2 months ago
gruns / furl
🌐 The easiest way to parse and modify URLs in Python.
☆2,810 · Feb 22, 2026 · Updated 5 months ago
scrapy / scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
☆63,405 · Updated this week