Nykakin / chompjs
Parsing JavaScript objects into Python data structures
☆203Updated 3 weeks ago
Alternatives and similar repositories for chompjs:
Users that are interested in chompjs are comparing it to the libraries listed below
- Page Object pattern for Scrapy☆121Updated last month
- Extract price amount and currency symbol from a raw text string☆323Updated last month
- Web scraping Page Objects core library☆99Updated last month
- Scrapy Extension for monitoring spiders execution.☆540Updated 3 months ago
- Automatic unit test generation for Scrapy.☆56Updated 3 years ago
- Zyte API integration for Scrapy☆38Updated 2 weeks ago
- A Scrapy middleware to bypass the CloudFlare's anti-bot protection☆109Updated 3 years ago
- Extract text from HTML☆135Updated 4 years ago
- Parse numbers written in natural language☆110Updated 5 months ago
- Common interface for data container classes☆67Updated last week
- Library to populate items using XPath and CSS with a convenient API☆48Updated last week
- Ultimate Website Sitemap Parser☆199Updated this week
- Web grep: search all rendered resources used by a URI☆87Updated this week
- Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).☆1,237Updated last month
- Splash + HAProxy + Docker Compose☆197Updated 6 years ago
- Scrapy download handler that can impersonate browser' TLS signatures or JA3 fingerprints.☆141Updated last week
- Convert Javascript code to an XML document☆186Updated 3 years ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.☆56Updated 3 years ago
- Detect and classify pagination links☆102Updated 4 years ago
- The most advanced debugging and testing tool for Scrapy☆16Updated last year
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,210Updated this week
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.☆109Updated 10 months ago
- Run a Scrapy spider programmatically from a script or a Celery task - no project required.☆122Updated 9 months ago
- Library that helps use puppeteer in scrapy.☆52Updated 3 weeks ago
- Python clients for Zyte AutoExtract API☆40Updated 3 years ago
- Proxy (HTTP, SOCKS) connector for aiohttp☆234Updated 3 months ago
- ☆128Updated last year
- 🎭 Playwright integration for Scrapy☆1,134Updated last month
- 🕶 Awesome list of Scrapy tools and libraries☆59Updated 4 years ago
- Async WebDriver implementation for asyncio and asyncio-compatible frameworks☆358Updated 11 months ago