ScrapingBee / scrapy-scrapingbeeLinks
JavaScript support and proxy rotation for Scrapy with ScrapingBee.
☆38Updated last year
Alternatives and similar repositories for scrapy-scrapingbee
Users that are interested in scrapy-scrapingbee are comparing it to the libraries listed below
Sorting:
- Web scraping Page Objects core library☆103Updated 2 weeks ago
- Python clients for Zyte AutoExtract API☆41Updated 3 years ago
- Extract text from HTML☆135Updated 5 years ago
- Automatic unit test generation for Scrapy.☆57Updated 4 years ago
- A Python implementation of Lunr.js 🌖☆201Updated 9 months ago
- Parse numbers written in natural language☆123Updated last year
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆159Updated 3 years ago
- Page Object pattern for Scrapy☆124Updated last month
- Analyze scraped data☆46Updated 6 years ago
- Extract price amount and currency symbol from a raw text string☆342Updated 2 months ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆157Updated 2 months ago
- Scalable String Similarity Joins in Python☆39Updated last year
- Comparing Polars to Pandas and a small introduction☆44Updated 4 years ago
- GCP's Cloud Tasks + Cloud Scheduler + FastAPI = Partial replacement for celery.☆43Updated last year
- Generate reports for spaCy models.☆29Updated 3 years ago
- A Python module to convert natural language numerics into ints and floats.☆233Updated last year
- Scrapy schema validation pipeline and Item builder using JSON Schema☆44Updated 4 years ago
- A maximum-strength name parser for record linkage.☆39Updated 3 months ago
- Library to populate items using XPath and CSS with a convenient API☆47Updated this week
- Zyte Automatic Extraction integration for Scrapy☆56Updated 3 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 4 years ago
- geonamescache - a Python library for quick access to a subset of GeoNames data.☆119Updated 2 months ago
- spaCy match and replace, maintaining conjugation☆36Updated 3 years ago
- Detect and classify pagination links☆104Updated 2 months ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- 🏗️ Create APIs from CSV files within seconds, using fastapi☆79Updated 4 years ago
- A free, Python proxy server running on AWS lambda☆42Updated 5 years ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆62Updated 2 weeks ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- Restful Autocomplete service with Neo4j graph backend. Returns top suggestions.☆40Updated last week