ZenRows / scaling-to-distributed-crawling
Repository for the Mastering Web Scraping in Python: Scaling to Distributed Crawling blogpost with the final code.
β40Updated 2 years ago
Related projects: β
- Web scraping Page Objects core libraryβ93Updated 2 months ago
- ποΈ Create APIs from CSV files within seconds, using fastapiβ78Updated 3 years ago
- Building a Concurrent Web Scraper with Python and Seleniumβ35Updated 2 years ago
- Python bindings for Upwork API (OAuth2)β35Updated last year
- β οΈ Development moved to Sourcehutβ51Updated last year
- Scrapfly Python SDK for headless browsers and proxy rotationβ30Updated this week
- Learn how to scrape websites with Python, Selenium, Requests HTML, Celery, FastAPI, & NoSQL with Cassandra via AstraDB.β89Updated 2 years ago
- FastAPI-PostgreSQL-Celery-RabbitMQ-Redis bakcend with Docker containerizationβ68Updated last year
- Code examples on how to integrate various types of scrapers with Scraper API.β23Updated 3 years ago
- Page Object pattern for Scrapyβ119Updated 2 months ago
- Product scraping from Walmart Canada website, with further cleaning and integration of data from a different store.β15Updated last year
- A simple OCR tool made using FastAPI and Tesseractβ42Updated 3 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clientsβ32Updated 10 months ago
- Zyte Automatic Extraction integration for Scrapyβ55Updated 2 years ago
- β18Updated 3 years ago
- Python clients for Zyte AutoExtract APIβ39Updated 2 years ago
- FastAPI with Docker and Traefikβ105Updated last year
- deploying an ML model to Heroku with FastAPIβ48Updated 5 months ago
- Library that helps use puppeteer in scrapy.β51Updated this week
- List of automatically rated Python packages for FastAPI.β14Updated last week
- A Python library to interact with ScrapingBee's API for headless browsers and proxy rotationβ19Updated 2 weeks ago
- Python client for Zyte APIβ19Updated 3 months ago
- Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of theβ¦β36Updated last month
- JavaScript support and proxy rotation for Scrapy with ScrapingBee.β34Updated 4 months ago
- A Minimalist End-to-End Scrapy Tutorialβ69Updated 2 years ago
- Spider templates for automatic crawlers.β19Updated this week
- Fast API SAAS Base Appβ70Updated 3 years ago
- Backend, modern REST API for obtaining match and odds data crawled from multiple sites. Using FastAPI, MongoDB as database, Motor as asynβ¦β61Updated last year
- Working example for serving a ML model using FastAPI and Celeryβ71Updated 2 years ago
- FastAPI + GraphQLβ47Updated last year