ispras / scrapy-puppeteer-serviceLinks
A special service that runs puppeteer instances.
☆18Updated last month
Alternatives and similar repositories for scrapy-puppeteer-service
Users that are interested in scrapy-puppeteer-service are comparing it to the libraries listed below
Sorting:
- Library that helps use puppeteer in scrapy.☆52Updated last month
- Zyte Automatic Extraction integration for Scrapy☆56Updated 3 years ago
- Spider templates for automatic crawlers.☆31Updated 2 months ago
- Scrapy project boilerplate done right☆48Updated 7 months ago
- A complimentary proxy to help to use SPM with headless browsers☆108Updated 2 years ago
- Singer.io tap for Facebook Marketing API☆116Updated last week
- Web scraping Page Objects core library☆101Updated 3 weeks ago
- estela, an elastic web scraping cluster 🕸☆188Updated 2 weeks ago
- Page Object pattern for Scrapy☆121Updated 3 weeks ago
- Python clients for Zyte AutoExtract API☆40Updated 3 years ago
- Common interface for data container classes☆68Updated last week
- Extract text from HTML☆134Updated 5 years ago
- KNOTS is an intuitive desktop application built to simplify the configuration of Singer pipelines☆67Updated 2 years ago
- 💾 Script to import issues from a JIRA instance into a database.☆56Updated 2 years ago
- UpWork scraping☆49Updated 6 years ago
- Use AWS Lambda functions as a proxy pool to scrape web pages.☆137Updated last year
- 🏗️ Create APIs from CSV files within seconds, using fastapi☆77Updated 4 years ago
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆293Updated 3 months ago
- A simple python tool that generates a requests/bs4 based web scraper☆27Updated 3 years ago
- Techcrunch Incremental Scrapy Spider With MongoDB☆16Updated 6 years ago
- Library to populate items using XPath and CSS with a convenient API☆47Updated last week
- Apify actor to scrape Youtube search results. You can set the maximum videos to scrape per page as well as the date from which to start s…☆26Updated 3 years ago
- Ssebowa is free and open source library in Python that provides generative-ai models.☆14Updated last year
- Repository for the Mastering Web Scraping in Python: Scaling to Distributed Crawling blogpost with the final code.☆45Updated 3 years ago
- A free, Python proxy server running on AWS lambda☆42Updated 5 years ago
- A Python DB-API and SQLAlchemy dialect to Google Spreasheets☆221Updated 2 years ago
- Visual Studio Code extension to convert HTML to FastHTML FT☆19Updated 6 months ago
- Automatic unit test generation for Scrapy.☆57Updated 4 years ago
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)☆141Updated 2 years ago
- Scrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket.☆76Updated 3 years ago