OryJonay / scrapy-headless
Scrapy with Headless Selenium, for scraping interactive web pages
☆10 Updated 2 years ago
Alternatives and similar repositories for scrapy-headless:
Users who are interested in scrapy-headless are comparing it to the libraries listed below.
- Datasette plugin adding a llm_embed(model_id, text) SQL function ☆12 Updated 10 months ago
- JupyterLite as a Datasette plugin ☆11 Updated 3 years ago
- Add your configs for tmux ☆13 Updated 2 years ago
- Datasette enrichment for analyzing row data using OpenAI's GPT models ☆19 Updated 8 months ago
- Datasette plugin for authenticating access using API tokens ☆11 Updated 4 months ago
- A Python script to help you add user attributions to your Twitter bots ☆11 Updated 4 years ago
- A Google Trends Analytics Package ☆13 Updated 7 months ago
- A pre-configured docker-compose files collection helping web developers. Additional Cli to manage registered compose files from everywher… ☆19 Updated last month
- ☆12 Updated last year
- Command-line tool to look up abbreviations for terms ☆25 Updated 3 years ago
- Tools to easy generate RSS feed that contains each scraped item using Scrapy framework. ☆32 Updated last month
- Automatically sync Omnivore pages to Raindrop.io ☆21 Updated last month
- webapp for unglue.it - A Free Ebook Foundation program ☆17 Updated last month
- ScrapingAnt API client for Python. ☆36 Updated 6 months ago
- LLM access to models by Anthropic, including the Claude series ☆12 Updated last month
- A Scrapy crawler for http://books.toscrape.com ☆27 Updated 7 years ago
- Tools for running enrichments against data stored in Datasette ☆21 Updated this week
- A collection of prompts for use with the LLM CLI tool ☆13 Updated last year
- Datasette plugin providing instructions for exporting data to Jupyter or Observable ☆12 Updated last year
- dropshipping knowledge basics ☆12 Updated 3 years ago
- A micro-framework for asynchronous deep crawls and web scraping with Python ☆13 Updated last year
- A python3 module that converts your bs4 Tag into json object (dict) ☆13 Updated 10 months ago
- Telegram > OpenAI > Read Later [instapaper/pocket/omnivore] ☆17 Updated last year
- SEMRush SERP Tutorial. Using advertools to Extract and Analyze Search Engine Results Pages Data ☆14 Updated 6 years ago
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community. ☆9 Updated 3 years ago
- A simple Web crawler for stackshare.io using scrapy. ☆9 Updated 5 years ago
- Web scraping python script to convert a list of Facebook events pages into a ical calendar. ☆23 Updated 4 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀 ☆29 Updated 2 years ago
- Self tracking your browser history! ☆20 Updated last year
- A Python package that simplifies the use of secrets in a Jupyter notebook ☆21 Updated 3 years ago