richardpenman / webscraping
☆11 Updated 2 months ago
Alternatives and similar repositories for webscraping:
Users that are interested in webscraping are comparing it to the libraries listed below
- Scrape various open data directories to create an index of what's available out there ☆36 Updated last week
- automatic and extensive scraper for forums ☆17 Updated this week
- Where knowledge grows. ☆14 Updated 3 months ago
- advertools visualizations ☆18 Updated 7 months ago
- Track changes to GraphQL APIs by git scraping their schemas ☆28 Updated last week
- Benson turns a list of URLs into mp3s of the contents of each web page - take control over your reading backlog! ☆14 Updated 3 months ago
- ☆14 Updated 4 months ago
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production … ☆13 Updated 10 months ago
- A GitHub action for turning scanned PDFs into searchable documents ☆12 Updated last year
- Dockerized workflow automation tool ☆17 Updated this week
- Useful packages I maintain for Espanso, including a tool for communicating with ChatGPT. ☆15 Updated last year
- 🐍A curated list of awesome python environment. ☆13 Updated 4 years ago
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution. ☆28 Updated 4 months ago
- Datasette plugin for uploading CSV files and converting them to database tables ☆25 Updated 10 months ago
- Cloudflare AI API Python Wrapper ☆12 Updated 11 months ago
- Live demo of shot-scraper ☆38 Updated this week
- advertools crawler UI ☆28 Updated 2 years ago
- Host-free RSS reader in your browser. ☆15 Updated last year
- A collection of awesome Homebrew taps and resources! Stay tuned! ☆15 Updated 2 years ago
- Self tracking your browser history! ☆20 Updated last year
- 😎 A community-curated list of awesome lawtech software and learning resources for legal technology and design. ☆24 Updated 5 years ago
- Export desired amount of posts from specified subreddit and category/sort without any API wrappers ☆21 Updated last year
- Python script to extract news from RSS feeds and save it as json. ☆18 Updated 2 years ago
- Datasette pre-configured with useful plugins. Experimental alpha. ☆28 Updated 8 months ago
- A code editing & sharing utility ☆12 Updated last year
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch… ☆18 Updated last year
- Scrape and parse Google search results in Python ☆32 Updated last year
- Parse government documents into well formed JSON ☆67 Updated last week
- 🚀 OFFICIAL STARTER TEMPLATE FOR BOTASAURUS SCRAPING FRAMEWORK 🤖 ☆23 Updated 6 months ago
- Summarize and ask questions about items in the Internet Archive ☆16 Updated last year