richardpenman / webscrapingLinks
β11Updated 3 weeks ago
Alternatives and similar repositories for webscraping
Users that are interested in webscraping are comparing it to the libraries listed below
Sorting:
- πA curated list of awesome python environment.β13Updated 5 years ago
- Scrape various open data directories to create an index of what's available out thereβ37Updated 3 months ago
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production β¦β14Updated last year
- You know, an awesome list of search engines.β22Updated 3 months ago
- Generate a longtail keywords for SEO // Generador de palabras clave largas para SEOβ12Updated 7 years ago
- A list of awesome browser extensions to help ith SEO and rank higher!β23Updated 4 years ago
- Transcribe a Youtube video's captions with timestamps into Obsidian MD formatβ12Updated 2 years ago
- Koalati tool that checks for on-site SEO of a page and that provides suggestions for improvement.β22Updated 3 years ago
- A multi-threaded fast script to check broken links on any WordPress website. Checks all the posts, looks for broken internal and externalβ¦β17Updated last year
- Python script to extract news from RSS feeds and save it as json.β17Updated 2 years ago
- Use GPTparser with your OpenAI API to scrape & parse files into structured JSON files.β14Updated last year
- advertools visualizationsβ19Updated 10 months ago
- Telegram > OpenAI > Read Later [instapaper/pocket/omnivore]β17Updated last year
- Your "yellow pages" of Enterprise Free Software Publishers, their products and success casesβ17Updated 11 months ago
- A lightweight scraper that extracts pages of job information from hk.jobsdb.com and th.jobsdb.com into to a JSON file.β11Updated 3 weeks ago
- A Google Trends Analytics Packageβ13Updated last year
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.β28Updated 8 months ago
- Use markdown as document (by casual-markdown parser)β13Updated last year
- The official Python library for Formulaicβ16Updated last year
- Download a webpage as an e-bookβ7Updated this week
- Summarize and ask questions about items in the Internet Archiveβ17Updated 2 years ago
- Datasette plugin for searching all searchable tables at onceβ24Updated 9 months ago
- Google Search Results Pages Dashboardβ37Updated 2 years ago
- Generate a list of keywords from any text.β31Updated 5 years ago
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Archβ¦β19Updated last year
- Track changes to GraphQL APIs by git scraping their schemasβ28Updated last month
- Ask AI to test your website with a specific goalβ13Updated last year
- Examples of different ways to embed Mastodon timelines (and posts) in HTMLβ19Updated last month
- Diff filtering, text mapping, and windowed transforms for LLM appsβ17Updated last month
- β14Updated 2 years ago