DBeath / feedsearchLinks
Search sites for RSS, Atom, and JSON feeds.
☆18Updated 2 years ago
Alternatives and similar repositories for feedsearch
Users that are interested in feedsearch are comparing it to the libraries listed below
Sorting:
- Extract text from HTML☆134Updated 4 years ago
- A Python library for finding feed links on websites.☆52Updated 3 years ago
- RSS feed reader for Python 3☆87Updated 2 years ago
- Parse numbers written in natural language☆119Updated 8 months ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML.☆150Updated 5 years ago
- Advanced news feeds extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds☆80Updated 2 years ago
- Crawl sites for RSS, Atom, and JSON feeds.☆76Updated last year
- Atom, RSS and JSON feed parser for Python 3☆117Updated 2 years ago
- Pre-built template for using newspaper3k on aws lambda☆17Updated 2 years ago
- A Scrapy extension to log items coverage when the spider shuts down☆19Updated 5 years ago
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.☆109Updated last year
- python library for getting metadata☆146Updated 2 weeks ago
- Library that helps use puppeteer in scrapy.☆52Updated 3 weeks ago
- Python clients for Zyte AutoExtract API☆40Updated 3 years ago
- Python package to add text to images, textures and different backgrounds☆155Updated 6 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 2 months ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 8 months ago
- feedparser but faster and worse☆104Updated 3 years ago
- The most advanced debugging and testing tool for Scrapy☆16Updated 2 years ago
- This package is used to Clipped Images of Html Elements of Selenium Webdriver☆81Updated last week
- An easy-to-use python client for Google News feeds.☆50Updated 3 years ago
- Analyze scraped data☆46Updated 5 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆152Updated 5 months ago
- Measure the readability of a given text using surface characteristics☆78Updated 5 months ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 2 years ago
- Python client to use Sightengine's Image & Video Moderation services☆23Updated 6 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Common interface for data container classes☆68Updated 3 months ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆191Updated 3 years ago
- Extracts OpenGraph, TwitterCard and Schema properties from a webpage.☆83Updated last year