kaflesudip / grabfeed
Python package to detect and return RSS / Atom feeds for a given website. The tool supports major blogging platforms, including WordPress, Blogger, Tumblr, Ghost, Svbtle, Medium, and many others.
★21 · Updated 3 years ago
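For readers unfamiliar with how feed detection usually works, here is a minimal sketch of the common autodiscovery convention, where a page advertises its feeds through `<link rel="alternate">` tags in the HTML head. This is not grabfeed's actual API or implementation; the names `FeedLinkParser` and `find_feeds` are hypothetical, and the sketch assumes the page HTML has already been fetched.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# MIME types conventionally used to advertise RSS and Atom feeds.
FEED_TYPES = {"application/rss+xml", "application/atom+xml"}


class FeedLinkParser(HTMLParser):
    """Hypothetical helper: collects feed URLs from <link rel="alternate"> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.feeds = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        # Attribute values can be None for bare attributes; normalize to "".
        attr = {name: (value or "") for name, value in attrs}
        rels = attr.get("rel", "").lower().split()
        mime = attr.get("type", "").lower().strip()
        href = attr.get("href", "")
        if "alternate" in rels and mime in FEED_TYPES and href:
            # Resolve relative hrefs against the page URL.
            self.feeds.append(urljoin(self.base_url, href))


def find_feeds(html, base_url):
    """Return feed URLs advertised in the given HTML, in document order."""
    parser = FeedLinkParser(base_url)
    parser.feed(html)
    return parser.feeds
```

Calling `find_feeds(page_html, "https://example.com/blog/")` returns a list of absolute feed URLs. Sites that do not advertise feeds this way would need other heuristics, such as probing platform-specific paths, which this sketch does not attempt.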
Alternatives and similar repositories for grabfeed:
Users who are interested in grabfeed are comparing it to the libraries listed below:
- Configuration based html scraper ★23 · Updated 2 weeks ago
- Easy extraction of keywords and engines from search engine results pages (SERPs). ★90 · Updated 3 years ago
- Find which links on a web page are pagination links ★29 · Updated 8 years ago
- Python library with common functionality for writing web scrapers ★102 · Updated 9 years ago
- Python/Django reference implementation of the ERAV data model ★21 · Updated 5 years ago
- A Python library for finding feed links on websites. ★52 · Updated 2 years ago
- E-commerce scraping and analytics platform. ★52 · Updated 9 years ago
- Python implementation of the Parsley language for extracting structured data from web pages ★92 · Updated 7 years ago
- Restrict crawl and scraping scope using matchers. ★25 · Updated 8 years ago
- Scraper for categories and lists on ecommerce and other listing websites ★42 · Updated 4 years ago
- Python code to scrape and collect data from the RSS feeds Facebook uses to augment its Trending Section ★57 · Updated 6 years ago
- RSS Aggregator ★91 · Updated 3 years ago
- TouristFriend API lets you query Google Places, Yelp and Foursquare at the same time, with Bayesian rankings! ★29 · Updated 6 years ago
- Taking a screenshot of a webpage. ★49 · Updated 9 years ago
- Automatically extracts and normalizes an online article or blog post publication date ★117 · Updated last year
- A python autocompletion library. Easycomplete has a simple API and utilizes google's autocomplete results & the english dictionary for no… ★40 · Updated 11 years ago
- A component that tries to avoid downloading duplicate content ★27 · Updated 6 years ago
- PyOpenGraph is a library written in Python for parsing Open Graph protocol information from web sites. ★94 · Updated 10 years ago
- Must-read articles and books about Python. Inspired by https://github.com/s16h/py-must-watch ★11 · Updated 8 years ago
- Utility library to turn country names into ISO two-letter codes ★66 · Updated last month
- Paginating the web ★37 · Updated 11 years ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML. ★149 · Updated 4 years ago
- Python 3 AsyncIO powered scraping framework with batteries included ★20 · Updated 8 years ago
- A python implementation of DEPTA ★83 · Updated 8 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON. ★21 · Updated 4 years ago
- Python Elasticsearch Querysets ★206 · Updated 3 years ago
- A Python script that generates a list of pairs of funny words for naming things such as app releases, internal projects, servers and chil… ★27 · Updated 8 years ago
- Small set of utilities to simplify writing Scrapy spiders. ★49 · Updated 9 years ago
- A lightweight customisable RSS reader for Django. ★171 · Updated 2 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations ★40 · Updated 10 months ago