DBeath / feedsearchLinks
Search sites for RSS, Atom, and JSON feeds.
☆18Updated 2 years ago
Alternatives and similar repositories for feedsearch
Users that are interested in feedsearch are comparing it to the libraries listed below
Sorting:
- Extract text from HTML☆135Updated 4 years ago
- A Python library for finding feed links on websites.☆52Updated 2 years ago
- Crawl sites for RSS, Atom, and JSON feeds.☆75Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆191Updated 3 years ago
- Parse numbers written in natural language☆116Updated 7 months ago
- Pre-built template for using newspaper3k on aws lambda☆17Updated 2 years ago
- RSS feed reader for Python 3☆87Updated 2 years ago
- an experimental implementation of Burrow's delta in Python 3☆21Updated 3 years ago
- Advanced news feeds extractor and finder library. Helps to automatically extract news from websites without RSS/ATOM feeds☆80Updated 2 years ago
- Python clients for Zyte AutoExtract API☆40Updated 3 years ago
- Extract dates from text☆64Updated 4 years ago
- python library for getting metadata☆146Updated this week
- Atom, RSS and JSON feed parser for Python 3☆117Updated 2 years ago
- 📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!☆19Updated 2 years ago
- Python package for converting xml and epubs to text files☆34Updated 4 years ago
- Web scraping Page Objects core library☆101Updated last week
- Restful Autocomplete service with Neo4j graph backend. Returns top suggestions.☆40Updated 5 months ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML.☆150Updated 5 years ago
- Django QuerySet like interface to query simple Python collections☆68Updated last year
- This is a REST Server endpoint built using Flask and Python.☆24Updated 2 years ago
- Extract structured data from HTML and XML documents like a boss.☆49Updated 6 months ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.☆56Updated 3 years ago
- This is the frontend layer of SearchX. SearchX is a scalable collaborative search system being developed by Lambda Lab of TU Delft.☆15Updated 2 years ago
- Versatile Metrics Collection for Python☆19Updated last year
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆151Updated 4 months ago
- Pyppeteer integration for Scrapy☆58Updated 4 years ago
- Python 3 AsyncIO powered scraping framework with batteries included☆20Updated 8 years ago
- Scrapy middleware which allows to crawl only new content☆79Updated 2 years ago
- Python package that offers text scrubbing functionality, providing building blocks for string cleaning as well as normalizing geographica…☆22Updated 9 months ago