mawenbao / gofeed
gofeed is designed to extract full-text RSS feeds from websites that provide only partial feeds or none at all
☆9Updated 10 years ago
Alternatives and similar repositories for gofeed
Users that are interested in gofeed are comparing it to the libraries listed below
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆47Updated 7 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- Suite of tools for detecting changes in web pages and their rendering☆54Updated last year
- Calibre HTML and OPDS web server based on CakePHP☆39Updated 9 years ago
- A component based data flow framework with a drag-n-drop Web 2.0 interface. Based on Stackless Python and inspired by Yahoo! Pipes.☆150Updated 12 years ago
- Feed discovery to share :)☆41Updated 8 years ago
- Jabba's headless webkit browser for scraping AJAX-powered webpages.☆91Updated 10 years ago
- Demo of the Newspaper article extraction library.☆29Updated 10 years ago
- A queue-controlled browser automation tool for improving web crawl quality☆61Updated 4 months ago
- Word Graph utility built with NLTK and TextBlob☆18Updated 11 years ago
- A python library for the Tiny Tiny RSS web API☆56Updated 4 years ago
- This is a news bot which uses Superfeedr's API to send and receive RSS notifications.☆53Updated 8 years ago
- ☆13Updated 9 years ago
- Cross platform middleware for Social Networking Services: Twitter, Facebook, SinaWeibo, Renren, RSS, Email, Sqlite, ... (more coming)☆157Updated 3 years ago
- This is a telegram bot written in python. It uses the CLI of telegram by vysheng to connect. No longer developed. Checkout☆30Updated 10 years ago
- A pair of scripts to download videos and subtitles for the TED Talks (http://www.ted.com)☆42Updated 11 years ago
- Chambua is an open-source semantic tagging application that analyses text and extracts names of people, places (& geocodes them), organis…☆33Updated 3 years ago
- Bringing sanity to world of messed-up data☆66Updated 10 years ago
- Site Hound (previously THH) is a Domain Discovery Tool☆23Updated 4 years ago
- Blog crawler for the blogforever project.☆22Updated 11 years ago
- Save a bunch of web pages as a self-contained, compressed archive file for offline storage and sharing.☆35Updated 12 years ago
- Gevent Crawling in Python, with Utilities☆22Updated 10 years ago
- A Python utility for moving bookmarks/reading lists between services☆204Updated 9 years ago
- An eBook tool to extract the ISBN or metadata from eBooks and rename them using an ISBN database and that metadata☆30Updated 10 years ago
- Extract the differences between two HTML pages☆32Updated 7 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆65Updated 8 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- An online sentiment analyzer built with Flask and TextBlob☆15Updated 11 years ago
- A command-line interactive coursera-downloader.☆15Updated 7 years ago
- Collects multimedia content shared through social networks.☆19Updated 10 years ago