mawenbao / gofeedLinks

gofeed is disigned to extract full-text rss feeds from websites which only provide partial feeds or none

☆9

Alternatives and similar repositories for gofeed

Users that are interested in gofeed are comparing it to the libraries listed below

Sorting:

odie5533 / WarcMiddleware
WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.
☆47Updated 7 years ago
rmax / scrapy-boilerplate
Small set of utilities to simplify writing Scrapy spiders.
☆49Updated 9 years ago
openpreserve / pagelyzer
Suite of tools for detecting changes in web pages and their rendering
☆54Updated last year
openam / calibrephp
Calibre HTML and OPDS web server based on CakePHP
☆39Updated 9 years ago
fullscale / pypes
A component based data flow framework with a drag-n-drop Web 2.0 interface. Based on Stackless Python and inspired by Yahoo! Pipes.
☆150Updated 12 years ago
superfeedr / feediscovery
Feed discovery to share :)
☆41Updated 8 years ago
jabbalaci / Jabba-Webkit
Jabba's headless webkit browser for scraping AJAX-powered webpages.
☆91Updated 10 years ago
codelucas / newspaper-demo
Demo of the Newspaper article extraction library.
☆29Updated 10 years ago
internetarchive / umbra
A queue-controlled browser automation tool for improving web crawl quality
☆61Updated 4 months ago
tistaharahap / WordGraph
Word Graph utility built with NLTK and TextBlob
☆18Updated 11 years ago
Vassius / ttrss-python
A python library for the Tiny Tiny RSS web API
☆56Updated 4 years ago
superfeedr / news-bot
This is a news bot which uses Superfeedr's API to send and receive RSS notifications.
☆53Updated 8 years ago
VIDA-NYU / memex
☆13Updated 9 years ago
hupili / snsapi
Cross platform middleware for Social Networking Services: Twitter, Facebook, SinaWeibo, Renren, RSS, Email, Sqlite, ... (more coming)
☆157Updated 3 years ago
asdofindia / python-telegram-bot
This is a telegram bot written in python. It uses the CLI of telegram by vysheng to connect. No longer developed. Checkout
☆30Updated 10 years ago
joedicastro / ted-talks-download
A pair of scripts to download videos and subtitles for the TED Talks (http://www.ted.com)
☆42Updated 11 years ago
ushahidi / Chambua
Chambua is an open-source semantic tagging application that analyses text and extracts names of people, places (& geocodes them), organis…
☆33Updated 3 years ago
Alir3z4 / python-sanitize
Bringing sanity to world of messed-up data
☆66Updated 10 years ago
TeamHG-Memex / sitehound-frontend
Site Hound (previously THH) is a Domain Discovery Tool
☆23Updated 4 years ago
OlivierBlanvillain / crawler
Blog crawler for the blogforever project.
☆22Updated 11 years ago
iandennismiller / offline-pages
Save a bunch of web pages as a self-contained, compressed archive file for offline storage and sharing.
☆35Updated 12 years ago
seomoz / g-crawl-py
Gevent Crawling in Python, with Utilities
☆22Updated 10 years ago
codebox / reading-list-mover
A Python utility for moving bookmarks/reading lists between services
☆204Updated 9 years ago
EugenePig / ebook-isbn
An eBook tool to extract ISBN or Metadata form eBook and rename them by using ISBN database and Metadata
☆30Updated 10 years ago
TeamHG-Memex / extract-html-diff
extract difference between two html pages
☆32Updated 7 years ago
NAMD / pypln.backend
Pipeline for distributed Natural Language Processing, made in Python
☆65Updated 8 years ago
scrapinghub / page_finder
Find which links on a web page are pagination links
☆29Updated 8 years ago
sloria / textfeel-web
An online sentiment analyzer built with Flask and TextBlob
☆15Updated 11 years ago
yingchi / coursera-downloader
A command-line interactive coursera-downloader.
☆15Updated 7 years ago
socialsensor / storm-focused-crawler
Collects multimedia content shared through social networks.
☆19Updated 10 years ago