bookieio/breadability

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bookieio/breadability)

bookieio / breadability

Reworked https://www.readability.com/ parsing library (now https://mercury.postlight.com/ is living alternative)

☆205

Alternatives and similar repositories for breadability

Users that are interested in breadability are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

buriy / python-readability
View on GitHub
fast python port of arc90's readability tool, updated to match latest readability.js!
☆2,894Jan 26, 2026Updated 5 months ago
PomanoB / lsse
View on GitHub
Serelex - lexico-semantic search engine
☆19Mar 19, 2017Updated 9 years ago
timbertson / python-readability
View on GitHub
[abandoned] python port of arc90's readability bookmarklet
☆542Jun 16, 2011Updated 15 years ago
reorx / readability
View on GitHub
html main body extractor
☆17Jul 15, 2015Updated 11 years ago
scrapinghub / product-extraction-benchmark
View on GitHub
☆16Apr 10, 2026Updated 3 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
rodricios / eatiht
View on GitHub
An exercise in unsupervised machine learning: Extract Article's Text in HTml documents.
☆430Jan 16, 2026Updated 6 months ago
changhiskhan / poseidon
View on GitHub
Python CLI for Digital Ocean API v2
☆67Mar 4, 2018Updated 8 years ago
ReadabilityHoldings / python-readability-api
View on GitHub
Python wrapper for the Readability API.
☆132Sep 8, 2021Updated 4 years ago
grangier / python-goose
View on GitHub
Html Content / Article Extractor, web scrapping lib in Python
☆4,100Mar 10, 2026Updated 4 months ago
srid / readability
View on GitHub
[unmaintained] Python version of arc90's *older* readability.js
☆47Oct 30, 2011Updated 14 years ago
srijiths / readabilityBUNDLE
View on GitHub
A bundle of html content extraction algorithms
☆121Mar 27, 2015Updated 11 years ago
dcramer / decruft
View on GitHub
python-readability, but faster (mirror-ish)
☆82Jan 24, 2012Updated 14 years ago
kohlschutter / boilerpipe
View on GitHub
Work in progress transmit from Google Code
☆1,126Jan 3, 2018Updated 8 years ago
pschwede / AnchorBot
View on GitHub
The more often you click a word in the headlines, the more interesting are your news.
☆13Mar 27, 2017Updated 9 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
harshavardhana / boilerpipy
View on GitHub
Readability/Boilerpipe extraction in Python
☆55May 6, 2016Updated 10 years ago
dragnet-org / dragnet
View on GitHub
Just the facts -- web page content extraction
☆1,274Jul 8, 2025Updated last year
domclick / pggraph
View on GitHub
Утилита для работы с зависимостями таблиц в PostgreSQL
☆10Aug 15, 2024Updated last year
goose3 / goose3
View on GitHub
A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html
☆912Updated this week
saippuakauppias / django-simple-open-graph
View on GitHub
Django package for simplicity embed open graph (og:) layout in templates for different objects
☆21Oct 8, 2013Updated 12 years ago
oxyum / django-payment-webmoney
View on GitHub
WebMoney Merchant Interface support for Django.
☆23Apr 28, 2014Updated 12 years ago
pydepta / pydepta
View on GitHub
A python implementation of DEPTA
☆84Jan 14, 2017Updated 9 years ago
SlapBot / gpt-2-demo
View on GitHub
Demonstration of gpt-2 model with flask+uwsgi+nginx in web environment containerized in docker for quick deployment.
☆13Mar 24, 2023Updated 3 years ago
miso-belica / sumy
View on GitHub
Module for automatic summarization of text documents and HTML pages.
☆3,696Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ekianjo / GetGoogleBooks
View on GitHub
A Python application to download Google Books and convert them in PDF in a given folder.
☆10Oct 21, 2013Updated 12 years ago
blha303 / DO-runin
View on GitHub
A tool that starts a DigitalOcean droplet in a given region and runs a given command, displaying the output.
☆25Jun 1, 2016Updated 10 years ago
misja / python-boilerpipe
View on GitHub
Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages
☆542Jul 17, 2021Updated 5 years ago
alehandrof / xander-taskpaper-styles
View on GitHub
Styles for TaskPaper 3
☆12Jan 27, 2018Updated 8 years ago
mozilla / readability
View on GitHub
A standalone version of the readability lib
☆11,355Jul 9, 2026Updated 2 weeks ago
jenskutilek / Glyphs-Scripts
View on GitHub
Scripts for Glyphs.app
☆14Jun 4, 2026Updated last month
jiyfeng / dclm
View on GitHub
Document context language models
☆21Nov 13, 2015Updated 10 years ago
BLE-LTER / Zotero-JavaScript-Search-Client
View on GitHub
Example HTML, CSS, and JavaScript for searching for items within a public Zotero user or group library
☆10Nov 11, 2022Updated 3 years ago
seanbrant / django-filch
View on GitHub
Custom model fields that make de-normalizing data in Django easier
☆23Nov 19, 2010Updated 15 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
dansheffler / zettelkasten-wiki
View on GitHub
An Atom package for creating a zettelkasten style wiki. Should be used with my Academic-Markdown syntax file
☆12Jun 3, 2021Updated 5 years ago
telekommunisten / c30s.org
View on GitHub
stuff which will eventually go public on our page
☆12Aug 5, 2019Updated 6 years ago
JNRowe / pyisbn
View on GitHub
A Python module for working with 10- and 13-digit ISBNs
☆42Jul 15, 2026Updated last week
vgarvardt / django-loginza
View on GitHub
Django application for Loginza service
☆39Oct 2, 2014Updated 11 years ago
ptwobrussell / python-boilerpipe
View on GitHub
Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages
☆32Sep 2, 2016Updated 9 years ago
core-api / python-jsonhyperschema-codec
View on GitHub
A JSON Hyperschema codec for Core API.
☆17Feb 27, 2018Updated 8 years ago
scrapy / scrapely
View on GitHub
A pure-python HTML screen-scraping library
☆1,884Apr 4, 2022Updated 4 years ago