karussell/snacktory

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/karussell/snacktory)

karussell / snacktory

Readability clone in Java

☆462

Alternatives and similar repositories for snacktory

Users that are interested in snacktory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

srijiths / readabilityBUNDLE
View on GitHub
A bundle of html content extraction algorithms
☆121Mar 27, 2015Updated 11 years ago
GravityLabs / goose
View on GitHub
Html Content / Article Extractor in Scala - open sourced from Gravity Labs
☆1,527Apr 18, 2017Updated 9 years ago
kohlschutter / boilerpipe
View on GitHub
Work in progress transmit from Google Code
☆1,126Jan 3, 2018Updated 8 years ago
luin / readability
View on GitHub
📚 Turn any web page into a clean view
☆2,521Apr 3, 2021Updated 5 years ago
jiminoc / goose
View on GitHub
Html Content / Article Extractor in Scala - open sourced from Gravity Labs - http://gravity.com
☆341Aug 20, 2019Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
robbypond / boilerpipe
View on GitHub
boilerpipe 1.2.2 - a fork from 1.2.0 with additional features
☆10Nov 2, 2016Updated 9 years ago
Netbreeze-GmbH / boilerpipe
View on GitHub
boilerpipe 1.2.2 - a fork from 1.2.0 with additional features
☆43Jun 6, 2017Updated 9 years ago
rclayton / StringSimilarity
View on GitHub
A number of algorithms for calculating string similarity in Java
☆15Jan 23, 2011Updated 15 years ago
larsmans / lucene-stanford-lemmatizer
View on GitHub
A library that adds some NLP capabilities to the Lucene search engine
☆50Jul 16, 2013Updated 13 years ago
pvdlg / boilerpipe
View on GitHub
Repackaging of Boilerpipe published on Maven Central Repository.
☆54Dec 17, 2023Updated 2 years ago
ceteri / textrank
View on GitHub
Java implementation of the TextRank algorithm by Mihalcea, et al.
☆74Feb 27, 2021Updated 5 years ago
kxtells / vague-places
View on GitHub
☆14Dec 24, 2016Updated 9 years ago
dragnet-org / dragnet
View on GitHub
Just the facts -- web page content extraction
☆1,274Jul 8, 2025Updated last year
JakeWharton / WritingAgileAPKs
View on GitHub
AnDevCon III Presentation: Writing Agile APKs
☆26May 15, 2012Updated 14 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
nakfoury / TwInfluence
View on GitHub
Generates visualizations of influential tweets about a given hashtag.
☆11Jun 1, 2017Updated 9 years ago
stanzhai / Html2Article
View on GitHub
Html网页正文提取
☆496May 9, 2022Updated 4 years ago
outware / caveman
View on GitHub
Companion application used for dynamically managing the environment variables for a target application.
☆16Mar 9, 2016Updated 10 years ago
mozilla / readability
View on GitHub
A standalone version of the readability lib
☆11,345Jul 9, 2026Updated last week
crawler-commons / crawler-commons
View on GitHub
A set of reusable Java components that implement functionality common to any web crawler
☆259Jul 2, 2026Updated 2 weeks ago
socialsensor / multimedia-geotagging
View on GitHub
Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It …
☆15Oct 15, 2016Updated 9 years ago
twitter-archive / twitter-text-java
View on GitHub
A Java implementation of Twitter's text processing library
☆364Dec 13, 2014Updated 11 years ago
socialsensor / storm-focused-crawler
View on GitHub
Collects multimedia content shared through social networks.
☆19Feb 18, 2015Updated 11 years ago
softwarenerd / TSNPeerBluetooth
View on GitHub
Bluetooth LE peer-to-peer library for iOS
☆14May 20, 2015Updated 11 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Hendekagon / cljs-web-audio
View on GitHub
A Clojurescript interface to the Web Audio API, intended for sonification
☆12Jul 22, 2020Updated 5 years ago
swipely / pipely
View on GitHub
Visualize pipeline definitions for AWS Data Pipeline
☆23Feb 3, 2026Updated 5 months ago
dreamzmaster / gulp-pa11y
View on GitHub
Audit accessibility of your site using Gulp
☆11Nov 19, 2015Updated 10 years ago
ushahidi / grimlock
View on GitHub
A simple transformation/data processing pipeline for CrisisNET
☆15Oct 2, 2014Updated 11 years ago
dice-group / n3-collection
View on GitHub
N3 - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format
☆71Dec 2, 2017Updated 8 years ago
SilentCircle / libzina
View on GitHub
ZINA is a secure messaging protocol for mobile devices
☆23Mar 5, 2018Updated 8 years ago
socialsensor / topic-detection
View on GitHub
A set of methods for automatically detecting trending topics in streams of short texts (e.g. tweets).
☆53Nov 26, 2014Updated 11 years ago
TV4 / chronometro
View on GitHub
Annotation triggered library that helps tracking loading times of your app
☆44Feb 1, 2016Updated 10 years ago
Sotera / Datawake
View on GitHub
Browser add-on and web server to support collection and analysis of web browsing data.
☆14Mar 9, 2016Updated 10 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
jashmenn / cascading-simhash
View on GitHub
simple simhashing in hadoop with cascading
☆33May 9, 2011Updated 15 years ago
codelucas / newspaper
View on GitHub
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
☆15,114Updated this week
grangier / python-goose
View on GitHub
Html Content / Article Extractor, web scrapping lib in Python
☆4,100Mar 10, 2026Updated 4 months ago
sugoi-wada / appversionchecker
View on GitHub
[DEPRICATED] An Android library that checks for your application's updates on Google Play Store.
☆31Nov 15, 2018Updated 7 years ago
BartoszJarocki / android-boilerpipe
View on GitHub
Boilerplate Removal and Fulltext Extraction from HTML pages for Android
☆18Jun 17, 2014Updated 12 years ago
ChatSecure / RubDub
View on GitHub
A Node XMPP Push Service for XEP-0357: Push Notifications
☆16Oct 2, 2019Updated 6 years ago
kingwkb / readability
View on GitHub
a python readability
☆277Jun 22, 2017Updated 9 years ago