jiminoc / gooseView external linksLinks
Html Content / Article Extractor in Scala - open sourced from Gravity Labs - http://gravity.com
☆343Aug 20, 2019Updated 6 years ago
Alternatives and similar repositories for goose
Users that are interested in goose are comparing it to the libraries listed below
Sorting:
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs☆1,531Apr 18, 2017Updated 8 years ago
- A port of the arclabs 'readability' package to Java☆72Sep 10, 2012Updated 13 years ago
- Work in progress transmit from Google Code☆1,127Jan 3, 2018Updated 8 years ago
- An exercise in unsupervised machine learning: Extract Article's Text in HTml documents.☆432Jan 16, 2026Updated 3 weeks ago
- Readability clone in Java☆461Oct 13, 2020Updated 5 years ago
- A Prudence-based web services API for the Goose HTML content extraction library☆38Jul 17, 2011Updated 14 years ago
- Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages☆542Jul 17, 2021Updated 4 years ago
- Pure Scheme Gopher Server☆11Jan 21, 2012Updated 14 years ago
- a type level lisp interpreter on Rust's type system☆10Nov 11, 2016Updated 9 years ago
- Web scraper for Scala☆37Mar 11, 2013Updated 12 years ago
- My Emacs config☆18Feb 6, 2026Updated last week
- Repackaging of Boilerpipe published on Maven Central Repository.☆53Dec 17, 2023Updated 2 years ago
- ☆18Jun 24, 2017Updated 8 years ago
- An experiment using Play2/Slick/SecureSocial together☆31Oct 29, 2014Updated 11 years ago
- Twitter streaming with Akka☆12Oct 25, 2015Updated 10 years ago
- Html Content / Article Extractor, web scrapping lib in Python☆4,061Dec 26, 2021Updated 4 years ago
- Scrapes a remote page and creates a summary with statistics☆37Aug 24, 2014Updated 11 years ago
- A chess program written in Scala☆19Jul 21, 2020Updated 5 years ago
- play-webrtc☆15Oct 10, 2014Updated 11 years ago
- ☆21May 31, 2018Updated 7 years ago
- Open Source Python SDK for AI Agents Identity☆33Jan 20, 2026Updated 3 weeks ago
- Wikipedia Live Monitor☆22Dec 21, 2024Updated last year
- Heuristic based boilerplate removal tool☆811Feb 25, 2025Updated 11 months ago
- Since the original was abandoned to start a web service, I'm now going to attempt to maintain the JS+CSS portion☆167Sep 22, 2017Updated 8 years ago
- Seed app for Play, Heroku and Postgres with demo CRUD model, view, controller☆24Nov 16, 2014Updated 11 years ago
- ☆24Oct 10, 2017Updated 8 years ago
- The Berkeley Entity Resolution System jointly solves the problems of named entity recognition, coreference resolution, and entity linking…☆186Dec 7, 2019Updated 6 years ago
- Solution to Data Analytics Essentials course by Cisco☆13Dec 26, 2023Updated 2 years ago
- PlayAuthenticate Mongo Sample☆25Dec 3, 2014Updated 11 years ago
- Extract data from websites using basic statistical magic☆505Oct 2, 2020Updated 5 years ago
- Just the facts -- web page content extraction☆1,280Jul 8, 2025Updated 7 months ago
- Android application which provides product views based upon users' recommendations.☆10Aug 20, 2015Updated 10 years ago
- Android Live Indian & World wide TV app.☆13Mar 25, 2020Updated 5 years ago
- Data, analytic code, and findings supporting BuzzFeed News's analysis of diversity in the dialogue of Best Picture–nominated films☆10Jun 21, 2022Updated 3 years ago
- akka http gremlin 3 websocket connector☆31Nov 13, 2018Updated 7 years ago
- Common Crawl fork of Apache Nutch☆40Updated this week
- [not maintained] Custom Twitter Search via ElasticSearch&Wicket☆60Oct 13, 2020Updated 5 years ago
- ☆10Sep 17, 2023Updated 2 years ago
- TOTP (Time-based One-Time Password) authentication for Django REST Framework.☆13Feb 5, 2026Updated last week