Html Content / Article Extractor in Golang
☆450Aug 3, 2025Updated 7 months ago
Alternatives and similar repositories for GoOse
Users that are interested in GoOse are comparing it to the libraries listed below
Sorting:
- An implementation of the Goose HTML Content / Article Extractor algorithm in golang☆40Apr 20, 2021Updated 4 years ago
- News Content / Article Extractor written in Go☆32Mar 13, 2015Updated 11 years ago
- A Go implementation of the readability algorithm by arc90 labs☆135May 19, 2022Updated 3 years ago
- Summarizes text☆39Sep 16, 2015Updated 10 years ago
- Go package that cleans a HTML page for better readability.☆937Dec 5, 2025Updated 3 months ago
- Golang Natural Language Processing☆835Mar 16, 2023Updated 3 years ago
- Parse RSS, Atom and JSON feeds in Go☆2,820May 27, 2025Updated 9 months ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014)☆26Dec 13, 2017Updated 8 years ago
- Golang package to extract useful text from a HTML document☆40Mar 6, 2026Updated 2 weeks ago
- Reusable Golang library to provide readability scores☆22Jul 7, 2021Updated 4 years ago
- Go library for performing computations in word2vec binary models☆203Jun 27, 2022Updated 3 years ago
- A multilingual command line sentence tokenizer in Golang☆465Feb 28, 2024Updated 2 years ago
- All-in-one text tokenizer for Go. Super-fast. Lots of features.☆13Dec 18, 2015Updated 10 years ago
- A simple and flexible web crawler that follows the robots.txt policies and crawl delays.☆790May 19, 2021Updated 4 years ago
- Webpage summary extractor using Facebook Open Graph and arc90's readability☆68Apr 22, 2019Updated 6 years ago
- A Golang library for text processing, including tokenization, part-of-speech tagging, and named-entity extraction.☆3,069May 2, 2023Updated 2 years ago
- A go library of word2vec☆27Jul 16, 2021Updated 4 years ago
- HyperLogLog++ for Go☆43Apr 10, 2018Updated 7 years ago
- A simple library for loading word2vec binary model.☆12Sep 17, 2015Updated 10 years ago
- ipcipher implementation in Go☆17Dec 19, 2023Updated 2 years ago
- ☆28Apr 18, 2017Updated 8 years ago
- Least Squares Regression of CSV in Golang☆11May 18, 2015Updated 10 years ago
- tiny Go library to normalize URLs☆485Mar 19, 2024Updated 2 years ago
- A little like that j-thing, only in Go.☆14,921Mar 15, 2026Updated last week
- Html article content extractor in Golang.☆12Oct 31, 2022Updated 3 years ago
- Named Entity Recognition for golang via MITIE☆35Oct 25, 2018Updated 7 years ago
- WebLoop: Scriptable, headless WebKit with a Go API. Like PhantomJS, but for Go.☆1,366Apr 1, 2024Updated last year
- Word Embeddings in Go!☆504Apr 2, 2023Updated 2 years ago
- Fast identification of character sequences in text or documents (multi-lingual)☆18Feb 9, 2016Updated 10 years ago
- Polite, slim and concurrent web crawler.☆2,052May 19, 2021Updated 4 years ago
- Golang Keyword extraction/replacement Datastructure using Tries instead of regexes☆89Dec 15, 2017Updated 8 years ago
- Text summarizer for golang using LexRank☆137Oct 3, 2025Updated 5 months ago
- Counter Data structure for Golang using CountMin Sketch with a fixed amount of memory☆46Jan 4, 2018Updated 8 years ago
- word2vec in go lang☆71Oct 27, 2013Updated 12 years ago
- TMFRAME, pronounced "time frame", is a binary standard for compactly encoding time series data☆27Aug 1, 2018Updated 7 years ago
- [UNMANTEINED] Extract values from strings and fill your structs with nlp.☆388Sep 18, 2017Updated 8 years ago
- go-readability☆17Jul 11, 2014Updated 11 years ago
- Spell checking and fuzzy search suggestion written in Go☆390Oct 21, 2021Updated 4 years ago
- This is an implementation of the LexVec word embedding model (similar to word2vec and GloVe) that achieves state of the art results in mu…☆801Dec 30, 2020Updated 5 years ago