cixtor / readability
Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Experiment, it is now incorporated into Safari’s Reader View.
☆120 · Updated 2 years ago
Alternatives and similar repositories for readability:
Users who are interested in readability compare it to the libraries listed below:
- Go-DomDistiller is a Go port of the DOM Distiller library which implements Reader mode in Chrome for Android and Desktop. It has no depen… ☆67 · Updated 6 months ago
- go-trafilatura is a Go port of the trafilatura Python library. ☆55 · Updated 4 months ago
- A reimplementation of https://github.com/otiai10/gosseract without CGo, running Tesseract compiled to WASM with Wazero ☆146 · Updated last year
- Go sqlite3 vfs ☆44 · Updated last year
- A simple perceptual hash library in pure Go. ☆55 · Updated last year
- Browser detection in Go (golang) ☆87 · Updated 10 months ago
- Go parser for human-readable dates, ported from the dateparser Python package ☆59 · Updated 8 months ago
- Go module for fetching embeddings from embeddings providers ☆49 · Updated 3 months ago
- Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawle… ☆106 · Updated last year
- A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split words, sentences and graphemes. ☆54 · Updated 6 months ago
- A lemmatizer implemented in Go ☆86 · Updated 2 years ago
- ☆107 · Updated 4 months ago
- A Go implementation of the Thumbhash image placeholder generation algorithm. ☆82 · Updated 8 months ago
- Go client for txtai ☆76 · Updated 2 weeks ago
- This is a Golang open-source module that makes it easy to access and parse data from Wikipedia (Wikipedia API wrapper) ☆99 · Updated last year
- Go HTML Info package for extracting meaningful information from html page ☆35 · Updated 2 years ago
- Image Metadata (Exif and XMP) extraction for JPEG, HEIC, AVIF, TIFF and Camera Raw in golang. Focus is on providing features and improved… ☆125 · Updated 3 months ago
- Read and write WARC files in Go ☆45 · Updated 6 years ago
- Go Client for the Unsplash API ☆76 · Updated last year
- Go cascadia package command line CSS selector ☆140 · Updated last year
- Webpage summary extractor using Facebook Open Graph and arc90's readability ☆69 · Updated 5 years ago
- A port of dinero.js to Go ☆68 · Updated last year
- Convert Golang structs to Typescript interfaces ☆70 · Updated last year
- Go library for creating EPUB files ☆65 · Updated 2 months ago
- µDiff - a micro Go diffing library ☆176 · Updated last month
- Golang library to build sqlite extensions ☆168 · Updated last month
- SQLite over stdin/stdout ☆85 · Updated 2 weeks ago
- A Golang library designed to simplify the execution of shell commands and handle their output. ☆45 · Updated 4 months ago
- XML Tokenizer is a low-memory high performance non-namespace parser library for parsing simple XML 1.0. ☆42 · Updated 3 weeks ago
- Persist a Go object to a JSON file ☆99 · Updated 10 months ago