cixtor / readability
Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Experiment, it is now incorporated into Safari’s Reader View.
☆119Updated last year
Alternatives and similar repositories for readability:
Users that are interested in readability are comparing it to the libraries listed below
- Go-DomDistiller is a Go port of the DOM Distiller library which implements Reader mode in Chrome for Android and Desktop. It has no depen…☆66Updated 4 months ago
- A Go implementation of the Thumbhash image placeholder generation algorithm.☆79Updated 7 months ago
- Browser detection in Go (golang)☆87Updated 8 months ago
- A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split words, sentences and graphemes.☆54Updated 5 months ago
- Read and write WARC files in Go☆44Updated 6 years ago
- Go HTML Info package for extracting meaningful information from html page☆35Updated 2 years ago
- A Golang library designed to simplify the execution of shell commands and handle their output.☆45Updated 3 months ago
- Go implementation of the SentencePiece tokenizer☆24Updated 5 months ago
- A reimplementation of https://github.com/otiai10/gosseract without CGo, running Tesseract compiled to WASM with Wazero☆146Updated last year
- XML Tokenizer is a low-memory high performance non-namespace parser library for parsing simple XML 1.0.☆41Updated 2 weeks ago
- Go package and CLI tool for saving web page as single HTML file☆274Updated last month
- go-trafilatura is a Go port of the trafilatura Python library.☆54Updated 3 months ago
- SQLite FTS5-based search engine for Hugo pages☆35Updated last month
- This is a Golang open-source module that makes it easy to access and parse data from Wikipedia (Wikipedia API wrapper)☆94Updated last year
- Persist a Go object to a JSON file☆95Updated 9 months ago
- Image Metadata (Exif and XMP) extraction for JPEG, HEIC, AVIF, TIFF and Camera Raw in golang. Focus is on providing features and improved…☆124Updated 2 months ago
- A lemmatizer implemented in Go☆86Updated 2 years ago
- A gnorm solution for generating database/sql wrapper for postgres☆25Updated 6 years ago
- Fake English word generator for Go and CLI☆40Updated 3 years ago
- A simple perceptual hash library in pure Go.☆53Updated last year
- versatile stream IO and RPC based IPC stack for Go☆41Updated last year
- Faster utf8.Valid using multi-byte processing without SIMD.☆81Updated 10 months ago
- Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawle…☆105Updated last year
- Newser is a simple utility to generate a pdf with you favorite news articles☆86Updated 5 months ago
- 🛡 Linter for Go that checks static call arguments against the function guards (aka contracts).☆25Updated last year
- Calling C functions from Go with minimal overhead☆12Updated 2 months ago
- A persistent rope in Go☆83Updated 2 years ago
- Structured HTML table data extraction from URLs in Go that has almost no external dependencies☆120Updated last month
- grobotstxt is a native Go port of Google's robots.txt parser and matcher library.☆108Updated 2 years ago
- A soothing face filter where you can appreciate the beauty but not fully identify the person.☆27Updated last year