markusmobius / go-trafilaturaLinks
go-trafilatura is a Go port of the trafilatura Python library.
☆104Updated last month
Alternatives and similar repositories for go-trafilatura
Users that are interested in go-trafilatura are comparing it to the libraries listed below
Sorting:
- Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Exper…☆121Updated 2 years ago
- Go-DomDistiller is a Go port of the DOM Distiller library which implements Reader mode in Chrome for Android and Desktop. It has no depen…☆71Updated last year
- Go implementation of @qdrant/fastembed.☆90Updated last year
- Go implementation of the SentencePiece tokenizer☆37Updated last year
- Go client for txtai☆80Updated this week
- Production grade LLM-ops in Golang☆57Updated 2 weeks ago
- A lemmatizer implemented in Go☆90Updated 5 months ago
- Letters, or how to parse emails in Go☆86Updated 3 weeks ago
- Browser detection in Go (golang)☆89Updated last year
- structured outputs for llms☆178Updated 3 weeks ago
- A html/template-based hypertext preprocessor and rapid application development web server written in Go.☆88Updated 3 weeks ago
- Go package that cleans a HTML page for better readability.☆908Updated 6 months ago
- Go module for fetching embeddings from embeddings providers☆55Updated 3 months ago
- A high-performance Golang library for easily repairing invalid JSON documents. Designed to fix common JSON issues and optimize JSON conte…☆60Updated 3 weeks ago
- Headless browser for Go for TDD workflows☆226Updated 2 weeks ago
- SQLite FTS5-based search engine for Hugo pages☆36Updated 4 months ago
- A reimplementation of https://github.com/otiai10/gosseract without CGo, running Tesseract compiled to WASM with Wazero☆149Updated 4 months ago
- Generate OpenAPI 3.0 specifications from Go code.☆71Updated last year
- A Go implementation of the Thumbhash image placeholder generation algorithm.☆118Updated last week
- Write Python in Go - The most intuitive Python wrapper for Golang☆40Updated 11 months ago
- TextRank implementation in Golang with extendable features (summarization, phrase extraction) and multithreading (goroutine).☆218Updated 4 months ago
- Go bindings for Tiktoken & HuggingFace Tokenizer☆183Updated last month
- Extremely Fast Full-Text-Search Algorithm and Caching System☆157Updated 2 years ago
- Llama 2 inference in one file of pure Go☆107Updated 2 years ago
- A go client and cli for the openai APIs, focused on developer friendliness and convenience atop the basic building blocks for the OpenAI …☆64Updated last year
- Go Based Lightweight RAG / LLM Tool with CLI + API☆14Updated 2 years ago
- Golang library to build sqlite extensions☆174Updated 9 months ago
- This is a Golang open-source module that makes it easy to access and parse data from Wikipedia (Wikipedia API wrapper)☆111Updated 5 months ago
- Cybertron: the home planet of the Transformers in Go☆319Updated last year
- Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawle…☆111Updated 2 years ago