markusmobius / go-trafilaturaLinks
go-trafilatura is a Go port of the trafilatura Python library.
☆114Updated 3 months ago
Alternatives and similar repositories for go-trafilatura
Users that are interested in go-trafilatura are comparing it to the libraries listed below
Sorting:
- Go implementation of @qdrant/fastembed.☆98Updated last year
- Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Exper…☆122Updated 2 years ago
- Go client for txtai☆80Updated 2 weeks ago
- Go-DomDistiller is a Go port of the DOM Distiller library which implements Reader mode in Chrome for Android and Desktop. It has no depen…☆72Updated last year
- A lemmatizer implemented in Go☆91Updated 8 months ago
- Go module for fetching embeddings from embeddings providers☆55Updated 5 months ago
- Production grade LLM-ops in Golang☆57Updated this week
- Browser detection in Go (golang)☆89Updated last year
- Letters, or how to parse emails in Go☆86Updated this week
- Cybertron: the home planet of the Transformers in Go☆323Updated last year
- Go library for embedded vector search and semantic embeddings using llama.cpp☆516Updated 6 months ago
- Write Python in Go - The most intuitive Python wrapper for Golang☆41Updated last year
- structured outputs for llms☆187Updated 3 months ago
- Go package that cleans a HTML page for better readability.☆929Updated last month
- Go implementation of the SentencePiece tokenizer☆42Updated last month
- Headless browser for Go for TDD workflows☆236Updated last week
- Extremely Fast Full-Text-Search Algorithm and Caching System☆157Updated 2 years ago
- Llama 2 inference in one file of pure Go☆110Updated 2 years ago
- Generate OpenAPI 3.0 specifications from Go code.☆74Updated last year
- SQLite FTS5-based search engine for Hugo pages☆38Updated 6 months ago
- This is a Golang open-source module that makes it easy to access and parse data from Wikipedia (Wikipedia API wrapper)☆113Updated 7 months ago
- Go bindings for Tiktoken & HuggingFace Tokenizer☆196Updated last month
- TextRank implementation in Golang with extendable features (summarization, phrase extraction) and multithreading (goroutine).☆221Updated 6 months ago
- A reimplementation of https://github.com/otiai10/gosseract without CGo, running Tesseract compiled to WASM with Wazero☆151Updated 6 months ago
- Golang library to build sqlite extensions☆177Updated 11 months ago
- A high-performance Golang library for easily repairing invalid JSON documents. Designed to fix common JSON issues and optimize JSON conte…☆73Updated 3 weeks ago
- NLP tokenizers written in Go language☆304Updated last month
- Complete rewrite of go-docx: Production-grade Word document creation with domain-driven architecture, full OOXML compliance, and comprehe…☆70Updated this week
- A high effective golang library for parsing big-sized sitemaps and avoiding high memory usage. The sitemap parser was written on golang w…☆39Updated 2 years ago
- Go library for creating EPUB files☆85Updated last week