markusmobius / go-trafilaturaLinks
go-trafilatura is a Go port of the trafilatura Python library.
☆118Updated 4 months ago
Alternatives and similar repositories for go-trafilatura
Users that are interested in go-trafilatura are comparing it to the libraries listed below
Sorting:
- Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Exper…☆122Updated 2 years ago
- Go implementation of @qdrant/fastembed.☆101Updated last year
- Go-DomDistiller is a Go port of the DOM Distiller library which implements Reader mode in Chrome for Android and Desktop. It has no depen…☆74Updated last year
- Letters, or how to parse emails in Go☆88Updated last week
- structured outputs for llms☆189Updated 3 months ago
- Go client for txtai☆81Updated last week
- A lemmatizer implemented in Go☆92Updated 8 months ago
- Write Python in Go - The most intuitive Python wrapper for Golang☆41Updated last year
- Go, Wasm bindings for HF Tokenizers and Tiktoken☆205Updated last week
- Go module for fetching embeddings from embeddings providers☆55Updated 6 months ago
- Go implementation of the SentencePiece tokenizer☆44Updated last month
- Production grade LLM-ops in Golang☆58Updated 2 weeks ago
- Go library for embedded vector search and semantic embeddings using llama.cpp☆522Updated 7 months ago
- Browser detection in Go (golang)☆89Updated last year
- This is a Golang open-source module that makes it easy to access and parse data from Wikipedia (Wikipedia API wrapper)☆113Updated 8 months ago
- go parser for human readable dates ported from the dateparser python package☆63Updated 9 months ago
- Go package that cleans a HTML page for better readability.☆931Updated last month
- A Go package that implements the JusText boilerplate removal algorithm☆110Updated 3 years ago
- SQLite FTS5-based search engine for Hugo pages☆38Updated 7 months ago
- A high-performance Golang library for easily repairing invalid JSON documents. Designed to fix common JSON issues and optimize JSON conte…☆80Updated last week
- Generate OpenAPI 3.0 specifications from Go code.☆74Updated last year
- go native port of annoy. Approximate Nearest Neighbors in optimized for memory usage and loading/saving to disk.☆19Updated last year
- Headless browser for Go for TDD workflows☆244Updated last week
- TextRank implementation in Golang with extendable features (summarization, phrase extraction) and multithreading (goroutine).☆222Updated 7 months ago
- NLP transformers written in Go☆254Updated 2 years ago
- Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawle…☆111Updated 2 years ago
- Golang library to build sqlite extensions☆178Updated 11 months ago
- A reimplementation of https://github.com/otiai10/gosseract without CGo, running Tesseract compiled to WASM with Wazero☆154Updated 7 months ago
- NLP tokenizers written in Go language☆305Updated 2 months ago
- Complete rewrite of go-docx: Production-grade Word document creation with domain-driven architecture, full OOXML compliance, and comprehe…☆76Updated last week