markusmobius / go-trafilatura
go-trafilatura is a Go port of the trafilatura Python library.
☆113 Updated 2 months ago
Alternatives and similar repositories for go-trafilatura
Users who are interested in go-trafilatura are comparing it to the libraries listed below.
- Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Exper… ☆122 Updated 2 years ago
- Go-DomDistiller is a Go port of the DOM Distiller library which implements Reader mode in Chrome for Android and Desktop. It has no depen… ☆71 Updated last year
- Go implementation of @qdrant/fastembed. ☆94 Updated last year
- Go client for txtai ☆80 Updated 3 weeks ago
- structured outputs for llms ☆183 Updated 2 months ago
- Go library for embedded vector search and semantic embeddings using llama.cpp ☆514 Updated 5 months ago
- Production grade LLM-ops in Golang ☆57 Updated last week
- A high-performance Golang library for easily repairing invalid JSON documents. Designed to fix common JSON issues and optimize JSON conte… ☆70 Updated this week
- A lemmatizer implemented in Go ☆91 Updated 7 months ago
- Go bindings for Tiktoken & HuggingFace Tokenizer ☆192 Updated last month
- Go package that cleans a HTML page for better readability. ☆922 Updated 2 weeks ago
- Go module for fetching embeddings from embeddings providers ☆55 Updated 5 months ago
- This is a Golang open-source module that makes it easy to access and parse data from Wikipedia (Wikipedia API wrapper) ☆113 Updated 7 months ago
- Write Python in Go - The most intuitive Python wrapper for Golang ☆41 Updated last year
- Cybertron: the home planet of the Transformers in Go ☆321 Updated last year
- TextRank implementation in Golang with extendable features (summarization, phrase extraction) and multithreading (goroutine). ☆220 Updated 6 months ago
- Go implementation of the SentencePiece tokenizer ☆39 Updated last week
- Headless browser for Go for TDD workflows ☆236 Updated this week
- Llama 2 inference in one file of pure Go ☆109 Updated 2 years ago
- Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawle… ☆111 Updated 2 years ago
- NLP transformers written in Go ☆252 Updated 2 years ago
- In-memory vector index for Go ☆194 Updated 4 months ago
- Browser detection in Go (golang) ☆89 Updated last year
- go parser for human readable dates ported from the dateparser python package ☆64 Updated 7 months ago
- A Go implementation of the Thumbhash image placeholder generation algorithm. ☆123 Updated last month
- Extremely Fast Full-Text-Search Algorithm and Caching System ☆157 Updated 2 years ago
- NLP tokenizers written in Go language ☆299 Updated 3 weeks ago
- SQLite FTS5-based search engine for Hugo pages ☆38 Updated 6 months ago
- Letters, or how to parse emails in Go ☆86 Updated last week
- A go client and cli for the openai APIs, focused on developer friendliness and convenience atop the basic building blocks for the OpenAI … ☆65 Updated last year