markusmobius / go-trafilatura
go-trafilatura is a Go port of the trafilatura Python library.
☆45Updated last week
Related projects ⓘ
Alternatives and complementary repositories for go-trafilatura
- Go-DomDistiller is a Go port of the DOM Distiller library which implements Reader mode in Chrome for Android and Desktop. It has no depen…☆60Updated last month
- A lemmatizer implemented in Go☆84Updated last year
- Unofficial (Golang) Go bindings for the Hugging Face Inference API☆61Updated this week
- Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Exper…☆118Updated last year
- Go bindings for HuggingFace Tokenizer☆88Updated this week
- A Go package that implements the JusText boilerplate removal algorithm☆102Updated 2 years ago
- Go client for txtai☆72Updated last month
- Support for reading and writing PDF files in Go.☆30Updated 2 weeks ago
- Go module for fetching embeddings from embeddings providers☆42Updated 5 months ago
- Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawle…☆105Updated last year
- Go implementation of @qdrant/fastembed.☆49Updated 5 months ago
- NLP tokenizers written in Go language☆179Updated 5 months ago
- Natural Language Processing Toolkit in Golang☆63Updated 4 years ago
- A high effective golang library for parsing big-sized sitemaps and avoiding high memory usage. The sitemap parser was written on golang w…☆36Updated last year
- Go implementation of today's most used tokenizers☆39Updated 3 years ago
- Go library for creating EPUB files☆45Updated last week
- NLP transformers written in Go☆206Updated last year
- Mistral API Client in Golang☆64Updated 5 months ago
- A lightweight buffered event lib☆55Updated 2 years ago
- Face recognition in Go using MTCNN and QMagFace☆31Updated last year
- Article spinning and spintax/spinning syntax engine written in Go, useful for A/B, testing pieces of text/articles and creating more natu…☆59Updated 3 years ago
- Generate OpenAPI 3.0 specifications from Go code.☆56Updated 2 months ago
- TextRank implementation in Golang with extendable features (summarization, phrase extraction) and multithreading (goroutine).☆205Updated 3 years ago
- A high-performance Golang library for easily repairing invalid JSON documents. Designed to fix common JSON issues and optimize JSON conte…☆11Updated 4 months ago
- pgvector support for Go☆186Updated 3 weeks ago
- boxes and glue made easy - a PDF rendering library for Go using boxes and glue☆77Updated 3 weeks ago
- Easy to use PDF library using Go and PDFium☆193Updated last month
- efficient string matching in Golang via the aho-corasick algorithm.☆67Updated 7 months ago
- Image similarity in Golang. Version 4 (LATEST)☆84Updated 7 months ago
- Read and use word2vec vectors in Go☆56Updated 6 years ago