markusmobius / go-trafilatura
go-trafilatura is a Go port of the trafilatura Python library.
☆47Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for go-trafilatura
- Go-DomDistiller is a Go port of the DOM Distiller library which implements Reader mode in Chrome for Android and Desktop. It has no depen…☆62Updated last month
- Readability is a library written in Go (golang) to parse, analyze and convert HTML pages into readable content. Originally an Arc90 Exper…☆118Updated last year
- A lemmatizer implemented in Go☆84Updated 2 years ago
- A Go package that implements the JusText boilerplate removal algorithm☆102Updated 2 years ago
- TextRank implementation in Golang with extendable features (summarization, phrase extraction) and multithreading (goroutine).☆205Updated 3 years ago
- Go client for txtai☆73Updated this week
- Golang API for spaCy with Python gRPC☆29Updated last year
- Restful, in-memory, full-text search engine☆32Updated 5 months ago
- NLP tokenizers written in Go language☆182Updated 5 months ago
- Go library for creating EPUB files☆49Updated this week
- Natural Language Processing Toolkit in Golang☆63Updated 4 years ago
- go parser for human readable dates ported from the dateparser python package☆49Updated 4 months ago
- Go bindings for HuggingFace Tokenizer☆91Updated 2 weeks ago
- A high effective golang library for parsing big-sized sitemaps and avoiding high memory usage. The sitemap parser was written on golang w…☆36Updated last year
- Pagser is a simple, extensible, configurable parse and deserialize html page to struct based on goquery and struct tags for golang crawle…☆105Updated last year
- Read and use word2vec vectors in Go☆56Updated 6 years ago
- Go module for fetching embeddings from embeddings providers☆41Updated 5 months ago
- Go library for performing computations in word2vec binary models☆194Updated 2 years ago
- SQLite FTS5-based search engine for Hugo pages☆30Updated 11 months ago
- pgvector support for Go☆188Updated last week
- Go implementation of today's most used tokenizers☆39Updated 3 years ago
- XML stream parser for GO☆105Updated 9 months ago
- Unofficial (Golang) Go bindings for the Hugging Face Inference API☆61Updated 2 weeks ago
- Html Content / Article Extractor in Golang☆439Updated 7 months ago
- A lightweight buffered event lib☆55Updated 2 years ago
- Little package to map hosts to a variety of http routers for Go API services☆69Updated 3 years ago
- A multilingual command line sentence tokenizer in Golang☆440Updated 8 months ago
- Go Bindings for BERT NLP Models☆99Updated 5 years ago
- A Go port of the Rapid Automatic Keyword Extraction algorithm (RAKE)☆117Updated 4 years ago
- package lingo provides the data structures and algorithms required for natural language processing